Hey everyone, exciting news from my end! I’ve recently made a big career move and joined forces with a new AI startup. My journey in the AI space has always been driven by a passion for pushing the bo
I've been on a bit of a journey optimizing performance with large language models, particularly with Qwen3.6 35B. My setup includes an NVIDIA RTX 4070 Super 12GB, AMD Ryzen 7 9700X, and 48GB of DDR5-6
Hey everyone, I wanted to share my recent experiences dealing with Large Language Models (LLMs) and keeping the API costs under control. I’ve been working primarily with OpenAI's GPT-4 and Google’s P
Hey fellow developers, I wanted to share my recent dive into managing costs for AI/LLM projects while setting up our startup. We initially started using OpenAI's GPT-3.5-turbo for our NLP needs. It's
Hey folks, I recently came across some interesting developments from Anthropic regarding their infrastructure migration. They're transitioning to the Colossus2 platform and upgrading their computatio
Hey fellow AI enthusiasts and developers! Let's make this thread a monthly ritual where we can connect talent with opportunities in our field. If you're hiring or looking for a new role, feel free to s
Hey Fellow Developers, I've recently been diving deep into the world of LLMs and have to admit, the number of choices out there is staggering. As I began to plan the integration of GPT-4 models into
Hey everyone! I recently came across an intriguing project involving a collaboration between high-level tech minds and religious leaders. Specifically, one of the co-founders of Anthropic is teaming u
Calling all AI enthusiasts and experts! This thread is your go-to place for sharing your latest AI projects, startups, and collaboration opportunities. Whether you’ve developed a cutting-edge NLP mode
Hi all, I'm Alex, part of a small team dedicated to enhancing the accessibility of AI research. With the original Papers With Code no longer receiving updates after its acquisition, we decided to ta
Hey everyone! I wanted to share a fascinating experience with GPT-4 that could be particularly interesting for those of you involved in the application of AI in mathematical research. Recently, while
I thought it would be beneficial to have a dedicated thread for sharing AI/ML job openings and job seekers. Here's how we can structure our posts to keep it organized: **For Employers:** - **Location
Hey everyone, Just thought I'd drop in to share a cool optimization I recently implemented while working with the llama.cpp model. If you've been using the Llama architecture, you're probably aware t
I had a bit of an adrenaline spike today when my language model issued its first 'rm -rf /' command. It was during a test phase to ensure command blocking was effective, and while my heart skipped a b
Hey folks, I'm currently working on deploying a custom Large Language Model for my company, and I need some insights on how to manage it efficiently, particularly in terms of cost. We're using OpenAI'
Hey folks, I'd like to share a personal update with all of you who are into AI and LLM development. After spending a few years at OpenAI, I've recently made the transition to work with Cohere. This de
Hey everyone! This is a dedicated space for sharing your AI-driven projects, open-source tools, startups, or any exciting collaborations you're working on. We want to hear about your applications, API
Hey fellow AI enthusiasts! Whether you're developing cutting-edge machine learning models, building innovative startups, or crafting useful tools, here's your space to share and connect. Feel free to
Hey folks, I've recently started juggling multiple LLM providers for different requirements (like using GPT-4 for conversational agents and Cohere for embeddings). Managing the growing costs is start
Hey everyone, I've recently been tasked with optimizing our team's expenditures on LLM APIs. We're leveraging several different providers like OpenAI, Anthropic, and Cohere, and managing costs while
Hey folks, I've been diving deep into the RAG (Retrieval-Augmented Generation) pipelines, and I wanted to discuss costs related to each part of the setup: generating embeddings, storing/accessing a ve
Hey folks, I wanted to share an interesting development in AI accessibility. Recently, the Maltese government announced a partnership with OpenAI to provide ChatGPT Plus to their entire population. Th
I've been diving into various LLM platforms recently and encountered a significant hurdle with fragmentation. Currently, I am trying to leverage models like GPT-3.5 from OpenAI for some NLP tasks, alo
Last month, I embarked on the adventure of implementing a chatbot using OpenAI's GPT-4, but I quickly ran into a significant roadblock: cost. I was initially attracted by the power of the model, havin
I've been working on a project that requires processing a hefty volume of legal documents, and I recently made the switch to using the Falcon-40B model to streamline my workflow. Previously, I experim
Discuss AI cost optimization, share architecture patterns, and connect with developers building with LLMs.