I have been building a lot of AI apps - https://github.com/a16z-infra/ai-town https://github.com/a16z-infra/companion-app ...
And there were definitely times I spent way too much $$ before deploying the app to production. So I was looking for a "local only" stack and found a few tools that worked well together. I used the following set of tools but may add more options later:
- Inference: Ollama
- VectorDB: Supabase pg-vector
- LLM orchestration: langchain
- Framework: nextjs
My learning from building ai-town was also that many people don't have the luxury of swiping a credit card to test an app with a lot of AI workloads -- not surprisingly, many models on Ollama are already very good - I used llama2-7b for majority of my dev work.
Hope this can lower the barrier to entry for building AI apps. :)