Home
Journey
Projects
Writing
Reading
Setup
Contact
Noise Clouds
3D Clouds
3D Flock
3D Valley
3D Waves
Tesseract
Tagged: ollama
1 article
Conversational AI: From RAG Prototypes to Domain-Specific Supervision
December 12, 2024
The Hidden Cost of Embedding
OpenAI's embedding API charges per token across ingestion, re-ingestion, and every query. Switching to a local Ollama model eliminated the recurring cost with comparable retrieval quality.
rag
embeddings
llm
ollama
Read article
All articles