Tagged: embeddings
2 articles
The Hidden Cost of Embedding OpenAI's embedding API charges per token across ingestion, re-ingestion, and every query. Switching to a local Ollama model eliminated the recurring cost with comparable retrieval quality. Read article Chunk-Then-Summarise: The Embedding Pipeline That Worked Raw PDF chunks make terrible vectors. Summarising each chunk before embedding produced cleaner searches and more relevant retrieval. Read article