Conversational AI: From RAG Prototypes to Domain-Specific Supervision
November 12, 2024
Rewriting a Chat API in Go for Lambda: Cold Starts and Trade-offs
The Node.js chat API worked but Lambda cold starts were slow and memory usage was high. Go fixed both, but the rewrite taught me more than I expected.
go aws llm

Conversational AI: From RAG Prototypes to Domain-Specific Supervision
November 4, 2024
Building a Tool Builder: Making AI Tools Configurable Without Code
The first API had hardcoded tools. The Slack bot needed different tools. I built a tool system where admins create, configure, and monitor tools from a dashboard.
llm tooling architecture

October 28, 2024
Getting the Most Out of Cursor IDE
After a year of daily use, here's what helps with AI-assisted development in complex projects. Context management, documentation, and knowing when to commit.
llm tooling cursor

Conversational AI: From RAG Prototypes to Domain-Specific Supervision
October 24, 2024
From Flat Vectors to Graph RAG: When Similarity Search Isn't Enough
Vector search finds similar chunks. Graph RAG finds related concepts. The difference matters when questions span multiple topics.
rag graph llm

Conversational AI: From RAG Prototypes to Domain-Specific Supervision
October 15, 2024
Chunk-Then-Summarise: The Embedding Pipeline That Worked
Raw PDF chunks make terrible vectors. Summarising each chunk before embedding produced cleaner searches and more relevant retrieval.
rag embeddings llm