DEMO: Building a Toy RAG in a Weekend
Placeholder write-up for a tiny RAG experiment with cached embeddings and a scripted evaluator.
DEMO CONTENT — delete or replace this file when adding real blog posts.
Weekend experiment recap (placeholder):
- Goal: wire up a minimal RAG that answers FAQs for a fake product.
- Data prep: chunked 25 markdown docs, built embeddings, stored in a local vector DB.
- Retriever: top-k = 4, rerank with cosine similarity; naive but fine for demo.
- LLM prompts: system message enforced tone + sources; added guardrail to refuse hallucinations.
- Eval: scripted 20 Q&A pairs, measured hit rate and latency. Average latency 1.3s on laptop.
- Next: add metadata filters, swap reranker, push to a serverless endpoint.
Tags: RAGLangChainVector DB