DEMO: Building a Toy RAG in a Weekend

DEMO CONTENT — delete or replace this file when adding real blog posts.

Weekend experiment recap (placeholder):

Goal: wire up a minimal RAG that answers FAQs for a fake product.
Data prep: chunked 25 markdown docs, built embeddings, stored in a local vector DB.
Retriever: top-k = 4, rerank with cosine similarity; naive but fine for demo.
LLM prompts: system message enforced tone + sources; added guardrail to refuse hallucinations.
Eval: scripted 20 Q&A pairs, measured hit rate and latency. Average latency 1.3s on laptop.
Next: add metadata filters, swap reranker, push to a serverless endpoint.