AI Support Assistant (RAG)

01 · The challenge

A course business was spending significant time answering repetitive questions already covered in its own documentation - support that did not scale with the audience.

02 · What I built

An AI assistant that indexes all of a business's documents into a vector database, then answers questions on demand by retrieving the most relevant passages and generating accurate, source-grounded replies through a chat interface.

03 · The hard part

Making a retrieval-augmented AI system economical and reliable. I built a seven-layer cost-optimisation strategy - deduplication, batching, multi-tier caching and strict token budgeting - that cuts AI API spend by an estimated 70%, plus consistency handling across the vector and relational stores so failed indexing never leaves orphaned data.

04 · The outcome

Per-query cost held to a fraction of a cent. Handles unlimited concurrent users around the clock. Conversation memory persists across sessions for natural follow-up.

05 · Stack

TypeScriptNode.jsvector databaseLLM embeddings & generationPostgreSQLDocker