AI Systems

The architecture behind modern AI products: RAG pipelines, agentic workflows, LLM inference at scale, vector databases, function calling, and AI gateways. Every concept comes with an animation and a deep dive.

🧠 Concepts

6 concepts

15 MIN

RAG Pipeline

Retrieval-augmented generation: how chatbots ground answers in retrieved documents and cite their sources, which reduces hallucination.

Vector DB Internals

HNSW, IVF, and product quantization: how vector databases run approximate nearest-neighbor search over billions of embeddings in milliseconds.

LLM Inference at Scale

KV cache, continuous batching, speculative decoding: the techniques that keep chat products like ChatGPT fast under load.

Agentic Workflows

Multi-agent orchestration: planner, executor, critic — and how they coordinate without falling over.

Function Calling & Tool Use

How LLMs decide when to call APIs, the schemas they emit, and the round-trip back to natural language.

AI Gateway

The traffic cop in front of your LLM: routing, caching, fallbacks, rate limits, observability.

🚧

More concepts coming

Embeddings · Prompt caching · Speculative decoding · KV cache · Tool routing · Guardrails · Evals · and more.