Architecture patterns
How real AI systems are built. These are the recurring shapes — knowing them gives you a vocabulary to read papers, evaluate vendors, and design your own systems.
Prompt engineering
Writing clear, specific instructions with structure, examples, and a defined output format. The cheapest, highest-leverage skill.
Few-shot prompting
Include 2–5 input/output examples in your prompt to teach the model the pattern you want.
RAG (Retrieval-Augmented Generation)
Pull relevant chunks from your own data into the prompt at query time so the model can answer with grounded information.
Tool use / function calling
Give the model a list of tools it can invoke. It decides when to call them and how to use the results.
Agents
A loop: the model plans, takes an action with a tool, observes the result, and tries again until done.
Multi-agent systems
Specialized agents collaborate — a planner, a coder, a critic — coordinated by an orchestrator.
Fine-tuning
Train a model further on your data so it adopts your style, format, or domain knowledge by default.
Evals
Automated tests for LLM output. Without them, you have no way to know if a prompt change helped or hurt.
Guardrails
Input and output filters that catch unsafe, off-topic, or malformed responses before they reach users.
Context engineering
Deliberately managing what's in the model's working memory — what to include, what to summarize, what to drop.
Prompt caching
Cache long, stable prompt prefixes so repeated calls reuse the work. Big cost and latency wins.
Streaming
Stream tokens to the user as they're generated. Perceived latency drops dramatically.