L0 Chapter 11 🥚 🕒 14 min

AI Glossary: 30 Terms for Non-Technical Readers

Token, Embedding, Transformer, RAG, Fine-tuning… the jargon flying around chats. Here's the cheat sheet.

HelloAI Editors

6/6/2026

You’ve read this far—you can use AI and follow the discussions. But AI jargon density is high. This is a 90%-coverage glossary, grouped by theme.

Each term: one-line definition + one-line analogy. After this, you can follow any AI discussion.

I. Foundational Concepts (5)

1. AI (Artificial Intelligence) The umbrella term for making machines exhibit “intelligent” behavior. Analogy: this is an aspirational word, not a specific tech.

2. ML (Machine Learning) Letting machines “figure rules out” from data, instead of humans writing rules. Analogy: show a dog 100 cat photos saying “this is cat”—the dog learns. That’s ML.

3. DL (Deep Learning) A type of ML using multi-layer neural networks. Analogy: ML is the big set; DL is the “heavy-hitter” subset.

4. LLM (Large Language Model) Deep learning model specialized in text, with many parameters (typically 1B+). Analogy: ChatGPT, Claude are LLMs.

5. AGI (Artificial General Intelligence) AI matching or surpassing human ability across most tasks. Doesn’t exist yet. Analogy: current AI is “specialist”; AGI is “generalist.”

II. Model Concepts (6)

6. Parameters The “tunable knobs” inside a model. “7B model” = 7 billion parameters. Analogy: each parameter is a dial; training = adjusting dials.

7. Token The smallest unit a model processes—could be a character, word, or part of a word. Analogy: “Hello” might be 1 token; “你好” might be 2-3 tokens; billing is per-token.

8. Context Window Max tokens the model can “see” at once. Analogy: model’s “short-term memory.” Exceed it, it forgets earlier content.

9. Embedding Converting a word/sentence/document to a vector where semantically similar things are close. Analogy: put each word in a high-dimensional space—“king” and “queen” are near; “king” and “banana” are far.

10. Transformer The 2017 neural network architecture; the foundation of all modern large models. Analogy: the engine. ChatGPT, Claude, Gemini are different “cars” sharing the Transformer engine.

11. Attention (Mechanism) Transformer’s core: each token “looks back” at others, weighing by relevance. Analogy: reading “the cat chased the ball because it was curious”—your eye darts back to “cat” to figure out “it.” That’s attention.

III. Training Pipeline (5)

12. Pre-training Training a base model on massive internet text (hundreds of billions of tokens). Analogy: making a child read the entire library. GPT-3’s pretraining cost ~$4.6M.

13. Fine-tuning Continuing training on top of a pretrained model, with domain-specific data. Analogy: the same child reads medical textbooks for a week, becoming a medical assistant.

14. SFT (Supervised Fine-Tuning) Using “question-answer” pairs to teach the model to follow instructions. Analogy: show the child 1,000 “good question-answer examples,” let them learn “how should I respond.”

15. RLHF (Reinforcement Learning from Human Feedback) Have human raters score multiple AI answers; the model learns “what humans prefer.” Analogy: ChatGPT’s “polite, warm, detailed” responses come from RLHF.

16. LoRA (Low-Rank Adaptation) A lightweight fine-tuning method—only updates a small subset of parameters, saving 90% compute. Analogy: don’t rebuild the body; just add patches at the joints.

IV. Inference & Use (6)

17. Inference The model generating an answer (vs training). Analogy: training = “learning”; inference = “exam.”

18. Prompt Your instruction to the AI. Analogy: like briefing a new employee—clear briefing = good work.

19. Prompt Engineering Crafting prompts to get more precise AI output. Analogy: the art of efficiently “directing” AI. See L0-05.

20. CoT (Chain of Thought) Have the AI think step-by-step before answering. Analogy: tell it “reason step by step”—its accuracy immediately improves.

21. Few-shot Include several examples in the prompt for AI to imitate. Analogy: “write in the style of these 3 examples”—more effective than “be lively.”

22. Hallucination AI confidently saying wrong things, making things up. Analogy: an amnesiac storyteller—it thinks it remembers, but it’s guessing.

V. Augmentation & Extension (5)

23. RAG (Retrieval-Augmented Generation) First retrieve relevant docs from a knowledge base, then let the AI answer based on the docs. Analogy: at exam time, instead of testing from memory, give the AI a textbook to flip through—accuracy soars.

24. Agent An AI system that can plan, call tools, and act over multiple steps autonomously. Analogy: from “an assistant that answers one question at a time” to “an intern who can schedule meetings, book flights, write emails.”

25. Tool Use AI calling external tools mid-conversation—calculator, web search, APIs. Analogy: can’t compute exactly? Let AI open the “calculator app.”

26. MCP (Model Context Protocol) Anthropic’s standard protocol for AI to safely connect to external tools and data. Analogy: think of it as “AI world’s USB port.”

27. Multimodal AI handling multiple input types simultaneously—text, image, audio, video. Analogy: AI that can “see, hear, and read”—no longer just reading words.

VI. Performance & Systems (4)

28. Quantization Reducing parameter precision (e.g., 32-bit float → 4-bit integer)—smaller model, faster inference. Analogy: compressing a HD photo to JPEG—slight quality loss, much less space.

29. KV Cache Model caching intermediate results during generation, avoiding recomputation. Long conversations get slower because the cache grows. Analogy: keeping your past notes spread on the table during a presentation, instead of mentally recalling everything.

30. Open-source vs Closed

Open: model weights public; anyone can download, modify, deploy locally. (Llama, Mistral, DeepSeek, Qwen)
Closed: only via API. (GPT-4, Claude, Gemini) Analogy: open-source is Linux (free but DIY); closed is Windows (polished but locked).

A “See It, Understand It” Cheat Sheet

The next time you see these combinations:

Jargon	Translation
”This model has 70B parameters"	"Its brain has 70 billion knobs"
"200k context"	"It can read about 150K Chinese characters at once"
"We use RAG"	"We gave the AI a knowledge base"
"It hallucinates"	"It makes things up confidently"
"LoRA fine-tuning"	"Light training on a small dataset"
"Tool use integrated"	"AI can now call other software itself"
"Token cost $0.003"	"A few cents per sentence"
"MoE architecture"	"Multiple ‘experts’ inside, only some active each call"
"Inference acceleration"	"Make AI answer faster"
"Embedding-based retrieval"	"Search docs by semantic similarity”

💡 To become an insider

After reading L0, you can follow most everyday AI talk. To get “insider-deep,” try the L3 path—it covers the math and code behind these terms.

This Is Just the Start

L1-L7 paths use lots of these terms. Bookmark this—come back when stuck.

Next: “L0 Graduation: What’s Next?”—we look back, and look forward.