AGI
CoreArtificial General Intelligence β AI that can perform any intellectual task a human can. Not yet achieved.
Agent
CoreAn AI system that autonomously takes actions, uses tools, and makes decisions to complete multi-step goals.
API
TechnicalApplication Programming Interface β a way for software to communicate. AI APIs let you send prompts and receive responses.
Attention Mechanism
TechnicalThe neural network component that allows models to focus on relevant parts of input β the core of Transformers.
Autoregressive
TechnicalA model that generates output one token at a time, each token conditioned on all previous tokens.
Benchmark
EvaluationA standardized test used to compare AI model performance (e.g. MMLU, HumanEval, HellaSwag).
CLIP
ModelContrastive Language-Image Pre-training β OpenAI model connecting images and text, used in many image AI tools.
Chain-of-Thought
PromptingA prompting technique where you ask the model to reason step-by-step before giving the final answer.
Context Window
CoreThe maximum amount of text (tokens) an LLM can process at once. Larger = more text it can "remember" per conversation.
Constitutional AI
TrainingAnthropic's method of training AI to be helpful, harmless, and honest using a set of principles.
Diffusion Model
ModelA type of generative AI that creates images by learning to reverse a noise-adding process. Powers Stable Diffusion, DALL-E.
DALL-E
ModelOpenAI's image generation model. DALL-E 3 is integrated into ChatGPT Plus.
Embedding
TechnicalA numerical vector representation of text/images that captures semantic meaning. Used in search and RAG systems.
Emergent Behavior
CoreAbilities that appear in large models that were not explicitly trained β like multi-step reasoning, coding, or translation.
Few-Shot Learning
PromptingProviding a few examples in the prompt to guide the model's output style or format without retraining.
Fine-tuning
TrainingFurther training a pre-trained model on a specific dataset to specialize it for a particular task or style.
Foundation Model
CoreA large AI model trained on broad data that can be adapted for many tasks (GPT-4, Claude, Gemini, Llama).
GAN
ModelGenerative Adversarial Network β two networks (generator vs discriminator) competing to create realistic outputs.
GPT
ModelGenerative Pre-trained Transformer β OpenAI's architecture. GPT-4 powers ChatGPT.
Hallucination
CoreWhen an AI confidently generates false information. A key challenge with current LLMs.
Inference
TechnicalThe process of running a trained model to generate outputs. Opposite of training.
Instruct Model
ModelA model fine-tuned to follow instructions rather than just predict the next token (e.g. GPT-4-turbo-instruct).
Knowledge Cutoff
CoreThe date after which an LLM has no training data. Events after this date are unknown to the model.
Latent Space
TechnicalThe compressed mathematical space where AI models represent learned concepts and relationships.
LLM
CoreLarge Language Model β an AI trained on massive text datasets to understand and generate human language.
LoRA
TrainingLow-Rank Adaptation β efficient fine-tuning method that adds small trainable matrices without modifying the full model.
MoE
ModelMixture of Experts β architecture where only a subset of model parameters activate per token, enabling larger models cheaply.
Multimodal
CoreAI that processes multiple types of data β text + images + audio + video (e.g. GPT-4o, Gemini Ultra).
Model Weights
TechnicalThe numerical parameters of a trained neural network. "Open weights" means you can download and run the model.
Neural Network
TechnicalLayers of interconnected mathematical nodes inspired by the brain. The foundation of modern AI.
NLP
CoreNatural Language Processing β AI field focused on understanding and generating human language.
One-Shot Learning
PromptingProviding exactly one example in a prompt to guide model output.
Perplexity
EvaluationA measure of how well a language model predicts a text sample. Lower perplexity = better model.
Prompt
CoreThe input text you give to an AI model. Quality of output heavily depends on prompt quality.
Prompt Engineering
PromptingThe practice of crafting prompts to get optimal outputs from AI models.
Quantization
TechnicalReducing model precision (e.g. 16-bit to 4-bit) to run large models on consumer hardware with minimal quality loss.
RLHF
TrainingReinforcement Learning from Human Feedback β training technique where humans rate outputs to guide model behavior.
RAG
CoreRetrieval-Augmented Generation β combining an LLM with external knowledge retrieval to reduce hallucinations.
Safety Alignment
CoreTechniques to make AI behave safely and according to human values (RLHF, Constitutional AI, etc.).
Semantic Search
TechnicalSearch using meaning/embeddings rather than exact keyword matching. Enables much better relevance.
System Prompt
PromptingInstructions given to an LLM before the user message β sets behavior, persona, and constraints.
Stable Diffusion
ModelOpen-source image generation model by Stability AI. Runs locally, highly customizable.
Temperature
TechnicalA parameter controlling output randomness. Low (0.1) = focused/deterministic. High (1.0) = creative/random.
Token
CoreThe basic unit LLMs process β roughly 4 characters or 0.75 words. Pricing is usually per 1000 tokens.
Transformer
ModelThe neural network architecture (from "Attention Is All You Need" 2017) powering most modern LLMs.
Tokenizer
TechnicalThe component that converts text to tokens before feeding to an LLM. Different models use different tokenizers.
Vector Database
TechnicalA database optimized for storing and querying embeddings. Powers RAG systems (Pinecone, Weaviate, Chroma).
Vision Language Model
ModelA model that understands both images and text β can describe images, answer visual questions, read documents.
Whisper
ModelOpenAI's open-source speech-to-text model. Powers many transcription tools.
Zero-Shot
PromptingAsking a model to perform a task with no examples in the prompt β just instructions.