Last verified: April 2026

· Glossary

AI agent engineering glossary.

A glossary of 36 terms used across this reference. Each entry has an anchor URL: link to /glossary/#term-id from anywhere on the web.

Agent

A software system that uses an LLM to pursue a goal by perceiving its environment, deciding what to do next, taking actions through tools, observing the result, and iterating until the goal is reached or it gives up. Russell & Norvig's broader definition (anything with sensors and actuators) is the classical reference.

whatisanaiagent.com→

Agent loop

The four-step execution cycle: sense, think, act, observe. Iterates until a terminal condition or budget cap.

Agent architecture→

Agentic

An informal adjective that means “has agent-like properties.” Useful as shorthand; not a formal taxonomy term.

whatisanaiagent.com→

Benchmark

A standardised dataset or environment with scoring rules. Public agent benchmarks include AgentBench, SWE-Bench, GAIA, ToolBench, HELM.

Evaluating an agent→

Budget cap

A hard limit on iterations or token usage that aborts an agent that gets stuck. Without a cap, the agent runs unbounded.

Classifier

A function that maps inputs to discrete labels. In the routing pattern, a classifier picks which handler the input goes to.

Routing→

Confidence gate

A threshold check on a model's confidence in its own output. Below the threshold, the agent escalates or falls through to a different handler. Used as a mitigation for routing mis-classification.

Routing→

Context window

The maximum number of tokens an LLM can process in one call. Modern flagship models support 100K-2M tokens depending on vendor and tier.

Coordinator

In a multi-agent system, the agent that owns scheduling and dispatch. Synonymous with orchestrator and supervisor in the relevant framework docs.

Multi-agent systems→

Drift

In a prompt chain, the cumulative effect of small per-step errors that compound until the final output is unrecognisable from the desired result. Mitigated by gates between steps.

Prompt chaining→

Evaluator-optimizer

A pattern in which a generator proposes a candidate, an evaluator critiques it, and the loop repeats until acceptance or a budget cap. Subsumes “LLM as judge.”

Evaluator-optimizer→

Fan-out / fan-in

Fan-out: dispatching the same input to N parallel calls. Fan-in: aggregating the N results back into a single output.

Parallelization→

Function calling

The vendor-specific term for tool use, used by OpenAI, Google, and others. The model emits a structured JSON request to invoke a named function. Equivalent to Anthropic's tool use.

Gate

A deterministic check between LLM calls in a chain that either passes the result through or short-circuits the chain. Used to fail fast on malformed inputs.

Prompt chaining→

Hallucination

Output that is fluent and plausible but not grounded in fact. In agents, the high-stakes form is hallucinated tool calls: the model invents a tool name that does not exist, or fabricates an argument that the tool cannot accept.

Failure modes→

Latency

End-to-end wall-clock time. For an agent, the latency budget is usually larger than the per-call latency because the loop runs multiple times.

LLM as judge

Using an LLM to evaluate another LLM's output against a rubric. The evaluator role in the evaluator-optimizer pattern.

Evaluator-optimizer→

MCP (Model Context Protocol)

A vendor-agnostic protocol for exposing tools, resources, and prompts to LLMs. Introduced by Anthropic and adopted by other vendors.

modelcontextprotocol.io→

Message bus

A publish-subscribe communication channel between agents. Agents publish events; other agents subscribe to the events they care about.

Multi-agent systems→

Multi-agent system

A system in which two or more LLM-based agents collaborate, usually under a coordinator. Most production multi-agent systems are an instance of the orchestrator-worker pattern.

Multi-agent systems→

Orchestrator-worker

A pattern in which a central LLM plans, dispatches subtasks to worker LLMs, and synthesises their results. The most expensive of the five patterns; benefits from worker caps.

Orchestrator-worker→

Parallelization

A pattern in which an input is fanned out to N independent calls and the results are aggregated. Two flavours: sectioning (sub-tasks) and voting (the same task multiple times).

Parallelization→

Planner

The role within an agent or multi-agent system that decides the sequence of actions. May be an explicit prompt, a separate LLM call, or implicit in the model's decision step.

Prompt caching

A vendor-side optimisation that caches the result of computation on a shared prompt prefix. Reduces cost on repeated calls that share a common prefix.

Prompt chaining

A pattern in which LLM calls are arranged as a linear sequence. Each step's output is the next step's input. The simplest of the five patterns.

Prompt chaining→

Prompt injection

An attack in which adversarial instructions are placed in data the agent reads, so the agent treats them as instructions. Direct: in user input. Indirect: in tool output.

Failure modes→

ReAct

Reasoning + Acting: a prompting technique in which the model alternates between thoughts (chain-of-thought reasoning) and actions (tool calls). Yao et al., 2023.

Yao et al. (2023)→

Reflection

An explicit critique step in the agent loop where the model reviews its own prior actions and revises the plan. The evaluator-optimizer pattern formalises reflection as a two-role loop.

Reliability

The proportion of runs of the same task that succeed. For agents, reliability is more decision-relevant than peak capability because the consequence of unreliability is usually retry cost.

Routing

A pattern in which a classifier picks one of N specialised handlers based on the input. Adds a small classification cost per input and saves cost when most inputs route to a cheaper handler.

Routing→

Self-consistency

A voting variant of parallelization: the same prompt is sampled N times, and the modal answer is selected. Wang et al., 2022.

Wang et al. (2022)→

Self-refine

An evaluator-optimizer variant in which the same model plays both generator and evaluator. Madaan et al., 2023.

Madaan et al. (2023)→

Tool call

A single invocation of a tool by an agent. Includes the tool name and arguments emitted by the model. Equivalent to a function call in OpenAI/Google terminology.

Tool use

The architectural difference between an agent and a standalone LLM. The agent calls external functions, reads their output, and decides what to do next.

Agent architecture→

Worker

In an orchestrator-worker or multi-agent system, an agent that executes a subtask dispatched by the coordinator.

Orchestrator-worker→

Workflow

Anthropic's distinction: a workflow follows a predefined path through code; an agent decides the path at runtime. Workflows fail predictably; agents fail in unanticipated ways.