$building.effective.agents
Menu
Last verified: April 2026
· Glossary

AI agent engineering glossary.

A glossary of 36 terms used across this reference. Each entry has an anchor URL: link to /glossary/#term-id from anywhere on the web.

Agent

A software system that uses an LLM to pursue a goal by perceiving its environment, deciding what to do next, taking actions through tools, observing the result, and iterating until the goal is reached or it gives up. Russell & Norvig's broader definition (anything with sensors and actuators) is the classical reference.

Benchmark

A standardised dataset or environment with scoring rules. Public agent benchmarks include AgentBench, SWE-Bench, GAIA, ToolBench, HELM.

Budget cap

A hard limit on iterations or token usage that aborts an agent that gets stuck. Without a cap, the agent runs unbounded.

Classifier

A function that maps inputs to discrete labels. In the routing pattern, a classifier picks which handler the input goes to.

Confidence gate

A threshold check on a model's confidence in its own output. Below the threshold, the agent escalates or falls through to a different handler. Used as a mitigation for routing mis-classification.

Context window

The maximum number of tokens an LLM can process in one call. Modern flagship models support 100K-2M tokens depending on vendor and tier.

Coordinator

In a multi-agent system, the agent that owns scheduling and dispatch. Synonymous with orchestrator and supervisor in the relevant framework docs.

Drift

In a prompt chain, the cumulative effect of small per-step errors that compound until the final output is unrecognisable from the desired result. Mitigated by gates between steps.

Function calling

The vendor-specific term for tool use, used by OpenAI, Google, and others. The model emits a structured JSON request to invoke a named function. Equivalent to Anthropic's tool use.

Gate

A deterministic check between LLM calls in a chain that either passes the result through or short-circuits the chain. Used to fail fast on malformed inputs.

Hallucination

Output that is fluent and plausible but not grounded in fact. In agents, the high-stakes form is hallucinated tool calls: the model invents a tool name that does not exist, or fabricates an argument that the tool cannot accept.

Latency

End-to-end wall-clock time. For an agent, the latency budget is usually larger than the per-call latency because the loop runs multiple times.

Message bus

A publish-subscribe communication channel between agents. Agents publish events; other agents subscribe to the events they care about.

Multi-agent system

A system in which two or more LLM-based agents collaborate, usually under a coordinator. Most production multi-agent systems are an instance of the orchestrator-worker pattern.

Orchestrator-worker

A pattern in which a central LLM plans, dispatches subtasks to worker LLMs, and synthesises their results. The most expensive of the five patterns; benefits from worker caps.

Parallelization

A pattern in which an input is fanned out to N independent calls and the results are aggregated. Two flavours: sectioning (sub-tasks) and voting (the same task multiple times).

Planner

The role within an agent or multi-agent system that decides the sequence of actions. May be an explicit prompt, a separate LLM call, or implicit in the model's decision step.

Prompt caching

A vendor-side optimisation that caches the result of computation on a shared prompt prefix. Reduces cost on repeated calls that share a common prefix.

Prompt chaining

A pattern in which LLM calls are arranged as a linear sequence. Each step's output is the next step's input. The simplest of the five patterns.

Prompt injection

An attack in which adversarial instructions are placed in data the agent reads, so the agent treats them as instructions. Direct: in user input. Indirect: in tool output.

ReAct

Reasoning + Acting: a prompting technique in which the model alternates between thoughts (chain-of-thought reasoning) and actions (tool calls). Yao et al., 2023.

Reflection

An explicit critique step in the agent loop where the model reviews its own prior actions and revises the plan. The evaluator-optimizer pattern formalises reflection as a two-role loop.

Reliability

The proportion of runs of the same task that succeed. For agents, reliability is more decision-relevant than peak capability because the consequence of unreliability is usually retry cost.

Routing

A pattern in which a classifier picks one of N specialised handlers based on the input. Adds a small classification cost per input and saves cost when most inputs route to a cheaper handler.

Tool call

A single invocation of a tool by an agent. Includes the tool name and arguments emitted by the model. Equivalent to a function call in OpenAI/Google terminology.

Tool use

The architectural difference between an agent and a standalone LLM. The agent calls external functions, reads their output, and decides what to do next.

Worker

In an orchestrator-worker or multi-agent system, an agent that executes a subtask dispatched by the coordinator.

Workflow

Anthropic's distinction: a workflow follows a predefined path through code; an agent decides the path at runtime. Workflows fail predictably; agents fail in unanticipated ways.

Read next