$building.effective.agents
Menu
Last verified: April 2026
· Engineering reference · v1.0

Building Effective Agents:
an engineering reference.

An independent reference for engineers building AI agents. Patterns, frameworks, evaluation, failure modes, glossary. Every specific claim links a primary source. No first-person operator voice. No fabricated benchmarks.

Definition

Anthropic, in “Building Effective Agents” (Schluntz and Pagnoni, December 2024), distinguishes workflows from agents. A workflow follows a predefined path through code; an agent decides the path at runtime. The same paper names five composable patterns that cover most production agent designs. This site is the engineering reference that extends that paper.

The voice is third-person reference. Vendor names appear as editorial mentions, never as ranked verdicts. The seven editorial rules that govern this site are public on the methodology page.

The agent loop

Sense, think, act, observe.

Modern agent architectures share a four-step loop. The agent senses (reads input and context), thinks (decides the next action), acts (calls a tool or returns output), and observes (reads the result of that action). The loop iterates until a terminal condition is met or a budget cap fires.

The classical formulation is in Russell & Norvig, AIMA, 4th ed. The modern LLM-era formulation reaches the same shape via tool calls and reflection. See agent architecture for the expanded loop with planning, memory, and reflection.

senseread input + contextthinkdecide next actionobserveread tool resultactcall tool / outputloop until done
What this site is
  • An engineering reference, in the shape of a long-form documentation site rather than a blog.
  • Citations to primary sources: vendor docs, peer-reviewed research, public benchmarks, vendor pricing pages.
  • Custom architecture diagrams, hand-built in line-art SVG, with a consistent visual language across pages.
  • Glossary anchors (/glossary/) that other pages and sister sites can deep-link.
What this site is not
  • Not a tool review site. Per-tool category overviews appear on /frameworks/ with vendor-published trade-offs and benchmark links, not ranked verdicts.
  • Not an operator notebook. No first-person production claims. No fabricated post-mortems, incidents, or simulated logs.
  • Not a vendor in disguise. Brand mentions are editorial and fair-use, the same posture as InfoQ or DEV.to.
  • Not a moving target without dates. Every page carries a Last verified stamp; refresh discipline is on /methodology/.
Read next