REVIEWS

Engineering reviews

Engineering reviews of every tool, framework, and stack we run, have run, or have evaluated in earnest. Each review is grounded in production observation. Each is updated quarterly. Each names what is broken alongside what works.

By Oliver Wakefield-Smith, Digital Signet

Last verified April 2026

How we review: see our methodology. The taxonomy below mirrors the structure of our pipeline: coding agents (the build layer), frameworks (the orchestration layer), and autonomous & enterprise (the autonomy layer plus procurement-vertical bridges).

CODING AGENTS

Claude Code

Strongest model-grade coding agent we have used. Most expensive when unsupervised.

Cursor

Composer is the best multi-file editing experience in 2026. Tab completion is the other reason we keep it.

GitHub Copilot Agent

Issue-to-PR mode is competent. PR creation is reliable on small issues, less so on medium-sized ones.

Devin

Task-priced, sandbox-driven. P95 cost-per-task is where the economics get sharp.

Bolt.new

Strong demo-to-deploy speed. Failure modes are post-prototype.

Lovable

Cursor-alternative for non-engineers. The non-engineer claim is overstated.

Replit Agent

Hosted-deploy context is the strength. For local-deploy contexts, we use Claude Code instead.

v0.dev

UI-generation specialist. We use it as a sub-tool inside a larger pipeline.

FRAMEWORKS

LangChain

Reputation has shifted since 2024. The right choice when LangGraph would be overkill.

LangGraph

What we use in production. Linear scaling and an explicit graph; the operator credential matters here.

CrewAI

Fastest path to a working role-based prototype. Hits a ceiling around 5 concurrent agents.

AutoGen

Conversation pattern is expressive. The cost economics of that pattern are the catch.

Open-Source Round-Up

AutoGPT, MetaGPT, Pydantic AI, DSPy, smolagents. One review per framework, summary tier.

AUTONOMOUS & ENTERPRISE

OpenClaw

+9,999,900% YoY signal. Architecture analysis, security stress-test, the rebrand history.

Manus

Full virtual computer. Real on research and scraping. Not real on app-building.

Suna / Kortix

Open-source and self-hostable. Strong trend signal; deployment cost is the catch.

OpenAI Operator

Browser-use class. Production risks are real. Includes a sub-section on Anthropic Computer Use.

Perplexity Comet

Computer-use agent in a research workflow. Where it shines and where it does not.

Copilot Studio

Enterprise low-code. What the no-code claim costs you in flexibility.

Agentforce

Salesforce stack. Light editorial; procurement-grade detail at the vertical sites.

Watsonx Orchestrate

IBM enterprise agent stack. Independent technical review.

ABOUT THE AUTHOR

Oliver Wakefield-Smith

Founder, Digital Signet

Oliver runs Digital Signet, a research and product studio that operates ~500 production sites with AI agents as the engineering layer. The Digital Signet portfolio, one of the largest agent-operated publishing operations on the open web, is built using a continuous AI-agent build pipeline. The handbook draws directly from those deployments: real cost data, real failure modes, real recovery patterns.

oliver@digitalsignet.com | About this site | Digital Signet