Category: Tools

Coverage of coding agents, MCP servers, sandboxing infrastructure, and the developer tool category that emerged when LLMs became practical engineering collaborators. Articles dissect Claude Code’s five-layer architecture and permission system, the .claude/ folder treated as a protocol rather than a configuration directory, OpenAI Codex with 3 million users and how its architecture differs from Claude Code, GLM-5.1’s 8-hour autonomous coding session with 6,000 tool calls on SWE-Bench Pro, Hashline’s discovery that one tool change can shift agent benchmark scores by 60 points, and the Model Context Protocol architecture behind 97 million monthly SDK downloads within 16 months of launch.

The category centers on what coding agents can actually do, where they break in production, and which architectural decisions matter when an agent runs unattended for hours. MCP coverage focuses on the protocol mechanics, the security implications of widespread tool access (MCPShield’s 23 attack vectors, ToolHijacker’s 96.7% defense bypass), and the governance frameworks emerging in regulated industries to address the agentic action layer.

The standard: every tool review includes reproducible benchmarks the author ran personally, primary source documentation from the vendor, or both. No affiliate recommendations. No vendor-supplied talking points. If a tool ships with limitations the marketing copy obscures, the article surfaces those limitations explicitly. The bias is toward open-source projects where the source code is available for verification.

WebMCP Is Not MCP: What Chrome’s modelContext Actually Ships

May 2, 2026

Chrome 146 shipped navigator.modelContext in February. It looks like MCP in a browser. It is not. Wire protocol, auth model, and security all differ.

SmolVM: Firecracker-Backed MicroVM Sandbox for AI Agent Code Execution

April 26, 2026

SmolVM gives AI agents a hardware-isolated disposable VM using Firecracker. Here’s why Docker containers are the wrong sandbox for LLM-generated code, how the snapshot-fork pattern works, and how…

Darkbloom Has 8 Security Layers, Not 4: What the Press Missed

April 18, 2026

Eigen Labs launched Darkbloom on April 15 as a decentralized inference network routing requests to idle Apple Silicon Macs. Every outlet has covered the four-layer privacy architecture. The…

GLM-5.1 Ran Autonomously for 8 Hours Across 6,000 Tool Calls. How It Beat Claude Opus 4.6 on SWE-Bench Pro and Lost on Verified.

April 13, 2026

Z.ai released GLM-5.1 open-source under MIT on April 7, 2026. The 744B-parameter MoE scored 58.4 on SWE-Bench Pro, beating Claude Opus 4.6 and GPT-5.4. It also ran 655…

Perplexity Computer Is a Productized Router on Top of Research That Has Been in the Open for Two Years. Here Is What It Actually Does.

April 9, 2026

Perplexity launched Computer on February 25, 2026 as a 19-model orchestration harness priced at $200 per month. For ML engineers, the marketing number is not the interesting part.…

MCP Hit 97 Million Installs in 16 Months. Here Is How the Protocol Actually Works Under the Hood.

March 30, 2026

The Model Context Protocol reached 97 million monthly SDK downloads in March 2026, 16 months after Anthropic introduced it. React took 3 years to reach the same milestone.…

The .claude/ Folder Is Not a Config File. It Is a Protocol. Here Is What Every Component Does and Why It Matters.

March 28, 2026

Anthropic’s Claude Code stores its entire behavior system in a .claude/ folder most developers never open. It contains layered configuration scopes, self-triggering skills, programmable hooks with a critical…

Video.js v10: How One Developer Rewrote 16 Years of Code to Be 88% Smaller

March 26, 2026

Video.js v10 beta shipped March 2026 as a ground-up rewrite combining Video.js, Plyr, Vidstack, Media Chrome, and Mux Player into one project. The result: 88% smaller bundles, first-class…