AgentsFeb 22, 2026

Anthropic Tool Calling 2.0: Programmatic & Optimized.

Anthropic's new programmatic tool-calling allows models to output code instead of JSON to orchestrate multiple tools. Combined with dynamic HTML filtering for web fetch and deferred tool loading via Tool Search, it reduces token waste by up to 50% while improving reliability for complex, long-running agentic tasks.

What happened?

Anthropic released a series of major updates to their tool-calling capabilities, fundamentally changing how agents interact with external systems. The core shift is from static JSON output to dynamic execution environments.

Key features include:

Programmatic Tool Calling — instead of returning a JSON block and waiting for the server to run it, Claude can now output a block of code (TypeScript or Python) to orchestrate multiple tools, use loops, and handle conditional logic deterministically.
Dynamic Filtering for Web Fetch — a middleware layer that automatically extracts relevant content from HTML before it ever hits the context window, reducing token consumption by ~24% on average.
Tool Search (Deferred Loading) — allows agents to search through hundreds of tools dynamically rather than loading every schema into the initial prompt, saving up to 80% of the context window.
Tool Use Examples — developers can now provide few-shot examples directly in the tool definition to guide complex parameter formatting and nested JSON structures.

Why this is interesting

This is the end of the "JSON ping-pong" era for complex agents.

Deterministic Orchestration — by letting the model write code to handle tool results (e.g., "for each email ID, fetch the content and summarize"), we eliminate the non-deterministic behavior that often happens when a model has to manually repeat tool calls.
The "Context Efficiency" War — Anthropic is leaning hard into optimization. While context windows are getting larger, the effective context is still limited. By filtering noise at the fetch layer and deferring tool schemas, they are making 200k context feel like 1M.
Code as the Glue — this aligns with the "executable code actions" research. Models are naturally better at writing code to solve a logic puzzle than they are at outputting a series of JSON steps. It's moving from "chatting with a tool" to "writing a script to use tools."
Enterprise Readiness — the input_examples field for tool definitions solves a massive headache for production agents: ensuring the model actually follows complex schema constraints without needing massive system prompts.

Sources