CLAUDE LABJP
SWE-BENCH — Claude Opus 4.8 scores 69.2% on SWE-Bench Pro, topping GPT-5.5 and Gemini 3.1 Pro (May)TOKYO — Code with Claude heads to Tokyo on June 10, its first Asia stop after SF and London (Jun)LIMITS — Claude Code raises weekly limits by 50% for all Pro/Max/Team/Enterprise users through July 13 (Jun)EFFORT — claude.ai now lets users control how much effort Claude puts into a task (Jun)SPEED — Opus 4.8's fast mode runs 2.5x faster at the same price as Opus 4.7 (May)WORKFLOW — Claude Code's Dynamic Workflows distribute work across hundreds of parallel subagents (May)SWE-BENCH — Claude Opus 4.8 scores 69.2% on SWE-Bench Pro, topping GPT-5.5 and Gemini 3.1 Pro (May)TOKYO — Code with Claude heads to Tokyo on June 10, its first Asia stop after SF and London (Jun)LIMITS — Claude Code raises weekly limits by 50% for all Pro/Max/Team/Enterprise users through July 13 (Jun)EFFORT — claude.ai now lets users control how much effort Claude puts into a task (Jun)SPEED — Opus 4.8's fast mode runs 2.5x faster at the same price as Opus 4.7 (May)WORKFLOW — Claude Code's Dynamic Workflows distribute work across hundreds of parallel subagents (May)
ARTICLES

All Articles

All (1013) Claude AI (278) Claude Code (385) Cowork (81) API & SDK (269)
API & SDK/2026-04-24Advanced

Giving Claude Agents Long-Term Memory in Production — Seven Pitfalls and the Patterns That Fix Them

A production playbook for Claude agents with long-term memory — seven pitfalls that break memory agents live, and the design patterns that fix each one.

API & SDK/2026-04-24Advanced

Shadow Mode with Claude Agent SDK — Measuring Agent Accuracy on Live Traffic Without Touching Users

You want to ship an AI agent to production, but you can't measure its real accuracy without exposing real users. Shadow mode solves that paradox. This guide shows how to run a Claude Agent SDK agent alongside your existing workflow, log the deltas, and promote it step by step.

API & SDK/2026-04-23Advanced

High-Availability Patterns for the Claude API — Making Sonnet/Haiku/Opus Fallback Work in Production

A single-model Claude API integration will fall over the first time rate limits or a regional hiccup land at peak hours. This is the production pattern for a Sonnet → Opus → Haiku fallback chain, with circuit breakers, streaming coverage, and the pitfalls you only learn the hard way.

API & SDK/2026-04-23Intermediate

Using tool_choice to Cut Wasted Inference: Four Modes and Cost Patterns for Production

tool_choice is one of the most underused parameters in the Claude API. The four modes — auto, any, tool, and none — each change both behavior and token cost. Here are the patterns I reach for in production, with runnable code.

API & SDK/2026-04-23Advanced

Running Claude API Parallel Tool Use in Production — Controlling Concurrency, Designing for Partial Failure, and Cutting Latency

Claude API's parallel tool use can cut agent latency in half — but partial failures and state conflicts show up fast in production. Here's how to control concurrency, design error handling, and add observability.

API & SDK/2026-04-23Advanced

Production Prompt-Injection Defense for the Claude API — Detection, Sanitization, and Layered Guardrails

A practical, code-first design guide for defending Claude API applications against prompt injection — covering input sanitization, channel separation, output validation, and red-teaming for long-term safety.

API & SDK/2026-04-22Advanced

Implementing Progressive Delivery with the Claude Agent SDK: Canary, Feature Flags, and Automatic Rollback Patterns for Production

Production-grade patterns for safely rolling out AI agents built with the Claude Agent SDK. Combines canary traffic splitting, feature flags, and SLO-driven automatic rollback with runnable TypeScript/Hono implementation code.

API & SDK/2026-04-22Advanced

Production-Grade Resilience Patterns for Claude API Streaming

Streaming with the Claude API looks easy until you run it in production. This is a battle-tested collection of patterns — disconnection recovery, deduplication, partial tool_use handling — with code you can drop into your codebase today.

API & SDK/2026-04-22Advanced

Building Fault-Tolerant Long-Running AI Workflows with Claude Agent SDK × Temporal.io — A Production Design Guide to Durable Execution and Saga Patterns

A complete production guide to combining Claude Agent SDK with Temporal.io to build AI workflows that survive crashes, restarts, and multi-day human approval gates. Durable Execution, retry policies, saga compensation, and signal integration patterns.

API & SDK/2026-04-22Intermediate

Handling Frequent 529 Overloaded Errors from the Claude API — A Practical Playbook

A 529 Overloaded response from the Claude API is a very different animal from a 429 rate limit. Here is the retry, fallback, and circuit breaker playbook I actually use in production to keep services responsive when Anthropic's platform is temporarily saturated.

API & SDK/2026-04-22Advanced

Designing Idempotency in the Claude Agent SDK: Production Patterns for Safe Retries

How to prevent double-charged customers, duplicate emails, and inventory drift when your Claude Agent SDK retries or resumes. Covers idempotency keys, outbox patterns, and wrapper decorators with working code.

API & SDK/2026-04-22Intermediate

How to Set budget_tokens for Claude Extended Thinking: A Practical Guide Based on Cost, Quality, and Latency

Are you setting budget_tokens to 'something generous and hoping for the best'? Here is a practical framework for choosing the right value per task type, grounded in real measurements.