Context Management

[Release] openclaw/openclaw v2026.5.28-beta.2: openclaw 2026.5.28-beta.2

OpenClaw v2026.5.28-beta.2 delivers major stability and safety improvements across agent runtime recovery, channel delivery, mobile/chat surfaces, and provider coverage. Key enhancements include steadier subagent workspace separation, safer session identity handling across multiple channels (Matrix, Slack, Discord, Teams, etc.), broader mobile UI refresh, stricter input validation, expanded provider support (Claude Opus 4.8, Fal Krea, NVIDIA models), and faster CLI/auth failure detection with clearer recovery paths.

steipeteMay 29, 2026

[Release] openclaw/openclaw v2026.5.28-beta.1: openclaw 2026.5.28-beta.1

OpenClaw v2026.5.28-beta.1 delivers major stability and safety improvements across agent runtime recovery, channel delivery, mobile/chat surfaces, CLI/auth tooling, and performance optimization. Key enhancements include steadier subagent workspace separation, safer session identity handling across plugins and messaging platforms, broader mobile UI refresh with state preservation, faster CLI failure modes with clearer recovery guidance, and reduced repeated work in hot paths through improved caching. The release also expands documentation, adds ClawHub plugin features, and strengthens QA/E2E validation with bounded logging and cross-OS artifact handling.

Translating Claude’s thoughts into language

steipeteMay 29, 2026

VIDintermediate

Claude and other AI models process information through numerical activations rather than words, encoding their thoughts in high-dimensional vector spaces. The video explores how these internal numerical representations translate into human language, examining the gap between how models think (in numbers) and how they communicate (in words). Understanding this translation process is crucial for interpreting model behavior and improving AI interpretability.

AnthropicMay 8, 2026

[Release] openai/openai-agents-python v0.15.2: v0.15.2

OpenAI Agents Python v0.15.2 introduces context management model settings and fixes critical issues with conversation session replay, tool execution, and error handling. The release includes 11 bug fixes addressing assistant conversation item IDs, function tool tracing, MCP tool metadata isolation, and stream terminal rejection. Comprehensive test coverage improvements ensure reliability across realtime tool behaviors, handoff mechanisms, and tool identity helpers.

seratchMay 6, 2026

[Release] openclaw/openclaw v2026.4.14-beta.1: openclaw 2026.4.14-beta.1

OpenClaw v2026.4.14-beta.1 is a maintenance release addressing 30+ security fixes, stability improvements, and feature enhancements across messaging platforms, browser automation, scheduling, memory systems, and agent tooling. Key improvements include replacing markdown parsing to prevent ReDoS attacks, fixing Telegram forum topic handling, correcting cron scheduler behavior, and enforcing SSRF/security policies across multiple integrations. The release emphasizes background task optimization and proper context preservation across system components.

steipeteApr 14, 2026

Building a session browser Mar 2026 • Claude Agent SDK Agent Patterns List, read, rename, tag, and fork Agent SDK sessions on disk to build a conversation history sidebar without writing a transcript parser.

The Claude Agent SDK provides built-in session management functions to build conversation history sidebars without writing custom parsers. This cookbook demonstrates how to list, read, rename, tag, and fork sessions stored as JSONL transcripts on disk. By leveraging SDK primitives like list_sessions(), get_session_messages(), and rename_session(), developers can implement session browsers for desktop apps, IDE extensions, and chatbots without managing file I/O directly.

Context engineering: memory, compaction, and tool clearing Mar 2026 • Tools Agent Patterns Compare context engineering strategies for long-running agents and learn when each applies, what it costs, and how they compose.

Apr 7, 2026

ARTintermediate

This cookbook teaches context engineering strategies for long-running AI agents, focusing on three key techniques: compaction (summarizing context), tool-result clearing (removing re-fetchable tool outputs), and memory (persistent external storage). The guide addresses context rot—the degradation of model performance as context windows grow—and provides practical implementations using Claude's API. Through a research agent example, it demonstrates how to combine these strategies to manage token growth, maintain conversation continuity, and persist knowledge across sessions.

Mar 31, 2026

Knowledge graph construction with Claude Mar 2026 • RAG & Retrieval Tools Build knowledge graphs from unstructured text using Claude for entity extraction, relation mining, deduplication, and multi-hop graph querying.

This guide teaches how to build knowledge graphs from unstructured text using Claude's structured outputs for entity extraction, relation mining, and entity resolution. Rather than training traditional NER and relation classifiers, Claude handles each stage via prompts, enabling multi-hop graph reasoning without a database. The approach uses Haiku for high-volume extraction and Sonnet for entity resolution, with techniques transferable to production databases like Neo4j or PostgreSQL.

Getting started with projects in Claude.ai

Mar 31, 2026

VIDbeginner

This tutorial introduces Projects in Claude.ai, a feature that enables users to organize their work by maintaining persistent context and custom instructions across conversations. Projects allow you to group related conversations, documents, and settings in one place, making it easier to manage complex workflows and maintain consistency across multiple interactions with Claude.

AnthropicMar 18, 2026

Speculative prompt caching May 2025 • Responses Reduce time-to-first-token by warming cache speculatively while users formulate their queries.

Speculative prompt caching is a technique that reduces time-to-first-token (TTFT) by warming up Claude's cache while users are still formulating their queries. Instead of waiting until a user submits their question to load context into the cache, the system begins cache warming immediately when the user starts typing. This cookbook demonstrates the pattern using SQLite source code as example context, showing how to implement it with the Anthropic Python SDK using async operations and prompt caching controls.

Memory & context management with Claude Sonnet 4.6 May 2025 • Tools Agent Patterns Build AI agents with persistent memory using Claude's memory tool and context editing.

This cookbook demonstrates how to build AI agents with persistent memory using Claude's memory tool and context editing capabilities. It addresses challenges of long-running agents losing learned patterns between sessions and context window overflow by implementing cross-conversation learning and automatic context management. The guide covers practical implementations for use cases like code review assistants, research assistants, and customer support bots, with setup instructions and best practices for memory security and organization.

Session memory compaction Jan 2026 • Agent Patterns Responses Manage long-running Claude conversations with instant session memory compaction using background threading and prompt caching.

This cookbook teaches developers how to manage long-running Claude conversations by implementing session memory compaction using background threading and prompt caching. Rather than waiting for context limits to be exceeded (reactive approach), the pattern enables instant compaction by proactively building summaries in the background. The guide covers writing effective session memory prompts, implementing background threading for zero-latency compaction, and applying prompt caching to reduce costs by ~80%. It includes Python code examples demonstrating both traditional (slow) and instant (fast) compaction strategies for conversational applications.

Automatic context compaction Manage context limits in long-running agentic workflows by automatically compressing conversation history.

This cookbook demonstrates automatic context compaction for managing token limits in long-running agentic workflows. It shows how the Claude Agent Python SDK can automatically compress conversation history when token usage exceeds a threshold, enabling tasks to continue beyond the 200k token context limit. The example uses a customer service agent processing support tickets, where each ticket requires multiple tool calls that accumulate in conversation history. By implementing context compaction with the compaction_control parameter, agents can maintain focus and efficiency across many iterations without manual context management.

Tool search with embeddings Scale Claude applications to thousands of tools using semantic embeddings for dynamic tool discovery.

This cookbook demonstrates how to scale Claude applications from dozens to thousands of tools using semantic embeddings for dynamic tool discovery. Instead of front-loading all tool definitions (which consumes context and increases latency), the approach provides Claude with a single tool_search tool that returns relevant capabilities on demand, reducing context usage by 90%+. The guide walks through implementing client-side semantic search using SentenceTransformer embeddings to match user queries with appropriate tools from large libraries, making it practical for production applications managing extensive tool ecosystems.

[Release] openclaw/openclaw v2026.3.7: openclaw 2026.3.7

RELadvanced

OpenClaw v2026.3.7 introduces major enhancements to context engine plugins, persistent channel bindings for Discord/Telegram, multi-language UI support (Spanish), improved web search capabilities, and Docker containerization optimizations. The release includes a breaking change requiring explicit gateway auth mode configuration. Key additions enable alternative context management strategies, durable ACP thread routing, per-topic agent overrides, and enhanced security with config validation.

steipeteMar 8, 2026

[Release] openai/openai-agents-python v0.9.0: v0.9.0

OpenAI Agents Python SDK v0.9.0 drops Python 3.9 support and introduces configurable timeout handling for function tools with options for timeout_seconds, timeout_behavior, and timeout_error_function. The release also narrows the Agent#as_tool() return type to FunctionTool, adds ToolOutputTrimmer for context management, and includes fixes for tracing and audio buffer handling.

seratchMar 6, 2026

[Release] openclaw/openclaw v2026.2.23: openclaw 2026.2.23

OpenClaw v2026.2.23 release adds first-class Kilo Gateway and Moonshot provider support, introduces prompt caching documentation and per-agent parameter overrides, and hardens session maintenance with disk-budget controls and security headers. The release includes numerous fixes for browser SSRF policies, Telegram polling/reactions, context overflow detection, and agent reasoning/compaction behavior.