Agent DailyAgent Daily

Coding Workflows

Structured methodologies for agent-assisted software development

Build a playbook about Coding Workflows

Save articles from this feed, then generate a personalized implementation guide

See a sample →
19 results
RELintermediate
[Release] anthropics/claude-code v2.1.105: v2.1.105

Claude Code v2.1.105 introduces significant improvements to worktree management, plugin architecture, and user experience. Key additions include the `path` parameter for EnterWorktree, PreCompact hook blocking, background monitor support for plugins, and the `/proactive` alias for `/loop`. The release focuses on stability with better API stream handling, improved error messaging, and numerous bug fixes across file operations, terminal rendering, and MCP integration.

ashwin-antApr 14, 2026
RELintermediate
[Release] anthropics/claude-code v2.1.101: v2.1.101

Claude Code v2.1.101 introduces significant improvements to team collaboration, enterprise security, and user experience. Key additions include a `/team-onboarding` command for generating ramp-up guides, OS CA certificate store trust for enterprise TLS proxies, and auto-creation of cloud environments for remote sessions. The release focuses on enhancing error messaging, fixing critical bugs in session management, and improving plugin/MCP tool reliability across various authentication providers and platforms.

ashwin-antApr 10, 2026
TUTintermediate
Managed Agents tutorial: prompt versioning and rollback Apr 2026 • Agent Patterns Evals Server-side prompt versioning — create v1, evaluate against a labelled test set, ship v2, detect a regression, roll back by pinning sessions to version 1. Covers agents.update, version pinning on sessions.create, and where the review gate moves when prompts are not code.

This tutorial demonstrates server-side prompt versioning and rollback for Managed Agents, enabling PMs to update agent prompts without code deployments. It covers creating an agent (v1), evaluating it against a labeled test set, shipping an updated prompt (v2), detecting performance regressions, and rolling back by pinning sessions to a specific version. The workflow replaces traditional code-based prompt management with immutable versioned prompts that can be quickly reverted if issues arise.

Apr 9, 2026
TUTintermediate
Build an SRE incident response agent with Claude Managed Agents Apr 2026 • Agent Patterns Observability A webhook-triggered responder that investigates logs and runbooks with a custom Skill, fixes infrastructure code, and gates the PR behind a human-approval custom tool — with the full audit trail in the Console.

This tutorial demonstrates building a webhook-triggered SRE incident response agent using Claude Managed Agents that automatically investigates production alerts, consults runbooks, proposes infrastructure fixes via pull requests, and gates merging behind human approval. The agent combines built-in sandbox tools (bash, read, edit) with custom tools for PR management and human-in-the-loop approval, providing complete audit trails in the Anthropic Console. The example uses mocked PagerDuty, GitHub, and Datadog integrations to focus on agent patterns, with guidance for swapping in real services.

Apr 9, 2026
TUTbeginner
Managed Agents tutorial: iterate on a failing test suite Apr 2026 • Agent Patterns Tools Entry-point tutorial for the Claude Managed Agents API. Walks through agent / environment / session creation, file mounts, and the streaming event loop by getting an agent to fix three planted bugs in a calc.py package.

This tutorial introduces the Claude Managed Agents API by walking through a practical debugging workflow where an agent iteratively fixes failing tests in a Python package. It covers the three core resources (Agent, Environment, Session), file mounting, and the streaming event loop pattern. The example demonstrates how agents autonomously discover the iterate-observe-fix loop by running tests, reading failures, editing code, and repeating until all assertions pass.

Apr 9, 2026
TUTintermediate
Building a session browser Mar 2026 • Claude Agent SDK Agent Patterns List, read, rename, tag, and fork Agent SDK sessions on disk to build a conversation history sidebar without writing a transcript parser.

The Claude Agent SDK provides built-in session management functions to build conversation history sidebars without writing custom parsers. This cookbook demonstrates how to list, read, rename, tag, and fork sessions stored as JSONL transcripts on disk. By leveraging SDK primitives like list_sessions(), get_session_messages(), and rename_session(), developers can implement session browsers for desktop apps, IDE extensions, and chatbots without managing file I/O directly.

Apr 7, 2026
RELintermediate
[Release] anthropics/claude-code v2.1.91: v2.1.91

Claude Code v2.1.91 introduces MCP tool result persistence overrides supporting up to 500K characters, adds shell execution disabling options, enables multi-line prompts in deep links, and allows plugins to ship executables. The release includes critical fixes for transcript chain breaks, terminal key handling, plan mode persistence in remote sessions, and JSON schema validation, plus performance improvements and enhanced API guidance.

ashwin-antApr 3, 2026
RELintermediate
[Release] anthropics/claude-code v2.1.83: v2.1.83

Claude Code v2.1.83 introduces significant improvements to policy management, environment reactivity, security, and user experience. Key additions include a drop-in directory for managed settings, new hook events for environment changes, transcript search functionality, and enhanced subprocess credential scrubbing. The release addresses numerous stability issues, performance regressions, and UI glitches while improving startup latency and memory usage across various platforms.

ashwin-antMar 25, 2026
RELintermediate
[Release] anthropics/claude-code v2.1.77: v2.1.77

Claude Code v2.1.77 introduces significant improvements to token limits for Claude Opus and Sonnet models (up to 128k), enhances sandbox filesystem controls with `allowRead` settings, and fixes numerous critical bugs affecting bash commands, memory management, API tracking, and terminal UI interactions. The release also improves performance with faster startup times and session resumption, better plugin validation, and refined agent communication workflows.

ashwin-antMar 17, 2026
TUTintermediate
The observability agent Sep 2025 • Claude Agent SDK Agent Patterns Connect agents to external systems via MCP servers for GitHub monitoring and CI workflows.

This cookbook demonstrates how to connect Claude agents to external systems using MCP (Model Context Protocol) servers for GitHub monitoring and CI workflows. The guide covers integrating Git and GitHub MCP servers to enable agents to interact with repositories, manage workflows, and perform observability tasks. By leveraging MCP servers, agents gain access to specialized tools for Git operations, GitHub platform integration, and CI/CD monitoring without relying on command-line interfaces.

Mar 8, 2026
TUTbeginner
The one-liner research agent Sep 2025 • Claude Agent SDK Agent Patterns Build a research agent using Claude Code SDK with WebSearch for autonomous research.

This cookbook teaches how to build a research agent using Claude Agent SDK with WebSearch tool for autonomous information gathering and synthesis. The guide demonstrates creating a functional research agent in just a few lines of code, then progresses to production improvements including conversation memory, system prompts for specialized behavior, and multimodal research capabilities. The agent autonomously decides when and how to search, follows promising leads, and synthesizes findings without predefined workflows.

Mar 8, 2026
TUTintermediate
Building custom Skills for Claude Oct 2025 • Skills Create, deploy, and manage custom skills extending Claude with specialized organizational workflows.

This cookbook guide teaches developers how to create, deploy, and manage custom skills for Claude that extend its capabilities with organization-specific workflows and domain knowledge. Custom skills are specialized expertise packages bundled as markdown files, scripts, and resources that codify organizational knowledge, ensure consistency, and automate complex workflows while maintaining privacy. The guide covers skill architecture, SKILL.md requirements, progressive disclosure for token optimization, and provides utility functions for skill management including creation, listing, and deletion.

Mar 8, 2026
TUTintermediate
Automatic context compaction Manage context limits in long-running agentic workflows by automatically compressing conversation history.

This cookbook demonstrates automatic context compaction for managing token limits in long-running agentic workflows. It shows how the Claude Agent Python SDK can automatically compress conversation history when token usage exceeds a threshold, enabling tasks to continue beyond the 200k token context limit. The example uses a customer service agent processing support tickets, where each ticket requires multiple tool calls that accumulate in conversation history. By implementing context compaction with the compaction_control parameter, agents can maintain focus and efficiency across many iterations without manual context management.

Mar 8, 2026
RELintermediate
[Release] anthropics/claude-code v2.1.71: v2.1.71

Claude Code v2.1.71 introduces the `/loop` command for recurring prompt execution and cron scheduling tools, adds rebindable voice activation keybindings, and expands bash auto-approval allowlist with common utilities. The release focuses on stability improvements, fixing critical issues like stdin freezes in long sessions, CoreAudio initialization delays, OAuth token refresh failures, and image processing context overflow. Multiple UX and plugin management enhancements improve startup performance, fork isolation, and multi-instance reliability.

ashwin-antMar 8, 2026
RELintermediate
[Release] openai/openai-agents-python v0.9.2: v0.9.2

OpenAI Agents Python v0.9.2 introduces a new `reasoning_item_id_policy: 'omit'` option for RunConfig to prevent 400 errors when using reasoning models with inconsistent item sets. The release includes fixes for reasoning item filtering in nested handoffs and voice trace handling, plus documentation updates.

seratchMar 6, 2026
REPintermediate
Show HN: Nous – Open-Source Agent Framework with Autonomous, SWE Agents, WebUI

Nous is an open-source TypeScript agent framework combining features of CrewAI, OpenDevon, and LangFuse, designed for building autonomous and software engineering agents with integrated tooling. It includes a Web UI, database persistence, tracing, human-in-the-loop functionality, and a novel autonomous agent that generates Python code executed in a WebAssembly sandbox for improved reasoning and reduced LLM costs.

campersMar 6, 2026
RELbeginner
Show HN: Phind V2 – A GPT-4 agent that’s connected to the internet and your code

Phind V2 launches as an intelligent pair programming agent powered by GPT-4 that dynamically chooses tools like web search, codebase analysis, and recursive reasoning to solve complex technical problems. Key features include a VS Code extension for IDE integration, an Answer Profile system for personalized responses, and the ability to ask clarifying questions and perform multi-step debugging without constant user input.

rushingcreekMar 6, 2026
RELintermediate
Launch HN: Skyvern (YC S23) – open-source AI agent for browser automations

Skyvern is an open-source AI agent platform that automates browser-based workflows using LLMs, allowing users to define goal-based prompts to complete complex tasks across websites without brittle code-based solutions. The platform features a React UI for real-time monitoring, workflow chaining, authenticated sessions with 2FA support, and cached workflows for reusable interactions, with token costs reduced 80% using GPT-4O.

suchintanMar 6, 2026
RELintermediate
[Release] anthropics/claude-code v2.1.59: v2.1.59

Claude Code v2.1.59 introduces auto-memory context saving with /memory management, a new /copy command for interactive code block selection, and improved bash command prefix suggestions for compound commands. The release also includes memory optimization for multi-agent sessions, fixes for MCP OAuth token refresh race conditions, and better error messaging for deleted working directories.

ashwin-antMar 6, 2026