Agent DailyAgent Daily

Automation

Automating repetitive tasks and workflows with agents

Build a playbook about Automation

Save articles from this feed, then generate a personalized implementation guide

See a sample →
21 results
TUTintermediate
Build an SRE incident response agent with Claude Managed Agents Apr 2026 • Agent Patterns Observability A webhook-triggered responder that investigates logs and runbooks with a custom Skill, fixes infrastructure code, and gates the PR behind a human-approval custom tool — with the full audit trail in the Console.

This tutorial demonstrates building a webhook-triggered SRE incident response agent using Claude Managed Agents that automatically investigates production alerts, consults runbooks, proposes infrastructure fixes via pull requests, and gates merging behind human approval. The agent combines built-in sandbox tools (bash, read, edit) with custom tools for PR management and human-in-the-loop approval, providing complete audit trails in the Anthropic Console. The example uses mocked PagerDuty, GitHub, and Datadog integrations to focus on agent patterns, with guidance for swapping in real services.

Apr 9, 2026
TUTintermediate
Build a Slack data analyst bot with Claude Managed Agents Apr 2026 • Agent Patterns Integrations Mention the bot with a CSV to get an analysis report in-thread, with multi-turn follow-ups on the same session.

This cookbook demonstrates building a Slack data analyst bot using Claude Managed Agents and Bolt for Python. Users mention the bot with a CSV file to receive narrative analysis reports in-thread, with support for multi-turn follow-ups within the same session. The implementation handles file uploads, streams agent progress updates, and manages session persistence across Slack threads.

Apr 9, 2026
TUTintermediate
Threat intelligence enrichment agent Apr 2026 • Tools Agent Patterns Build an agent that autonomously investigates IOCs by querying multiple threat intel sources, cross-referencing findings, mapping to MITRE ATT&CK, and producing structured reports for SIEM and SOAR integration.

This cookbook demonstrates building a Claude-powered threat intelligence enrichment agent that autonomously investigates Indicators of Compromise (IOCs) by querying multiple threat intel sources, correlating findings, mapping to MITRE ATT&CK, and generating structured reports for SIEM/SOAR integration. The agent uses Claude's tool-use capabilities to decide which intelligence sources to query, chain tool calls based on discoveries, and convert free-text analysis into analyst-ready JSON reports. The architecture uses simulated threat intel backends that can be swapped with real APIs (VirusTotal, AbuseIPDB, Shodan, etc.) without changing orchestration logic.

Apr 7, 2026
RELintermediate
Launch HN: Canary (YC W26) – AI QA that understands your code

Canary is an AI QA agent that analyzes pull requests by understanding codebases, identifying affected user workflows, and automatically generating and executing end-to-end tests. The platform connects to repositories, reads diffs to understand intent, runs tests against preview apps, and reports results directly on PRs with recordings. Beyond PR testing, it supports regression suites and continuous testing via plain English prompts, addressing the gap where modern AI tools accelerate development but lack comprehensive real-world behavior testing before merge.

VisweshycMar 19, 2026
TUTintermediate
The observability agent Sep 2025 • Claude Agent SDK Agent Patterns Connect agents to external systems via MCP servers for GitHub monitoring and CI workflows.

This cookbook demonstrates how to connect Claude agents to external systems using MCP (Model Context Protocol) servers for GitHub monitoring and CI workflows. The guide covers integrating Git and GitHub MCP servers to enable agents to interact with repositories, manage workflows, and perform observability tasks. By leveraging MCP servers, agents gain access to specialized tools for Git operations, GitHub platform integration, and CI/CD monitoring without relying on command-line interfaces.

Mar 8, 2026
TUTbeginner
The one-liner research agent Sep 2025 • Claude Agent SDK Agent Patterns Build a research agent using Claude Code SDK with WebSearch for autonomous research.

This cookbook teaches how to build a research agent using Claude Agent SDK with WebSearch tool for autonomous information gathering and synthesis. The guide demonstrates creating a functional research agent in just a few lines of code, then progresses to production improvements including conversation memory, system prompts for specialized behavior, and multimodal research capabilities. The agent autonomously decides when and how to search, follows promising leads, and synthesizes findings without predefined workflows.

Mar 8, 2026
TUTintermediate
Claude Skills for financial applications Oct 2025 • Skills Build financial dashboards and portfolio analytics using Claude's Excel, PowerPoint, PDF skills.

This cookbook teaches how to build financial dashboards, portfolio analytics, and automated reporting workflows using Claude's Excel, PowerPoint, and PDF skills. It covers three main use cases: creating comprehensive financial models with formulas and charts in Excel, generating executive presentations from financial data, and automating multi-format reporting pipelines. The guide includes setup instructions, data loading examples, and helper functions for working with the Anthropic SDK to create professional financial documents directly through Claude's interface.

Mar 8, 2026
TUTintermediate
The site reliability agent Feb 2026 • Claude Agent SDK Agent Patterns Build an incident response agent with read-write MCP tools for autonomous diagnosis, remediation, and post-mortem documentation.

This cookbook demonstrates building an autonomous SRE incident response agent using the Claude Agent SDK with read-write MCP tools for safe infrastructure access. The agent can investigate incidents by querying metrics and logs, diagnose root causes, apply remediations by editing configs and restarting services, and document post-mortems. The pattern uses a subprocess-based MCP server with scoped tool access, clear tool descriptions, and human-in-the-loop workflows to enable autonomous yet controlled incident response.

Mar 8, 2026
TUTbeginner
Introduction to Claude Skills Create documents, analyze data, automate workflows with Claude's Excel, PowerPoint, PDF skills.

This guide introduces Claude Skills, specialized capability packages that enable document creation, data analysis, and workflow automation through Excel, PowerPoint, and PDF generation. Skills use a three-tier progressive loading model (metadata → full instructions → linked files) to optimize token usage and efficiency. The tutorial covers environment setup, API configuration, skill discovery, and practical quick-start examples for Excel, PowerPoint, and PDF workflows.

Mar 8, 2026
TUTintermediate
Programmatic tool calling (PTC) Reduce latency and token consumption by letting Claude write code that calls tools programmatically in the code execution environment.

Programmatic Tool Calling (PTC) enables Claude to write and execute code that calls tools directly within the code execution environment, eliminating round-trips through the model for each tool invocation. This approach significantly reduces latency and token consumption, especially when dealing with large datasets or sequential tool dependencies. The cookbook demonstrates PTC using a team expense management API scenario where Claude analyzes employee expenses across multiple tool calls, filters irrelevant data programmatically, and identifies budget overages without excessive context window usage.

Mar 8, 2026
RELintermediate
[Release] openclaw/openclaw v2026.3.1: openclaw 2026.3.1

OpenClaw v2026.3.1 release introduces major enhancements across multiple platforms including adaptive thinking defaults for Claude 4.6, Kubernetes health check endpoints, expanded Android device capabilities (camera, notifications, sensors), Discord/Telegram session management improvements, and WebSocket-first OpenAI streaming. The release also adds localization support (German, zh-CN), Feishu document/chat tooling, LanceDB memory customization, and numerous reliability fixes for Android, Windows, LINE, and Slack integrations.

steipeteMar 6, 2026
REPbeginner
Show HN: Open-source code search with OpenAI's function calling

gpt-code-search is an open-source tool that enables natural language code search across any codebase using OpenAI's GPT-4 and function calling. It runs locally without requiring code indexing or repository uploads, offering functions like search_codebase, get_file_tree, and get_file_contents to help the LLM retrieve and analyze code context.

narenkmanoMar 6, 2026
TOOintermediate
Show HN: MCP server for Blender that builds 3D scenes via natural language

A custom MCP (Model Context Protocol) server that connects Blender to LLMs like ChatGPT and Claude, enabling natural language control of 3D scene generation. Users can describe complex environments (villages, landscapes, animations) in plain text and the system automatically builds them in Blender with support for spatial reasoning, iterative editing, and object hierarchy management.

pronoMar 6, 2026
TOOintermediate
Show HN: Wild Moose – Autonomous agent for production debugging

Wild Moose is an autonomous AI agent designed for production debugging that connects to observability data like logs and metrics from Datadog. Rather than focusing on code generation, it performs root-cause analysis by exploring multiple data sources through conversational interfaces, combining code execution with API invocations to answer debugging questions.

yasmindMar 6, 2026
REPintermediate
Show HN: Nous – Open-Source Agent Framework with Autonomous, SWE Agents, WebUI

Nous is an open-source TypeScript agent framework combining features of CrewAI, OpenDevon, and LangFuse, designed for building autonomous and software engineering agents with integrated tooling. It includes a Web UI, database persistence, tracing, human-in-the-loop functionality, and a novel autonomous agent that generates Python code executed in a WebAssembly sandbox for improved reasoning and reduced LLM costs.

campersMar 6, 2026
REPbeginner
Show HN: BrowserOS -- browser agents with GPT-OSS, local llms

BrowserOS is an open-source Chromium fork enabling non-developers to create and run browser agents locally. After testing three UX approaches (drag-and-drop workflows, one-shot agents, and plan-follower agents), the team found that having users provide simple natural language plans while the LLM executes each step achieved the best balance—improving success rates from 30% to ~80% even with local models like those run via Ollama or LMStudio.

felarofMar 6, 2026
ARTintermediate
Why it finally makes sense to build, not buy, a workplace search product

The article argues that building custom workplace search products using LLMs is now economically viable and preferable to buying expensive vendor solutions. Modern tools like LangChain, LlamaIndex, and vector databases enable companies to build sophisticated internal search chatbots in days rather than months, with full customization and lower costs than traditional enterprise search products.

jasonwcfanMar 6, 2026
RELintermediate
Show HN: Shuttle AI – Instant apps, from a single prompt

Shuttle AI is a tool that generates and deploys fully functional Rust backends from a single natural language prompt using multiple coordinated GPT agents. Users can create complete services like blog platforms or Twitter clones with commands like `shuttle-ai build "Build me a blog service"`, which handles specification generation, code creation, error checking, infrastructure provisioning, and cloud deployment.

openqueryMar 6, 2026
DISintermediate
Show HN: A business SIM where humans beat GPT-5 by 9.8 X

Skyfall AI created Mini Amusement Parks (MAPs), a RollerCoaster Tycoon-style business simulator benchmark to evaluate whether AI agents can manage real business operations with stochastic events, incomplete information, and resource constraints. Testing revealed humans outperformed GPT-5 agents by 9.8x, with AI systems failing at long-term planning, maintenance prioritization, and handling randomness—demonstrating that current LLMs lack the operational intelligence needed for true AI CEO capabilities.

sumit_pspMar 6, 2026
RELintermediate
Launch HN: Skyvern (YC S23) – open-source AI agent for browser automations

Skyvern is an open-source AI agent platform that automates browser-based workflows using LLMs, allowing users to define goal-based prompts to complete complex tasks across websites without brittle code-based solutions. The platform features a React UI for real-time monitoring, workflow chaining, authenticated sessions with 2FA support, and cached workflows for reusable interactions, with token costs reduced 80% using GPT-4O.

suchintanMar 6, 2026