Agent DailyAgent Daily

Coding Assistance

Using agents as programming assistants and pair programmers

Build a playbook about Coding Assistance

Save articles from this feed, then generate a personalized implementation guide

See a sample →
12 results
RELintermediate
Launch HN: Canary (YC W26) – AI QA that understands your code

Canary is an AI QA agent that analyzes pull requests by understanding codebases, identifying affected user workflows, and automatically generating and executing end-to-end tests. The platform connects to repositories, reads diffs to understand intent, runs tests against preview apps, and reports results directly on PRs with recordings. Beyond PR testing, it supports regression suites and continuous testing via plain English prompts, addressing the gap where modern AI tools accelerate development but lack comprehensive real-world behavior testing before merge.

VisweshycMar 19, 2026
VIDbeginner
Claude Code on desktop

Anthropic has launched Claude Code in desktop applications as a research preview feature. This enables users to run multiple local and remote code execution environments directly from their desktop. The feature allows developers to write, test, and execute code seamlessly within the Claude interface on desktop platforms.

AnthropicMar 18, 2026
TUTintermediate
Parallel tool calls on Claude 3.7 Sonnet Mar 2025 • Tools Enable parallel tool calls on Claude 3.7 Sonnet using batch tool meta-pattern workaround.

Claude 3.7 Sonnet may not make parallel tool calls by default even when enabled. This cookbook demonstrates a workaround using a "batch tool" meta-pattern that wraps multiple tool invocations, encouraging the model to call multiple tools simultaneously in a single response. By introducing a batch_tool that accepts an array of tool invocations, developers can improve latency and efficiency by processing multiple tool calls in parallel rather than sequentially.

Mar 8, 2026
TUTintermediate
Session memory compaction Jan 2026 • Agent Patterns Responses Manage long-running Claude conversations with instant session memory compaction using background threading and prompt caching.

This cookbook teaches developers how to manage long-running Claude conversations by implementing session memory compaction using background threading and prompt caching. Rather than waiting for context limits to be exceeded (reactive approach), the pattern enables instant compaction by proactively building summaries in the background. The guide covers writing effective session memory prompts, implementing background threading for zero-latency compaction, and applying prompt caching to reduce costs by ~80%. It includes Python code examples demonstrating both traditional (slow) and instant (fast) compaction strategies for conversational applications.

Mar 8, 2026
TUTintermediate
Programmatic tool calling (PTC) Reduce latency and token consumption by letting Claude write code that calls tools programmatically in the code execution environment.

Programmatic Tool Calling (PTC) enables Claude to write and execute code that calls tools directly within the code execution environment, eliminating round-trips through the model for each tool invocation. This approach significantly reduces latency and token consumption, especially when dealing with large datasets or sequential tool dependencies. The cookbook demonstrates PTC using a team expense management API scenario where Claude analyzes employee expenses across multiple tool calls, filters irrelevant data programmatically, and identifies budget overages without excessive context window usage.

Mar 8, 2026
RELintermediate
[Release] microsoft/autogen python-v0.7.2: python-v0.7.2

Microsoft AutoGen released python-v0.7.2 with several improvements including making DockerCommandLineCodeExecutor the default for MagenticOne team, adding approval_func option to CodeExecutorAgent, and adding parallel_tool_call support to OpenAI model client config. The release also includes bug fixes for structured logging serialization and new Redis agent memory support for JSON and MARKDOWN formats.

ekzhuMar 6, 2026
REPbeginner
Show HN: Open-source code search with OpenAI's function calling

gpt-code-search is an open-source tool that enables natural language code search across any codebase using OpenAI's GPT-4 and function calling. It runs locally without requiring code indexing or repository uploads, offering functions like search_codebase, get_file_tree, and get_file_contents to help the LLM retrieve and analyze code context.

narenkmanoMar 6, 2026
REPintermediate
Show HN: Nous – Open-Source Agent Framework with Autonomous, SWE Agents, WebUI

Nous is an open-source TypeScript agent framework combining features of CrewAI, OpenDevon, and LangFuse, designed for building autonomous and software engineering agents with integrated tooling. It includes a Web UI, database persistence, tracing, human-in-the-loop functionality, and a novel autonomous agent that generates Python code executed in a WebAssembly sandbox for improved reasoning and reduced LLM costs.

campersMar 6, 2026
RELintermediate
Show HN: Shuttle AI – Instant apps, from a single prompt

Shuttle AI is a tool that generates and deploys fully functional Rust backends from a single natural language prompt using multiple coordinated GPT agents. Users can create complete services like blog platforms or Twitter clones with commands like `shuttle-ai build "Build me a blog service"`, which handles specification generation, code creation, error checking, infrastructure provisioning, and cloud deployment.

openqueryMar 6, 2026
RELbeginner
Show HN: Phind V2 – A GPT-4 agent that’s connected to the internet and your code

Phind V2 launches as an intelligent pair programming agent powered by GPT-4 that dynamically chooses tools like web search, codebase analysis, and recursive reasoning to solve complex technical problems. Key features include a VS Code extension for IDE integration, an Answer Profile system for personalized responses, and the ability to ask clarifying questions and perform multi-step debugging without constant user input.

rushingcreekMar 6, 2026
RELbeginner
[Release] anthropics/claude-code v2.1.68: v2.1.68

Claude Code v2.1.68 updates Opus 4.6 to default to medium effort for Max and Team subscribers, reintroduces the 'ultrathink' keyword for enabling high effort, and removes Opus 4 and 4.1 from the first-party API with automatic migration to Opus 4.6.

ashwin-antMar 6, 2026
RELintermediate
[Release] anthropics/claude-code v2.1.69: v2.1.69

Claude Code v2.1.69 release adds the `/claude-api` skill for API development, expands voice STT support to 10 new languages (20 total), and introduces numerous UX improvements including effort level display, numeric keypad support, and custom session naming for remote control. The release includes significant bug fixes addressing security issues, memory leaks, plugin trust dialogs, OAuth keychain corruption, and various UI/terminal interaction problems.

ashwin-antMar 6, 2026