[Release] openclaw/openclaw v2026.5.10-beta.1: openclaw 2026.5.10-beta.1
By steipete (GitHub)
OpenClaw v2026.5.10-beta.1 introduces significant enhancements to QA automation, real-time voice diagnostics, and gateway security. Key additions include Telegram live PR evidence automation with Crabbox integration, Discord voice diagnostics, and private skill archive uploads. The release also includes numerous bug fixes across CLI, agent routing, model compatibility, and chat delivery systems, plus dependency updates for improved performance and security.
Key Points
- Telegram QA automation: Added live PR evidence capture with Convex-leased credentials, Crabbox transcript capture, motion GIF previews, and inline PR comments for automated testing workflows
- Discord voice diagnostics: Implemented real-time voice analysis including speaker turn detection, playback reset handling, barge-in detection, and audio cutoff analysis
- Telegram Desktop scenario builder: New tool that leases Crabbox, installs native Telegram Desktop, configures OpenClaw gateway with bot credentials, and records VNC artifacts
- Real-time voice instructions: Added `talk.realtime.instructions` to allow operators to append custom voice style guidance while preserving built-in agent-consult behavior
- Private skill archive uploads: Gated opt-in feature (`skills.install.allowUploadedArchives`) enabling trusted Gateway clients to stage and install zip-backed skills with explicit operator approval
- Pure-JS Opus decoder default: Switched to `opusscript` decoder by default to avoid slow native addon compilation outside dedicated voice-performance lanes
- Agent session scoping fix: Preserved scoped background exec/process session references across embedded compaction without exposing unrelated scope sessions
- CLI/onboarding improvements: Enhanced setup, configure, and channel commands with better wayfinding that explains the next useful commands instead of showing terse labels
- Model compatibility fixes: Resolved issues with OpenAI-compatible models returning JSON bodies, xAI Grok reasoning exposure, and Anthropic-compatible thinking block handling
- Security and stability: Fixed silent model hangs with an LLM idle watchdog, improved error handling for malformed requests, and enhanced console capture to prevent session key leakage
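The "LLM idle watchdog" mentioned in the last bullet can be illustrated with a minimal sketch. This is not OpenClaw's implementation: `IdleWatchdog`, `consumeWithWatchdog`, the event shape, and the timeout value are all hypothetical names chosen for illustration, and the clock is simulated so the behavior is deterministic. The idea is only that a stream is treated as hung once no output has arrived for longer than a configured idle window.

```javascript
// Hypothetical sketch: none of these names come from the OpenClaw codebase.
class IdleWatchdog {
  constructor(idleTimeoutMs, now = Date.now) {
    this.idleTimeoutMs = idleTimeoutMs;
    this.now = now; // injectable clock so the sketch can be tested deterministically
    this.lastActivity = now();
  }

  // Record activity; call whenever the model stream produces a chunk.
  touch() {
    this.lastActivity = this.now();
  }

  // True once no activity has been seen for longer than the idle window.
  isStalled() {
    return this.now() - this.lastActivity > this.idleTimeoutMs;
  }
}

// Replays [timestampMs, chunk] events against a simulated clock and stops at
// the first idle gap. A real integration would more likely poll isStalled()
// from a timer and cancel the request instead of waiting for the next chunk.
function consumeWithWatchdog(events, idleTimeoutMs) {
  let clock = 0;
  const watchdog = new IdleWatchdog(idleTimeoutMs, () => clock);
  const received = [];
  for (const [timestampMs, chunk] of events) {
    clock = timestampMs;
    if (watchdog.isStalled()) {
      return { status: "stalled", received };
    }
    watchdog.touch();
    received.push(chunk);
  }
  return { status: "completed", received };
}
```

With a 1000 ms idle window, a stream whose third chunk arrives 4900 ms after the second is reported as stalled with only the first two chunks kept, while a stream with short gaps completes normally.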
Artifacts (4)
skills.install.allowUploadedArchives config (YAML config)
skills:
  install:
    allowUploadedArchives: true  # Enable private skill archive uploads for trusted Gateway clients

talk.realtime.instructions config (YAML config)
talk:
  realtime:
    instructions: "<custom voice style instructions>"  # Append operator-defined voice guidance while preserving built-in agent-consult behavior

Discord voice configuration (YAML config)
discord:
  voice:
    decoder: "opusscript"  # Pure-JS Opus decoder (default, avoids native addon compilation)
    diagnostics:
      enabled: true
      features:
        - speaker_turn_detection
        - playback_reset_handling
        - barge_in_detection
        - audio_cutoff_analysis
    interruptResponseOnInputAudio: false  # Honor disabled input-audio interruption locally

Telegram callback handler fix (JavaScript script)
// Handle managed select button callbacks before raw callback fallback.
// Preserve delimiter-containing option values such as 'env|prod'.
// Fixes issue #79816.
// isManagedSelectButton, processManagedSelect, and processRawCallback are
// assumed to be defined elsewhere; they are not shown in this snippet.
function handleTelegramCallback(callback) {
  // Process managed select button callbacks first
  if (isManagedSelectButton(callback)) {
    return processManagedSelect(callback);
  }
  // Fall back to raw callback handling
  return processRawCallback(callback);
}
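
The comment about preserving delimiter-containing option values such as 'env|prod' can be made concrete with a small sketch. The payload layout used here (`<kind>|<fieldId>|<value>`) and the function name `parseSelectCallback` are assumptions for illustration only, not OpenClaw's actual callback format. The point is that splitting on every delimiter would truncate the value, so only the leading fields are split off and the remainder is kept whole.

```javascript
// Hypothetical payload layout: "<kind>|<fieldId>|<value>".
// This is an illustrative assumption, not OpenClaw's actual wire format.
const DELIMITER = "|";

function parseSelectCallback(data) {
  const first = data.indexOf(DELIMITER);
  if (first === -1) return null; // not a managed select payload
  const second = data.indexOf(DELIMITER, first + 1);
  if (second === -1) return null; // value segment is missing
  return {
    kind: data.slice(0, first),
    fieldId: data.slice(first + 1, second),
    // Everything after the second delimiter is the value, delimiters included.
    value: data.slice(second + 1),
  };
}
```

For example, `parseSelectCallback("select|target|env|prod")` yields a `value` of `"env|prod"`, whereas a naive `data.split("|")[2]` would return only `"env"`.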