This audit analyzed 16 workflow runs from the last 24 hours across multiple agentic engines (Claude, Copilot, and Codex). The overall system shows good health with a 68.75% success rate, though several recurring issues require attention, particularly around MCP server initialization and timeout handling.
Key Highlights:
3 failures, primarily related to MCP client initialization issues
$1.63 total cost with efficient token usage averaging 751K tokens per run
No missing tool requests, indicating good tool coverage
Zero firewall denials, showing proper network configuration
📈 Workflow Health Trends
Success/Failure Patterns
The 30-day trend shows gradual improvement in success rates, climbing from 65% to nearly 69%. This positive trajectory suggests that recent stability improvements and bug fixes are having a measurable impact. The weekly pattern visible in the data indicates consistent workflow activity with mid-week peaks in execution volume.
Token Usage & Costs
Token consumption remains stable around 1.8-2.2M tokens daily with costs hovering around $1.50-$1.70 per day. The 7-day moving average shows minimal volatility, indicating predictable resource usage. Today's spike to 2.25M tokens was driven by the Documentation Unbloat workflow's extensive file editing operations.
Full Audit Report
Audit Summary
Period: Last 24 hours (November 26-27, 2025)
Runs Analyzed: 16
Workflows Active: 7
Success Rate: 68.75%
Total Duration: 50.6 minutes
Total Tokens: 2,253,910
Total Cost: $1.63
Issues Found: 77 errors, 19 warnings
Workflow Breakdown by Engine
| Engine | Runs | Success | Failure | Cancelled |
|---|---|---|---|---|
| Claude | 3 | 1 | 2 | 0 |
| Copilot | 8 | 5 | 1 | 2 |
| Codex | 1 | 1 | 0 | 0 |
| N/A (cancelled early) | 4 | 0 | 0 | 4 |
Critical Issues
1. MCP Client Initialization Failures
Severity: 🔴 High
Multiple workflows failed to start MCP clients for playwright and serena servers:
Failed to start MCP client for playwright: McpError: MCP error -32001: Request timed out (2 occurrences)
Failed to start MCP client for serena: Error: spawn uvx ENOENT (2 occurrences)
⚠️ Important Note: Many logged "errors" (45 out of 77) are actually normal log messages, debug output, or JSON serialization artifacts being incorrectly classified as errors:
Log messages containing {\"type\":\"result\",...} or {\"type\":\"user\",...}
Success result messages like THREAT_DETECTION_RESULT:{...}
Normal workflow completion messages
Debug output from safeoutputs MCP server
Recommendation: Improve error detection logic to filter out these false positives.
Affected Workflows:
Root Cause: The uvx command is not available in the PATH, preventing MCP servers that use uvx (Python package executor) from starting.
Impact: Workflows cannot use Playwright browser automation or Serena MCP server capabilities, limiting functionality.
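A "spawn uvx ENOENT" error means the operating system could not find the uvx executable when the MCP client tried to launch it. A minimal sketch of a pre-flight check, assuming a Python setup step is available on the runner (the function name `find_missing_commands` is hypothetical and not part of any existing workflow tooling):

```python
import shutil


def find_missing_commands(commands, path=None):
    """Return the commands that cannot be resolved on PATH.

    shutil.which returns None when no matching executable is found,
    which is the same condition that surfaces later as ENOENT on spawn.
    """
    return [cmd for cmd in commands if shutil.which(cmd, path=path) is None]


if __name__ == "__main__":
    # "uvx" is the command named in the error above.
    missing = find_missing_commands(["uvx"])
    if missing:
        print("Missing required commands: " + ", ".join(missing))
    else:
        print("All required commands are available.")
```

Running a check like this before starting MCP servers would turn a late spawn failure into an early, explicit message about the runner environment.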
2. MCP Timeout Errors
Severity: 🔴 High
Generic MCP timeout errors occurring during workflow execution:
MCP error -32001: Request timed out (2 occurrences)
Affected Workflows:
Root Cause: MCP server requests exceeding timeout thresholds, potentially due to slow server initialization or network issues.
Impact: Workflow execution interrupted, requiring retries.
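One common way to absorb slow server initialization is to retry the timed-out request with exponential backoff instead of failing the run on the first timeout. The sketch below is an illustration only: `call_with_retries` is a hypothetical helper, it assumes the timeout surfaces as Python's built-in `TimeoutError`, and it does not reflect how the audited workflows actually handle MCP timeouts.

```python
import time


def call_with_retries(operation, attempts=3, base_delay=1.0, sleep=time.sleep):
    """Run operation(); on TimeoutError, wait and retry with exponential backoff.

    The delay doubles after each failed attempt: base_delay, 2 * base_delay, ...
    The final timeout is re-raised so the caller still sees the failure.
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(attempts):
        try:
            return operation()
        except TimeoutError:
            if attempt == attempts - 1:
                raise  # out of retries: surface the timeout to the caller
            sleep(base_delay * (2 ** attempt))
```

The `sleep` parameter exists only so the backoff schedule can be observed or skipped in tests; in normal use the default `time.sleep` applies.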
3. JSON Parsing Errors
Severity: 🟡 Medium
Unexpected token '#', "### Ran Pl"... is not valid JSON (1 occurrence)
Affected Workflows:
Root Cause: Playwright MCP server returning markdown-formatted responses instead of JSON, causing parsing failures.
Impact: Tool calls fail with cryptic error messages, requiring manual investigation.
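Until the response format is fixed, a caller can at least make this failure easier to diagnose by catching the parse error and keeping the raw text. The following is a rough sketch under that assumption; `parse_tool_response` and the returned dictionary shape are hypothetical, and the markdown string in the usage lines is an invented sample rather than the actual server output.

```python
import json


def parse_tool_response(text):
    """Parse a tool response as JSON, keeping the raw text when parsing fails."""
    try:
        return {"ok": True, "data": json.loads(text)}
    except json.JSONDecodeError as exc:
        # Preserve the original payload so the non-JSON response can be inspected.
        return {
            "ok": False,
            "raw": text,
            "error": f"response is not valid JSON: {exc.msg}",
        }


# Hypothetical inputs for illustration.
print(parse_tool_response('{"status": "done"}')["ok"])
print(parse_tool_response("### Heading")["ok"])
```

This does not repair the underlying mismatch; it only replaces a cryptic exception with a structured result that records what the server actually sent.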
Error Analysis
Error Distribution
Noise vs. Signal
Log messages containing {\"type\":\"result\",...} or {\"type\":\"user\",...}
Success result messages like THREAT_DETECTION_RESULT:{...}
Recommendation: Improve error detection logic to filter out these false positives.
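The recommended filtering could start from the false-positive shapes named in this report. The sketch below is an assumption about how such a filter might look, not the audit tool's actual classifier: `NOISE_PATTERNS`, `is_probable_noise`, and `split_errors` are hypothetical names, and the patterns cover only the serialized `type` messages and the `THREAT_DETECTION_RESULT:` prefix.

```python
import re

# Patterns derived from the false-positive examples in this report.
# The optional backslashes allow both plain and escaped JSON quotes.
NOISE_PATTERNS = [
    re.compile(r'\{\\?"type\\?":\\?"(result|user)\\?"'),
    re.compile(r"THREAT_DETECTION_RESULT:"),
]


def is_probable_noise(line):
    """Return True when a logged 'error' line matches a known benign pattern."""
    return any(pattern.search(line) for pattern in NOISE_PATTERNS)


def split_errors(lines):
    """Partition log lines into (real_errors, noise), preserving order."""
    real, noise = [], []
    for line in lines:
        (noise if is_probable_noise(line) else real).append(line)
    return real, noise
```

Keeping the noise list rather than discarding it makes it possible to review what was filtered and to tighten the patterns if a genuine error is ever misclassified.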
Warning Analysis
Warning Distribution
Most warnings are benign:
Performance Metrics
Resource Usage
Highest Cost Workflow
Documentation Unbloat (§19718637730)
This workflow performed extensive documentation cleanup involving multiple file reads, edits, and a full documentation site build verification.
Most Efficient Workflow
Smoke Claude (§19719726355)
Completed smoke test validation efficiently with minimal turns and tokens.
Tool Usage Analysis
Top Tools by Call Count
MCP Server Adoption
Active MCP Servers:
Problematic MCP Servers:
Firewall Analysis
Network Activity Summary
Allowed Domains
All network requests were to expected and authorized endpoints:
Firewall Health: ✅ Excellent
The absence of denied requests indicates a proper firewall configuration. All workflows had access to the external services they required.
Affected Workflows
Failed Workflows ❌
Smoke Copilot (§19719726259)
.claude.json missing
Smoke Claude (§19716996642)
Smoke Copilot No Firewall (§19719726276)
Successful Workflows ✅
Notable successes:
Documentation Unbloat (§19718637730)
Changeset Generator (§19716996656)
Smoke Codex (§19716996651)
Missing Tools
✅ No missing tools reported in this audit period.
This is excellent - all tool requests were satisfied by available tools. No new tool capabilities are needed at this time.
MCP Server Failures
Identified Failures
Root Causes
uvx (Python package executor), which is not installed in the runner environment
Recommendations
🔥 Critical Priority
Install uvx in Runner Environment
Increase MCP Initialization Timeouts
Fix Playwright MCP Response Format
Improve Error Classification
📋 Medium Priority
Add .claude.json to Smoke Test Setup
.claude.json file
Monitor IPv6 Network Issues
💡 Low Priority
Historical Context
This is the first automated audit report, so no historical comparison is available yet. Future audits will track:
Next Steps
References: