[workflow-analysis] Weekly Workflow Analysis — 2026-02-23 #17875
This discussion was automatically closed because it expired on 2026-02-24T09:39:32.173Z.
This report covers workflow runs analyzed on 2026-02-23, capturing all observable runs within the past week. 35 runs were analyzed, totaling 148.6M tokens, ~$2.43 in estimated cost, and 99 agent turns. 10 errors were recorded across 5 distinct failing workflows.
Summary
Overall success rate: 71% (25 of the 35 analyzed runs succeeded; 3 were still in progress)
Critical Issues
Issue Monster — Systematic Recurring Failure (6/6 runs failed)
Issue Monster (issue-monster.lock.yml) has failed on every run today (runs #2036–#2041). Each failure is characterized by:
The secret_verification_result output is set, but the agent never executes.
This pattern indicates that the agent is failing at startup, either during secret validation (COPILOT_GITHUB_TOKEN) or lockdown mode validation, before the LLM is even invoked. Other Copilot workflows (Auto-Triage Issues, CI Failure Doctor, Contribution Check) are succeeding, which may point to a workflow-specific configuration issue rather than a global token problem.
Recommendation: Investigate the "Validate lockdown mode requirements" and "Validate COPILOT_GITHUB_TOKEN secret" steps in the failed agent job. Check whether the skip-if-match condition (is:pr is:open is:draft author:app/copilot-swe-agent, max 5) is being evaluated correctly, or whether the workflow has a configuration mismatch with the runner environment.
Documentation Noob Tester — Upload Assets Failure + High Firewall Blocking
The agent job succeeded and created a discussion (#17863), but the upload_assets job failed. The workflow also generated an unusually high number of blocked network requests:
www.google.com (188), content-autofill.googleapis.com (56), accounts.google.com (27), 172.30.0.20:4321 (27), avatars.githubusercontent.com (12), clients2.google.com (10), android.clients.google.com (3)
Most of the blocked domains are consistent with a Chromium/Playwright browser process running within the sandbox and attempting to reach Google services (sign-in, autofill, telemetry). This does not prevent the test from running, but it generates excessive firewall noise and suggests that the browser should be configured to suppress Google-specific telemetry and autofill requests.
Recommendation: Configure the browser instance used by Documentation Noob Tester to disable autofill, telemetry, and sync features (e.g., --disable-sync, --no-default-browser-check, --disable-client-side-phishing-detection). Separately, investigate the upload_assets failure; the agent succeeded but the artifact upload step failed.
Failure Summary
All Failed Runs
[Table not recovered. Only one column survives: it lists agent for nine entries and upload_assets for one.]
Performance Analysis
Token Usage by Workflow
Notable: Codex-based workflows consume very high token counts relative to their turn counts. Changeset Generator run #1554 used 59.7M tokens in only 3 turns (~20M tokens/turn), likely due to large context payloads.
Cost breakdown: Total $2.43, entirely attributed to the Go Fan workflow (Claude engine, $2.43 for 2.4M tokens, 59 turns, 7.6m). All Copilot and Codex runs show no cost data.
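The per-turn and per-token figures above can be reproduced with a few lines of arithmetic. The sketch below uses only the numbers quoted in this section; the helper names are illustrative and not part of any workflow tooling.

```python
def per_turn(tokens: float, turns: int) -> float:
    """Average number of tokens consumed per agent turn."""
    return tokens / turns


def cost_per_million(cost_usd: float, tokens: float) -> float:
    """Estimated cost in USD per one million tokens."""
    return cost_usd / (tokens / 1_000_000)


# Figures quoted in the report.
changeset_tokens_per_turn = per_turn(59.7e6, 3)       # Changeset Generator run #1554
go_fan_tokens_per_turn = per_turn(2.4e6, 59)          # Go Fan (Claude engine)
go_fan_cost_per_million = cost_per_million(2.43, 2.4e6)

print(f"Changeset Generator #1554: {changeset_tokens_per_turn / 1e6:.1f}M tokens/turn")
print(f"Go Fan: {go_fan_tokens_per_turn / 1e3:.1f}K tokens/turn")
print(f"Go Fan: ${go_fan_cost_per_million:.2f} per million tokens")
```

On these inputs the Changeset Generator run averages about 19.9M tokens per turn, against roughly 40.7K tokens per turn for Go Fan, which is the gap the note above attributes to large context payloads.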
Firewall & Network Analysis
Network Request Breakdown
Overall: 2,341 total requests | 1,085 allowed (46%) | 1,256 blocked (54%)
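As a quick consistency check, the allowed and blocked shares can be recomputed from the raw counts on the line above. This is a minimal sketch; the variable names are illustrative.

```python
total, allowed, blocked = 2341, 1085, 1256

# The two categories should account for every request.
assert allowed + blocked == total

allowed_pct = round(100 * allowed / total)
blocked_pct = round(100 * blocked / total)

print(f"allowed: {allowed_pct}%  blocked: {blocked_pct}%")
```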
The majority of blocked traffic (924 requests) is unclassified (the - domain), likely local/loopback traffic. Key patterns:
Healthy (allow-listed):
api.githubcopilot.com: 532 allowed (Copilot LLM)
github.com: 323 allowed (API calls)
api.openai.com: 132 allowed (Codex/OpenAI)
api.anthropic.com: 46 allowed (Claude)
registry.npmjs.org: 36 allowed (package installs)
Blocked (unexpected):
www.google.com: 188 blocked (browser telemetry from Documentation Noob Tester)
content-autofill.googleapis.com: 56 blocked (browser autofill from Documentation Noob Tester)
accounts.google.com: 27 blocked (browser sign-in from Documentation Noob Tester)
172.30.0.20:4321: 27 blocked (internal IP; local dev server?)
proxy.golang.org: 3 blocked (Dependabot Dependency Checker attempting to reach the Go module proxy, which is not in its allow-list)
Actionable: Dependabot Dependency Checker should add proxy.golang.org to its network allow-list if it needs to check Go dependencies. Currently, all 3 of its requests to that domain are blocked.
Missing Tools
Missing Tool Reports
[Table not recovered. The surviving cells name the missing tools: playwright, playwright MCP tools, and GitHub MCP server.]
Smoke Codex: Both runs (#1996, #1997) flagged missing Playwright, but both succeeded overall by working around the limitation. This is a recurring gap if the smoke test requires page navigation verification.
AI Moderator: The codex engine had only safeoutputs tools available, meaning it could not fetch issue or comment content to perform moderation. This run succeeded overall (2 turns, 195K tokens), but the AI Moderator's effectiveness may be limited without GitHub MCP access.
Workflow Inventory Observations
Compilation & Status Concerns
Uncompiled workflows (compiled: "No"); changes to these .md files won't take effect until compiled:
agent-performance-analyzer
daily-doc-updater (status: active; runs despite being uncompiled)
daily-issues-report
daily-news (status: active; runs despite being uncompiled)
daily-rendering-scripts-verifier
discussion-task-miner
github-mcp-tools-report (status: active)
layout-spec-maintainer
typist
Expired workflow: daily-team-status shows time_remaining: "Expired", meaning the stop-after deadline has passed. If this workflow is still relevant, the stop-after date should be extended. If not, it can be archived.
Recommendations
Issue Monster (Critical): Investigate the agent job startup sequence, specifically the steps "Validate COPILOT_GITHUB_TOKEN secret" and "Validate lockdown mode requirements", to identify why 6 consecutive runs fail in under 30 seconds with 0 token usage. Check for a workflow-specific secret or configuration mismatch. Runs: §22300114470, §22298919454, §22294636876
Documentation Noob Tester (Medium): Add browser flags to suppress Google telemetry/autofill to reduce firewall noise (546 blocked requests per run). Separately, fix the upload_assets step failure. Run: §22297365436
Dependabot Dependency Checker (Low): Add proxy.golang.org to the network allow-list to enable Go dependency checks.
Uncompiled workflows (Low): Compile workflows marked as compiled: "No", especially daily-doc-updater, daily-news, and github-mcp-tools-report, which are active but running from stale compiled files.
Smoke Codex (Low): Consider adding a Playwright MCP server to the smoke test environment if web navigation verification is a meaningful test signal.
AI Moderator (Medium): Verify whether the GitHub MCP server should be available in the AI Moderator environment. Without it, the agent cannot read issue/comment content for moderation.
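For the Documentation Noob Tester recommendation, the browser flags named in this report could be passed at launch roughly as follows. This is a hedged sketch, not the workflow's actual configuration: it assumes the browser is started through Playwright's Python sync API, which the report does not confirm, and the names QUIET_CHROMIUM_ARGS, launch_quiet_chromium, and demo are hypothetical. Whether these three flags silence every blocked Google domain has not been verified here.

```python
# Flags taken verbatim from the report's recommendation.
QUIET_CHROMIUM_ARGS = [
    "--disable-sync",
    "--no-default-browser-check",
    "--disable-client-side-phishing-detection",
]


def launch_quiet_chromium(playwright):
    """Launch Chromium with the flags above.

    `playwright` is expected to be the object yielded by
    `sync_playwright()` from `playwright.sync_api` (an assumption
    about how the workflow drives the browser).
    """
    return playwright.chromium.launch(args=QUIET_CHROMIUM_ARGS)


def demo():
    """Example usage; requires the `playwright` package and a browser install."""
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = launch_quiet_chromium(p)
        browser.close()
```

If the workflow instead starts the browser through a Playwright MCP server, the same flags would need to be supplied through that server's own launch configuration.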