📊 Agentic Workflow Lock File Statistics - November 2025 #4908
Closed
Replies: 1 comment
-
|
This discussion was automatically closed because it was created by an agentic workflow more than 3 days ago. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
This report provides a comprehensive statistical analysis of all 95 agentic workflow lock files (
.lock.yml) in the repository, revealing usage patterns, popular configurations, structural characteristics, and interesting insights about how gh-aw workflows are designed and deployed.The analysis examined workflow triggers, safe outputs, permissions, file sizes, job complexity, and tool configurations to understand the landscape of agentic workflows in this repository.
Full Statistical Report
Executive Summary
File Size Distribution
Key Statistics:
arxiv.lock.yml(80.23 KB)poem-bot.lock.yml(460.65 KB)Analysis: The majority of lock files (88.4%) fall within the 200-400 KB range, indicating relatively consistent workflow complexity. Only one outlier (poem-bot) exceeds 400 KB, suggesting it has significantly more features or safe output configurations.
Trigger Analysis
Most Popular Triggers
Insights:
workflow_dispatch, enabling on-demand execution/review,/tidy), enabling conversational interactionCommon Trigger Combinations
Pattern: The most common pattern is
schedule + workflow_dispatch(54 workflows), enabling both automated periodic execution and manual on-demand runs - a best practice for operational flexibility.Trigger Distribution by Workflow Count
Finding: 61.1% of workflows use exactly 2 triggers, typically combining automation with manual control. Only 9 workflows have no triggers (likely reusable workflows or shared configurations).
Safe Outputs Analysis
Safe Output Types Distribution
Key Insights:
create-discussionis the most popular (34.7%), indicating a preference for threaded conversations over issuesSafe Output Combinations
Notable: The
poem-botworkflow stands out with 10 different safe output types, making it the most versatile workflow in the repository.Discussion Categories
For workflows using
create-discussion, the most popular categories are:Standardization Opportunity: There are two variants of "audits" (lowercase and capitalized), indicating a potential for category name standardization.
Structural Characteristics
Job Complexity
cloclo.lock.yml)Distribution: Most workflows have 10-15 jobs, with
cloclobeing an outlier at 21 jobs, suggesting it handles multiple complex tasks or has extensive orchestration requirements.Step Complexity
poem-bot.lock.yml)Insights:
poem-botwith 99 steps is the most step-intensive workflow, correlating with its 10 safe output typesAverage Lock File Structure
Based on statistical medians, a typical
.lock.ymlfile has:schedule + workflow_dispatch)Permission Patterns
Most Common Permissions
Key Findings:
contents,issues, andpull-requestspermissions - the essential trio for repository interactionactionspermission, likely for workflow metadata and run informationsecurity-eventspermission, indicating careful scoping to security-specific use casesSecurity Posture: Workflows follow the principle of least privilege, with most requesting only the 3-4 core permissions needed for their specific tasks.
Tool & MCP Server Patterns
Most Used MCP Servers
Insights:
githubMCP server appears 3,491 times across workflows, making it by far the most essential integrationplaywrightindicate significant browser automation for testing docs, scraping, or UI verificationarxivanddeepwikienable knowledge retrieval workflowscontext7suggests it's either new or experimentalTool Configuration Patterns
Based on workflow analysis:
Interesting Findings
1. The "Poem Bot" Powerhouse
poem-bot.lock.ymlis the most complex workflow by multiple metrics:This suggests it's a highly versatile, multi-purpose workflow capable of interacting with the repository in numerous ways.
2. Schedule + Manual: The Winning Combination
54 out of 95 workflows (56.8%) use the
schedule + workflow_dispatchtrigger combination, establishing this as the de facto standard pattern for agentic workflows. This enables:3. Discussion-Driven Reporting
With 33 workflows creating discussions (vs. 14 creating issues), there's a clear preference for discussion-based reporting. Discussions provide:
4. Minimal Variance in Structure
The low standard deviation in file sizes (70.51 KB) and consistent job counts (avg: 12.1, median: 12) indicate that workflows follow similar architectural patterns, likely due to the gh-aw framework's opinionated structure. This consistency benefits:
5. Security-Conscious Permission Model
Only 7 permission types are used across all workflows, with most workflows requesting just 3-4. This minimalism indicates:
6. Shared Configuration Pattern
9 workflows have 0 triggers, located in
shared/andtests/directories. These are reusable workflow components, promoting:7. Command-Driven Interactivity
14 workflows support
commandtriggers, enabling conversational interaction:/review- Code review workflows/tidy- Cleanup operations/plan- Planning workflowsThis pattern transforms workflows from passive automation to interactive assistants.
Historical Trends
This is the first comprehensive analysis - future runs will compare against this baseline.
Baseline Metrics for Future Comparison:
Recommendations
1. Standardize Discussion Categories
Consolidate category naming (e.g., "audits" vs "Audits") to improve organization and discoverability of reports.
2. Document the "Standard Workflow Pattern"
The
schedule + workflow_dispatchwith 12 jobs and ~60 steps represents the canonical workflow structure. Documenting this pattern would help new contributors understand best practices.3. Consider Workflow Size Monitoring
With files ranging from 80 KB to 460 KB, consider implementing size warnings when workflows exceed 400 KB, as this may indicate over-complexity or opportunity for refactoring.
4. Expand MCP Server Usage
Only 5 MCP servers are actively used. Evaluating additional integrations (e.g., Slack, Notion, Jira) could enhance workflow capabilities.
5. Leverage Shared Configurations
With strong structural consistency across workflows, extracting more common patterns into shared configurations could reduce duplication and maintenance burden.
6. Safe Output Combinations
20% of workflows have no safe outputs. Evaluate whether these are purely observational or if they could benefit from notification/reporting capabilities.
Methodology
/tmp/gh-aw/cache-memory/for script persistence and historical data tracking.github/workflows/**/*.lock.ymlfilesstatisticsmodule for mean, median, standard deviationAnalysis Scripts Available in Cache
The following scripts are available for future runs:
/tmp/gh-aw/cache-memory/scripts/analyze_lockfiles.sh- Bash-based lock file data extraction/tmp/gh-aw/cache-memory/scripts/analyze.py- Python YAML parser and data collector/tmp/gh-aw/cache-memory/scripts/compute_stats.py- Statistical analysis and metric computationHistorical Data
Baseline data for this analysis is stored at:
/tmp/gh-aw/cache-memory/history/lockfile-stats-2025-11-27.jsonFuture analyses can compare against this baseline to identify trends in workflow growth, complexity changes, and pattern evolution.
Generated by Lockfile Statistics Analysis Agent on 2025-11-27
Beta Was this translation helpful? Give feedback.
All reactions