🤖 Update Mock Benchmark Results by thomhurst · Pull Request #6160 · thomhurst/TUnit

thomhurst · 2026-06-05T03:30:11Z

Automated Mock Benchmark Update

This PR updates the mock benchmark documentation with the latest results from the Mock Benchmarks workflow.

Benchmarks Produced

Individual benchmark artifacts are available for download:

mock-benchmark-MockCreation
mock-benchmark-Setup
mock-benchmark-Invocation
mock-benchmark-Verification
mock-benchmark-Callback
mock-benchmark-CombinedWorkflow
mock-benchmark-summary (aggregated overview)

Libraries Compared

Libraries are discovered dynamically from the benchmark output. See the generated summary.json for the full list.

Changes

Updated mock benchmark data in docs/static/benchmarks/mocks/latest.json
Regenerated mock benchmark documentation in docs/docs/benchmarks/mocks/
Updated mock benchmark summary in docs/static/benchmarks/mocks/summary.json

Workflow Run

Run ID: 26993139016
Triggered: schedule

🤖 This PR was automatically created and will be merged automatically once CI checks pass.

claude

Automated Benchmark Update Review

This PR is a routine automated update to mock benchmark documentation — no production code changes. The automation mechanism looks correct: scheduled trigger, data regenerated from CI artifacts, markdown and JSON both updated consistently, timestamps advanced.

Data Quality Observation

A few benchmark values warrant attention, not as a blocker but as context for readers:

High error-to-mean ratios in Invocation benchmarks — some entries have error bars that are a large fraction of the mean. For example in the new data:

Imposter: Mean 241.29 ns, Error 186.342 ns (≈77% of mean)
NSubstitute: Mean 607.70 ns, Error 516.136 ns (≈85% of mean)

These wide error intervals make the absolute numbers less meaningful. This is a known challenge with micro-benchmarking on shared CI runners (CPU throttling, noisy neighbours), but readers comparing numbers day-to-day may draw incorrect conclusions.

Large Moq regressions in a single day — Moq's CombinedWorkflow result went from 312 μs to 406 μs (~30% slower), and Verification from 246 μs to 374 μs (~52% slower). No Moq code changed, so this is almost certainly CI noise. If these numbers end up in a public-facing comparison, they could unfairly misrepresent Moq's actual performance.

Suggestions (not blockers)

Consider adding a footnote or tooltip in the generated markdown for benchmarks where , flagging them as high-variance. This would prevent misleading day-to-day comparisons.
For the summary/overview page, using a rolling average (e.g. last 7 runs) rather than a single-run snapshot would smooth out CI noise and make the numbers more trustworthy.

Summary

The automation itself is well-structured and the PR is correct as generated. The above are suggestions for improving benchmark data quality over time, not issues with this specific update.

claude

Automated Benchmark Update Review

This PR is a routine automated update to mock benchmark documentation — no production code changes. The automation mechanism looks correct: scheduled trigger, data regenerated from CI artifacts, markdown and JSON both updated consistently, timestamps advanced.

Data Quality Observation

A few benchmark values warrant attention, not as a blocker but as context for readers:

High error-to-mean ratios in Invocation benchmarks — some entries have error bars that are a large fraction of the mean. For example in the new data:

Imposter: Mean 241.29 ns, Error 186.342 ns (≈77% of mean)
NSubstitute: Mean 607.70 ns, Error 516.136 ns (≈85% of mean)

These wide error intervals make the absolute numbers less meaningful. This is a known challenge with micro-benchmarking on shared CI runners (CPU throttling, noisy neighbours), but readers comparing numbers day-to-day may draw incorrect conclusions.

Large Moq regressions in a single day — Moq's CombinedWorkflow result went from 312 μs to 406 μs (~30% slower), and Verification from 246 μs to 374 μs (~52% slower). No Moq code changed, so this is almost certainly CI noise. If these numbers end up in a public-facing comparison, they could unfairly misrepresent Moq's actual performance.

Suggestions (not blockers)

Consider adding a footnote or tooltip in the generated markdown for benchmarks where Error / Mean > 20%, flagging them as high-variance. This would prevent misleading day-to-day comparisons.
For the summary/overview page, using a rolling average (e.g. last 7 runs) rather than a single-run snapshot would smooth out CI noise and make the numbers more trustworthy.

Summary

The automation itself is well-structured and the PR is correct as generated. The above are suggestions for improving benchmark data quality over time, not issues with this specific update.

codacy-production · 2026-06-05T03:32:24Z

Up to standards ✅

🟢 Issues 0 issues

Results:
0 new issues

View in Codacy

_{NEW Get contextual insights on your PRs based on Codacy's metrics, along with PR and Jira context, without leaving GitHub. Enable AI reviewer}
_{TIP This summary will be updated as you push new changes.}

chore: update mock benchmark results

b15b056

thomhurst added automated benchmarks documentation Improvements or additions to documentation ignore-for-release labels Jun 5, 2026

thomhurst had a problem deploying to Pull Requests June 5, 2026 03:30 — with GitHub Actions Failure

thomhurst deployed to Pull Requests June 5, 2026 03:30 — with GitHub Actions Active

thomhurst temporarily deployed to Pull Requests June 5, 2026 03:30 — with GitHub Actions Inactive

thomhurst merged commit 3c3ff4c into main Jun 5, 2026
8 of 10 checks passed

thomhurst deleted the automated-mock-benchmarks-update branch June 5, 2026 03:30

claude Bot reviewed Jun 5, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

🤖 Update Mock Benchmark Results#6160

🤖 Update Mock Benchmark Results#6160
thomhurst merged 1 commit into
mainfrom
automated-mock-benchmarks-update

thomhurst commented Jun 5, 2026

Uh oh!

Uh oh!

claude Bot left a comment

Uh oh!

claude Bot left a comment

Uh oh!

codacy-production Bot commented Jun 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

thomhurst commented Jun 5, 2026

Automated Mock Benchmark Update

Benchmarks Produced

Libraries Compared

Changes

Workflow Run

Uh oh!

Uh oh!

claude Bot left a comment

Choose a reason for hiding this comment

Automated Benchmark Update Review

Data Quality Observation

Suggestions (not blockers)

Summary

Uh oh!

claude Bot left a comment

Choose a reason for hiding this comment

Automated Benchmark Update Review

Data Quality Observation

Suggestions (not blockers)

Summary

Uh oh!

codacy-production Bot commented Jun 5, 2026

Up to standards ✅

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant