feat: add /speckit.qa command — systematic QA testing #2041

arunt14 wants to merge 2 commits into github:main
Conversation
Add a new core command for systematic quality assurance testing with two modes:
- Browser QA: Playwright/Puppeteer-driven UI testing with screenshots
- CLI QA: Test runner execution, API validation, command-line verification

Validates acceptance criteria from spec.md against actual application behavior. Generates QA report with pass/fail evidence and coverage metrics. Inspired by GStack's /qa and /browse commands.
Pull request overview
Adds a new core /speckit.qa command template plus a QA report template to support systematic, evidence-based QA validation of a feature’s acceptance criteria (browser-driven and/or CLI-driven), fitting into the Spec Kit command workflow as a post-implementation quality gate.
Changes:
- Added `templates/commands/qa.md` to define the `/speckit.qa` workflow, including before/after extension hook support.
- Added `templates/qa-template.md` to standardize the generated QA report format.
Reviewed changes
Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.
| File | Description |
|---|---|
| templates/commands/qa.md | New command template that outlines QA context loading, scenario extraction, browser/CLI execution, reporting, and extension hooks. |
| templates/qa-template.md | New markdown template for QA reports (results table, AC coverage matrix, suite summary, failure details, environment, metrics). |
templates/commands/qa.md
Outdated
> - Flag any pre-existing test failures vs. new failures from implementation changes
>
> 9. **Generate QA Report**:
>    Create the QA report at `FEATURE_DIR/qa/qa-{timestamp}.md` using the QA report template. The report must include:
Step 9 says to generate the report “using the QA report template” but doesn’t reference the canonical template file path. Other commands explicitly point to templates/*-template.md (e.g., checklist/tasks), which makes outputs consistent and avoids drift. Please reference templates/qa-template.md (or .specify/templates/qa-template.md depending on runtime path conventions) and specify that the report should follow it verbatim.
Suggested change:
- Before: Create the QA report at `FEATURE_DIR/qa/qa-{timestamp}.md` using the QA report template. The report must include:
- After: Create the QA report at `FEATURE_DIR/qa/qa-{timestamp}.md` using the canonical QA report template at `templates/qa-template.md`, following it verbatim (do not add, remove, or reorder sections). The report must include:
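As a hypothetical illustration of the output path convention the comment refers to, a minimal sketch (the helper name and timestamp format are assumptions, not taken from the PR):

```python
from datetime import datetime, timezone
from pathlib import Path

def qa_report_path(feature_dir: str) -> Path:
    """Build the qa-{timestamp}.md path under FEATURE_DIR/qa/."""
    ts = datetime.now(timezone.utc).strftime("%Y%m%d-%H%M%S")
    return Path(feature_dir) / "qa" / f"qa-{ts}.md"

# Illustrative feature directory name
path = qa_report_path("specs/001-demo")
```

Resolving the template path itself would follow the same convention-based lookup the other commands use.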
templates/commands/qa.md
Outdated
> 10. **Provide QA Verdict**:
>     Based on results, provide one of:
>     - ✅ **QA PASSED**: All critical scenarios pass, no blockers. Safe to proceed to `/speckit.ship`
>     - ⚠️ **QA PASSED WITH NOTES**: Critical paths pass but some edge cases or non-critical scenarios failed. List items.
>     - ❌ **QA FAILED**: Critical user flows or acceptance criteria are not met. Must fix and re-test.
The verdict terminology is inconsistent: the report template sections use "✅ ALL PASSED / ⚠️ PARTIAL PASS / ❌ FAILURES FOUND" while the verdict step uses "QA PASSED / QA PASSED WITH NOTES / QA FAILED". Please standardize on one set of terms across the command and the report template.
> ---
> description: Run systematic QA testing against the implemented feature, validating acceptance criteria through browser-driven or CLI-based testing.
> scripts:
>   sh: scripts/bash/check-prerequisites.sh --json --require-tasks --include-tasks
>   ps: scripts/powershell/check-prerequisites.ps1 -Json -RequireTasks -IncludeTasks
Adding templates/commands/qa.md makes qa a core command name when discovered from the templates directory, but the hard-coded fallback list in src/specify_cli/extensions.py (_FALLBACK_CORE_COMMAND_NAMES) doesn’t include qa. In environments where command discovery isn’t available, extension validation could incorrectly allow an extension namespace/id of qa. Please add qa to the fallback set to keep core/extension conflict checks consistent.
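To make the suggested fix concrete, a minimal sketch of the fallback check — the set's actual contents in `src/specify_cli/extensions.py` are an assumption here; only the addition of `"qa"` is what the comment asks for:

```python
# Hypothetical contents of the fallback set; the real module lists its own names.
_FALLBACK_CORE_COMMAND_NAMES = frozenset({
    "specify", "plan", "tasks", "implement", "analyze", "checklist",
    "qa",  # added so conflict checks still reject a "qa" extension namespace
})

def conflicts_with_core(name: str) -> bool:
    """Reject extension namespaces/ids that shadow a core command."""
    return name in _FALLBACK_CORE_COMMAND_NAMES
```

With `"qa"` present, validation rejects a conflicting extension id even when command discovery from the templates directory is unavailable.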
Please deliver this as an extension. See https://github.com/github/spec-kit/tree/main/extensions
- Reference templates/qa-template.md explicitly in step 9
- Standardize verdict terminology (ALL PASSED/PARTIAL PASS/FAILURES FOUND)
- Add 'qa' to _FALLBACK_CORE_COMMAND_NAMES in extensions.py
Thanks for the review! I've addressed all 3 comments in the latest push:
Please review https://github.com/github/spec-kit/blob/main/extensions/EXTENSION-PUBLISHING-GUIDE.md for publishing a community hosted extension.
Thanks @mnriem for the guidance! I've reviewed the Extension Publishing Guide and understand the direction — these should be delivered as community extensions rather than core commands. I'll restructure these as proper extensions following the extension.yml manifest pattern and the standard extension directory structure. Will close this PR and resubmit as an extension contribution to catalog.community.json.
Closing in favor of PR #2049 — restructured as a community extension per maintainer feedback. Extension repo and v1.0.0 release are available. |
Description
Problem
Spec-Kit's `/speckit.implement` command follows a TDD approach during development, but there is no dedicated QA testing step after implementation is complete. Unit tests written during TDD verify individual components, but they don't validate the end-to-end user experience against the acceptance criteria defined in `spec.md`. There's no way to answer: "Does the built application actually do what the spec says it should?"

This gap is especially problematic for web applications, where visual rendering, navigation flows, and form interactions can break even when unit tests pass.
Industry context: GStack's `/qa` and `/browse` commands demonstrate that persistent browser automation for visual QA — combined with acceptance criteria validation — catches an entire class of issues that unit tests miss.

Solution
Adds `/speckit.qa` — a new core command for systematic quality assurance testing that validates acceptance criteria from `spec.md` against actual application behavior. Supports two modes:

Browser QA Mode (web applications):
CLI QA Mode (non-web applications):
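As a rough sketch of what a single CLI QA check could look like — the helper name and report shape here are illustrative assumptions, not the command's actual implementation:

```python
import subprocess
import sys

def run_cli_check(cmd, expect_substring):
    """Run a command; pass if exit code is 0 and stdout contains the expected text."""
    result = subprocess.run(cmd, capture_output=True, text=True)
    passed = result.returncode == 0 and expect_substring in result.stdout
    return {"cmd": " ".join(cmd), "passed": passed, "exit_code": result.returncode}

# Example: verify the interpreter reports its version string
check = run_cli_check([sys.executable, "--version"], "Python")
```

Each check's dict would become one row of evidence in the generated QA report.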
Output: QA report with pass/fail evidence, acceptance criteria coverage matrix, and verdict (QA PASSED / QA PASSED WITH NOTES / QA FAILED).
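The three-way verdict above can be sketched as a simple mapping over scenario results (the field names are assumptions for illustration):

```python
def qa_verdict(results):
    """results: iterable of {'critical': bool, 'passed': bool} scenario outcomes."""
    if any(r["critical"] and not r["passed"] for r in results):
        return "QA FAILED"
    if any(not r["passed"] for r in results):
        return "QA PASSED WITH NOTES"
    return "QA PASSED"
```

Critical failures dominate: a single failed critical scenario yields QA FAILED regardless of how many non-critical checks pass.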
Business Value
Workflow Position
Files Added
- `templates/commands/qa.md` (with `before_qa`, `after_qa` extension hooks)
- `templates/qa-template.md`

Testing
- `uv run specify --help`
- `uv sync && uv run pytest` — 878/972 pass (44 failures are pre-existing, unrelated to this change)
- `specify init`

AI Disclosure
Drafted with AI assistance, modeled on the existing `implement.md` and `analyze.md`. All content was reviewed and validated against the project's template conventions.