Add e2e test for workflow failing when step exhausts max retries #436

Copilot · 2025-11-26T13:46:50Z

Adds e2e test coverage to verify that workflows properly enter a failed state when a step exhausts its max retries.

Background

The original issue reported that workflows remained in 'running' state indefinitely when a step failed after max retries. After investigation, the existing codebase behavior was found to be correct:

1. When a step exhausts max retries, the step is marked as failed with fatal: true in the event
2. The workflow is queued for replay
3. On replay, the error bubbles up to the workflow function
   - If the workflow catches the error → workflow continues
   - If the workflow doesn't catch the error → error bubbles to workflowEntrypoint → workflow run fails through normal error handling
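
The sequence described above can be sketched in a few lines. This is a simplified, synchronous model, not the library's implementation: `runStep`, `runWorkflow`, `StepFailedError`, and the error wording are hypothetical names chosen for illustration, and treating "max retries" as three total attempts is an assumption.

```typescript
// Hypothetical model of retry exhaustion; none of these names come from the codebase.
const MAX_ATTEMPTS = 3; // assumption: the discussion mentions a default of 3

class StepFailedError extends Error {
  readonly fatal = true; // mirrors the "fatal: true" flag on the step event
}

type RunResult = { status: 'completed' | 'failed'; error?: string };

// Run a step until it succeeds or the attempt budget is spent.
function runStep<T>(step: () => T, maxAttempts = MAX_ATTEMPTS): T {
  let lastError: unknown;
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    try {
      return step();
    } catch (err) {
      lastError = err; // remember the failure and try again
    }
  }
  const reason = lastError instanceof Error ? lastError.message : String(lastError);
  throw new StepFailedError(`Step failed after max retries: ${reason}`);
}

// Stand-in for the entrypoint: an uncaught error marks the run as failed.
function runWorkflow(workflow: () => unknown): RunResult {
  try {
    workflow();
    return { status: 'completed' };
  } catch (err) {
    return { status: 'failed', error: err instanceof Error ? err.message : String(err) };
  }
}

const alwaysThrows = (): never => {
  throw new Error('boom');
};

// Workflow does not catch the error, so the run fails.
const unhandled = runWorkflow(() => runStep(alwaysThrows));

// Workflow catches the error, so the run continues and completes.
const handled = runWorkflow(() => {
  try {
    runStep(alwaysThrows);
  } catch {
    // recover and carry on
  }
});

console.log(unhandled.status, handled.status); // prints "failed completed"
```

The point of the sketch is only that the failure decision lives in one place: the step layer throws, and the run fails solely because nothing in the workflow function caught the error.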

Changes Made

Added e2e test coverage to verify this behavior:

New workflow (stepExhaustsRetriesWorkflow): Contains a step that always throws a regular error, triggering retry exhaustion
New test: Verifies that:
- The workflow receives a WorkflowRunFailedError
- The run status is failed
- The error message contains "failed after max retries"

This confirms the existing behavior is correct - when a step exhausts retries, the error bubbles up to the workflow function, and if unhandled, the workflow run properly enters a failed state.
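
As a rough illustration of the three checks listed above, the helper below collects any violated expectation. It is a sketch under assumptions: the `WorkflowRunFailedError` class here is a local stand-in (the real class and its shape are not shown in this PR), and `checkFailedRun` and `RunRecord` are hypothetical names.

```typescript
// Local stand-in only; the real WorkflowRunFailedError's shape is assumed.
class WorkflowRunFailedError extends Error {}

interface RunRecord {
  status: string;
}

// Returns the list of violated expectations; an empty list means all checks pass.
function checkFailedRun(error: unknown, run: RunRecord): string[] {
  const problems: string[] = [];
  if (!(error instanceof WorkflowRunFailedError)) {
    problems.push('expected a WorkflowRunFailedError');
  }
  if (run.status !== 'failed') {
    problems.push(`expected status "failed", got "${run.status}"`);
  }
  const message = error instanceof Error ? error.message : '';
  if (message.indexOf('failed after max retries') === -1) {
    problems.push('expected message to contain "failed after max retries"');
  }
  return problems;
}

// A run that failed the expected way produces no problems.
const ok = checkFailedRun(
  new WorkflowRunFailedError('Step failed after max retries'),
  { status: 'failed' },
);

// A run stuck in "running" with a plain error violates all three checks.
const stuck = checkFailedRun(new Error('timeout'), { status: 'running' });
```

In the actual e2e test these checks would be written as assertions in the test framework; the helper form is used here only to keep the example self-contained.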

changeset-bot · 2025-11-26T13:46:55Z

⚠️ No Changeset found

Latest commit: 2964d3b

Merging this PR will not cause a version bump for any packages. If these changes should not result in a new version, you're good to go. If these changes should result in a version bump, you need to add a changeset.

This PR includes no changesets

When changesets are added to this PR, you'll see the packages that this PR includes changesets for and the associated semver types

vercel · 2025-11-26T13:46:59Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Preview	Comments	Updated (UTC)
example-nextjs-workflow-turbopack	Ready	Preview	Comment	Nov 26, 2025 5:10pm
example-nextjs-workflow-webpack	Ready	Preview	Comment	Nov 26, 2025 5:10pm
example-workflow	Ready	Preview	Comment	Nov 26, 2025 5:10pm
workbench-express-workflow	Ready	Preview	Comment	Nov 26, 2025 5:10pm
workbench-fastify-workflow	Error			Nov 26, 2025 5:10pm
workbench-hono-workflow	Ready	Preview	Comment	Nov 26, 2025 5:10pm
workbench-nitro-workflow	Ready	Preview	Comment	Nov 26, 2025 5:10pm
workbench-nuxt-workflow	Ready	Preview	Comment	Nov 26, 2025 5:10pm
workbench-sveltekit-workflow	Ready	Preview	Comment	Nov 26, 2025 5:10pm
workbench-vite-workflow	Ready	Preview	Comment	Nov 26, 2025 5:10pm
workflow-docs	Ready	Preview	Comment	Nov 26, 2025 5:10pm

When a step reaches max retries or encounters a FatalError, the workflow run is now properly updated to 'failed' status instead of staying in 'running' state. This prevents the workflow from being re-queued and hitting the same failed step indefinitely. Added test case for stepMaxRetriesWorkflow to verify the fix.

Co-authored-by: pranaygp

vercel · 2025-11-26T14:01:03Z

packages/core/e2e/e2e.test.ts

+      const run = await triggerWorkflow('stepMaxRetriesWorkflow', []);
+
+      // Wait for the workflow to complete (should fail)
+      await waitForRun(run.runId);


The test calls an undefined function, waitForRun, which will cause a ReferenceError at runtime.

📝 Patch Details

diff --git a/packages/core/e2e/e2e.test.ts b/packages/core/e2e/e2e.test.ts
index 41d3d65..79f240b 100644
--- a/packages/core/e2e/e2e.test.ts
+++ b/packages/core/e2e/e2e.test.ts
@@ -76,6 +76,25 @@ async function getWorkflowReturnValue(runId: string) {
   }
 }
 
+async function waitForRun(runId: string) {
+  // Poll the workflow run status until it completes (is no longer in 202 status)
+  // This is useful for tests that don't care about the return value, just that the workflow finishes
+  while (true) {
+    const url = new URL('/api/trigger', deploymentUrl);
+    url.searchParams.set('runId', runId);
+
+    const res = await fetch(url);
+
+    if (res.status === 202) {
+      // Workflow run is still running, so we need to wait and poll again
+      await new Promise((resolve) => setTimeout(resolve, 5_000));
+      continue;
+    }
+    // Workflow has completed, we're done
+    break;
+  }
+}
+
 // NOTE: Temporarily disabling concurrent tests to avoid flakiness.
 // TODO: Re-enable concurrent tests after conf when we have more time to investigate.
 describe('e2e', () => {

Analysis

Undefined function waitForRun in stepMaxRetriesWorkflow test

What fails: The stepMaxRetriesWorkflow test in packages/core/e2e/e2e.test.ts calls the undefined function waitForRun(run.runId) on line 789 (line 808 after the fix), causing a ReferenceError at runtime: waitForRun is not defined

How to reproduce:

cd packages/core
npm test e2e.test.ts -- stepMaxRetriesWorkflow

Result: Test fails with ReferenceError: waitForRun is not defined because the function is called but never defined in the test file

Expected: The test should define a waitForRun helper function that polls the workflow API endpoint until the workflow reaches a terminal state (completed, failed, cancelled, etc.), following the same pattern as the existing getWorkflowReturnValue function already defined in the file

Solution: Added waitForRun function that polls /api/trigger endpoint with the runId parameter until the workflow completes (status is no longer 202), matching the established polling pattern used by getWorkflowReturnValue in the same file
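
One possible refinement, not part of the patch: the proposed loop polls forever if the run never leaves the 202 state, so a test could hang until the runner's own timeout. A minimal sketch of a bounded variant follows. The function name, option names, and defaults are assumptions, and the status lookup is injected as a callback so the example does not depend on `deploymentUrl` or the `/api/trigger` route.

```typescript
// Hypothetical bounded variant of waitForRun; names and defaults are illustrative.
async function waitForRunWithTimeout(
  getStatus: () => Promise<number>,
  { intervalMs = 5_000, timeoutMs = 120_000 } = {},
): Promise<number> {
  const deadline = Date.now() + timeoutMs;
  while (true) {
    const status = await getStatus();
    // 202 means the run is still in progress; anything else is treated as finished.
    if (status !== 202) return status;
    // Give up if the next poll would land past the deadline.
    if (Date.now() + intervalMs > deadline) {
      throw new Error(`Run still pending after ${timeoutMs}ms`);
    }
    await new Promise((resolve) => setTimeout(resolve, intervalMs));
  }
}

// Possible usage inside the e2e file (assumes `url` is built as in the patch):
//   await waitForRunWithTimeout(async () => (await fetch(url)).status);
```

Injecting `getStatus` also makes the helper easy to exercise with a fake that returns 202 a few times before a final status.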

The existing code already handles this correctly - when a step fails after max retries, the workflow properly fails. Added explicit test coverage to ensure this behavior is maintained. The workflow flow is:

1. Step fails 3x, gets marked as failed with fatal event
2. Workflow is queued and replays
3. During replay, step throws FatalError
4. If not caught, error bubbles to workflowEntrypoint
5. Workflow run is marked as failed

This test ensures local world correctly handles step retry exhaustion.

Co-authored-by: pranaygp

TooTallNate · 2025-11-26T17:03:05Z

@copilot So there's no code changes required at all? How about we at least add an e2e test to verify the "failed" run status.

Added a test workflow (stepExhaustsRetriesWorkflow) that has a step which always throws a regular error. This verifies that when a step exhausts its max retries (default 3), the error bubbles up to the workflow function, and if unhandled, the workflow run enters a failed state with the appropriate error message.

Co-authored-by: TooTallNate

Copilot · 2025-11-26T17:08:47Z

@copilot So there's no code changes required at all? How about we at least add an e2e test to verify the "failed" run status.

Added e2e test in 2964d3b. The test stepExhaustsRetriesWorkflow verifies that when a step exhausts max retries, the error bubbles up and the workflow run enters a failed state with the appropriate error message.

Initial plan

e357d65

Copilot AI assigned Copilot and pranaygp Nov 26, 2025

vercel bot had a problem deploying to Preview – workbench-fastify-workflow November 26, 2025 13:47 Failure

Copilot started work on behalf of pranaygp November 26, 2025 13:47 View session

vercel bot had a problem deploying to Preview – workbench-fastify-workflow November 26, 2025 13:55 Failure

vercel bot reviewed Nov 26, 2025

Copilot AI changed the title ~~Fix workflow stuck in 'running' state when step exhausts max retries~~ Revert: Fix workflow stuck in 'running' state when step exhausts max retries Nov 26, 2025

Copilot AI requested a review from TooTallNate November 26, 2025 15:49

Copilot finished work on behalf of TooTallNate November 26, 2025 15:49

Copilot started work on behalf of TooTallNate November 26, 2025 17:03 View session

vercel bot had a problem deploying to Preview – workbench-fastify-workflow November 26, 2025 17:08 Failure

Copilot AI changed the title ~~Revert: Fix workflow stuck in 'running' state when step exhausts max retries~~ Add e2e test for workflow failing when step exhausts max retries Nov 26, 2025

Copilot finished work on behalf of TooTallNate November 26, 2025 17:10
