Centralized runtime control of Edge Worker concurrency in distributed deployments by dheerajturaga · Pull Request #62896 · apache/airflow

dheerajturaga · 2026-03-04T22:22:58Z

In distributed Airflow deployments using EdgeExecutor, edge workers run
at remote sites that are often unreachable from the central control
plane — behind firewalls, in air-gapped networks, or on edge computing
nodes. Prior to this change, the worker's concurrency (the number of
tasks it runs in parallel) could only be set at startup via the -c CLI
flag. Adjusting it required direct shell access to the remote machine
and a full worker restart, causing job execution downtime and defeating
the operational value of the edge architecture.

This contribution solves the problem entirely by introducing a
server-driven concurrency control mechanism that works within the
existing security and connectivity constraints of edge deployments.
Rather than requiring a new communication channel, it repurposes the
worker's existing heartbeat protocol — the only bidirectional link
between the central Airflow API server and the remote worker — to
deliver concurrency updates. An administrator runs:

airflow edge set-worker-concurrency --edge-hostname <worker> --concurrency <n>

This writes the desired concurrency to the central database. On the
worker's next heartbeat, the API server
returns the new value in its response, and the worker adopts it
immediately — no restart, no remote access, no downtime.

The design follows the established pattern for queue management
introduced in this provider, but extends it to a runtime-mutable
execution parameter for the first time. It required coordinated changes
across five architectural layers: the database schema (new migration),
the SQLAlchemy model, the FastAPI worker API response contract, the
async worker heartbeat loop, and the CLI command surface. The solution
preserves the core architectural guarantee of the edge executor — that
workers never hold a direct database connection — while making a
previously static configuration parameter dynamically controllable from
the central site.

This capability was made possible by the database schema versioning foundation
introduced in #61155

Was generative AI tooling used to co-author this PR?

Yes (please specify the tool below)
ClaudeCode

In distributed Airflow deployments using EdgeExecutor, edge workers run at remote sites that are often unreachable from the central control plane — behind firewalls, in air-gapped networks, or on edge computing nodes. Prior to this change, the worker's concurrency (the number of tasks it runs in parallel) could only be set at startup via the -c CLI flag. Adjusting it required direct shell access to the remote machine and a full worker restart, causing job execution downtime and defeating the operational value of the edge architecture. This contribution solves the problem entirely by introducing a server-driven concurrency control mechanism that works within the existing security and connectivity constraints of edge deployments. Rather than requiring a new communication channel, it repurposes the worker's existing heartbeat protocol — the only bidirectional link between the central Airflow API server and the remote worker — to deliver concurrency updates. An administrator runs: airflow edge set-worker-concurrency --edge-hostname <worker> --concurrency <n> This writes the desired concurrency to the central database. On the worker's next heartbeat (typically within seconds), the API server returns the new value in its response, and the worker adopts it immediately — no restart, no remote access, no downtime. The design follows the established pattern for queue management introduced in this provider, but extends it to a runtime-mutable execution parameter for the first time. It required coordinated changes across five architectural layers: the database schema (new migration), the SQLAlchemy model, the FastAPI worker API response contract, the async worker heartbeat loop, and the CLI command surface. The solution preserves the core architectural guarantee of the edge executor — that workers never hold a direct database connection — while making a previously static configuration parameter dynamically controllable from the central site This capability was made possible by DB schema versioning introduced in apache#61155

jscheffl

Cool improvement! Actually had the exact same demand today with our deployment.

One remark, one question.

And one additional: Would you also extend the web UI in a follow-up that similar like queues the concurrency can be adjusted in UI?

providers/edge3/src/airflow/providers/edge3/models/db.py

providers/edge3/src/airflow/providers/edge3/worker_api/datamodels.py

dheerajturaga · 2026-03-05T21:43:41Z

Cool improvement! Actually had the exact same demand today with our deployment.

One remark, one question.

And one additional: Would you also extend the web UI in a follow-up that similar like queues the concurrency can be adjusted in UI?

Yes! UI and docs will be updated in a follow up once this is merged!
I had this ask from my userbase for a while now but I needed #61155 first to allow for this :)

…stalls When edge3 tables existed before alembic tracking was introduced, airflow db migrate would stamp directly to head without applying incremental migrations, leaving the schema out of sync (e.g. missing the concurrency column added in 3.2.0).

jscheffl · 2026-03-06T17:16:27Z

Cool! Then once the CI errors are fixed I can make a second pass review and then LGTM!

dheerajturaga · 2026-03-06T21:03:00Z

Cool! Then once the CI errors are fixed I can make a second pass review and then LGTM!

@jscheffl done!

dheerajturaga requested a review from jscheffl as a code owner March 4, 2026 22:22

boring-cyborg bot added area:providers provider:edge Edge Executor / Worker (AIP-69) / edge3 labels Mar 4, 2026

dheerajturaga changed the title ~~Centralized runtime control of Edge Worker concurrency in distributed Airflow deployments~~ Centralized runtime control of Edge Worker concurrency in distributed deployments Mar 4, 2026

jscheffl reviewed Mar 5, 2026

View reviewed changes

providers/edge3/src/airflow/providers/edge3/models/db.py Outdated Show resolved Hide resolved

providers/edge3/src/airflow/providers/edge3/worker_api/datamodels.py Show resolved Hide resolved

dheerajturaga added 5 commits March 5, 2026 17:03

Fix unit test

0c470b1

update migration files to 3.2.0

fbf6136

Not sure why prek got rid of this

90f734d

Fix tests

2a89a31

Fix more tests

7d2d3b5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Centralized runtime control of Edge Worker concurrency in distributed deployments#62896

Centralized runtime control of Edge Worker concurrency in distributed deployments#62896
dheerajturaga wants to merge 7 commits intoapache:mainfrom
dheerajturaga:feat/edge-cli-concurrency

dheerajturaga commented Mar 4, 2026 •

edited

Loading

Uh oh!

jscheffl left a comment

Uh oh!

Uh oh!

Uh oh!

dheerajturaga commented Mar 5, 2026 •

edited

Loading

Uh oh!

jscheffl commented Mar 6, 2026

Uh oh!

dheerajturaga commented Mar 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

dheerajturaga commented Mar 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Was generative AI tooling used to co-author this PR?

Uh oh!

jscheffl left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

dheerajturaga commented Mar 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jscheffl commented Mar 6, 2026

Uh oh!

dheerajturaga commented Mar 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

dheerajturaga commented Mar 4, 2026 •

edited

Loading

dheerajturaga commented Mar 5, 2026 •

edited

Loading