@anish-shanbhag
Contributor

Overview:

Optimizes the implementation of _match_workers by using a set, rather than a sequence, for num_gpu membership lookups.
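
To illustrate the general idea (not the actual diff), here is a minimal sketch of swapping a list membership test for a set membership test. The function names, the dict-based worker records, and the `allowed_num_gpus` parameter are all hypothetical and are not taken from the aiconfigurator code; only the `num_gpu` field name comes from the description above.

```python
from typing import Iterable


def match_workers_with_list(workers: list[dict], allowed_num_gpus: list[int]) -> list[dict]:
    """Hypothetical baseline: each `in` check scans the list, O(k) per worker."""
    return [w for w in workers if w["num_gpu"] in allowed_num_gpus]


def match_workers_with_set(workers: list[dict], allowed_num_gpus: Iterable[int]) -> list[dict]:
    """Hypothetical optimized version: build the set once, then each
    membership check is O(1) on average."""
    allowed = set(allowed_num_gpus)
    return [w for w in workers if w["num_gpu"] in allowed]


# Illustrative data only; these are not real aiconfigurator worker configs.
workers = [
    {"name": "a", "num_gpu": 1},
    {"name": "b", "num_gpu": 2},
    {"name": "c", "num_gpu": 4},
    {"name": "d", "num_gpu": 8},
]
allowed = [2, 8]

list_result = match_workers_with_list(workers, allowed)
set_result = match_workers_with_set(workers, allowed)
```

Both versions return the same workers in the same order; the set version only changes the cost of each lookup, which matters when the check runs many times inside a search loop.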

Details:

In my local testing, this reduces the runtime of `aiconfigurator cli default --model DEEPSEEK_V3 --total_gpus 32 --system h200_sxm` from 63.3s to 61.4s (about 3% faster).

Where should the reviewer start?

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

  • closes GitHub issue: #xxx

@copy-pr-bot

copy-pr-bot bot commented Nov 19, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@github-actions github-actions bot added the perf label Nov 19, 2025
@tianhaox tianhaox merged commit d8e7a0f into ai-dynamo:main Nov 20, 2025
4 checks passed
3 participants