Skip to content

Actions: deepspeedai/DeepSpeed

nv-accelerate-v100

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
4,340 workflow runs
4,340 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Reland perf fix for nan inf check
nv-accelerate-v100 #13724: Pull request #7184 synchronize by nelyahu
March 30, 2025 08:05 8m 15s nelyahu:reland_perf_fix_for_nan_inf_check
March 30, 2025 08:05 8m 15s
Reland perf fix for nan inf check
nv-accelerate-v100 #13723: Pull request #7184 opened by nelyahu
March 30, 2025 08:04 Action required nelyahu:reland_perf_fix_for_nan_inf_check
March 30, 2025 08:04 Action required
nv-accelerate-v100
nv-accelerate-v100 #13722: Scheduled
March 30, 2025 00:08 8m 19s master
March 30, 2025 00:08 8m 19s
nv-accelerate-v100
nv-accelerate-v100 #13721: Merge group checks requested
March 29, 2025 00:37 9m 33s
March 29, 2025 00:37 9m 33s
nv-accelerate-v100
nv-accelerate-v100 #13720: Scheduled
March 29, 2025 00:07 8m 17s master
March 29, 2025 00:07 8m 17s
nv-accelerate-v100
nv-accelerate-v100 #13719: Merge group checks requested
March 28, 2025 22:48 8m 20s
March 28, 2025 22:48 8m 20s
nv-accelerate-v100
nv-accelerate-v100 #13718: Merge group checks requested
March 28, 2025 21:10 9m 16s
March 28, 2025 21:10 9m 16s
nv-accelerate-v100
nv-accelerate-v100 #13717: Merge group checks requested
March 28, 2025 21:02 8m 19s
March 28, 2025 21:02 8m 19s
nv-accelerate-v100
nv-accelerate-v100 #13716: Merge group checks requested
March 28, 2025 21:02 8m 18s
March 28, 2025 21:02 8m 18s
Add DataStates-LLM: Asynchronous Checkpointing Engine Support
nv-accelerate-v100 #13715: Pull request #7166 synchronize by mauryaavinash95
March 28, 2025 20:30 Action required DataStates:dev
March 28, 2025 20:30 Action required
Add DataStates-LLM: Asynchronous Checkpointing Engine Support
nv-accelerate-v100 #13714: Pull request #7166 synchronize by mauryaavinash95
March 28, 2025 20:22 Action required DataStates:dev
March 28, 2025 20:22 Action required
Add DataStates-LLM: Asynchronous Checkpointing Engine Support
nv-accelerate-v100 #13713: Pull request #7166 synchronize by mauryaavinash95
March 28, 2025 20:13 Action required DataStates:dev
March 28, 2025 20:13 Action required
Add pyproject.toml with legacy build backend to keep most logic in setup.py
nv-accelerate-v100 #13712: Pull request #7033 synchronize by loadams
March 28, 2025 18:12 23m 17s loadams/pyproject-toml
March 28, 2025 18:12 23m 17s
nv-accelerate-v100
nv-accelerate-v100 #13711: Merge group checks requested
March 28, 2025 18:03 16m 34s
March 28, 2025 18:03 16m 34s
nv-accelerate-v100
nv-accelerate-v100 #13710: Merge group checks requested
March 28, 2025 17:59 12m 48s
March 28, 2025 17:59 12m 48s
Add pyproject.toml with legacy build backend to keep most logic in setup.py
nv-accelerate-v100 #13709: Pull request #7033 synchronize by loadams
March 28, 2025 17:55 9m 9s loadams/pyproject-toml
March 28, 2025 17:55 9m 9s
nv-accelerate-v100
nv-accelerate-v100 #13708: Merge group checks requested
March 28, 2025 16:30 8m 3s
March 28, 2025 16:30 8m 3s
Use transformers latest on v100 tests
nv-accelerate-v100 #13707: Pull request #7088 synchronize by loadams
March 28, 2025 16:00 8m 7s loadams/unpin-transformers-latest
March 28, 2025 16:00 8m 7s
nv-accelerate-v100
nv-accelerate-v100 #13706: Merge group checks requested
March 28, 2025 15:03 8m 16s
March 28, 2025 15:03 8m 16s
Fix pre-compile on cpu-only machines
nv-accelerate-v100 #13705: Pull request #7168 synchronize by AlongWY
March 28, 2025 06:56 Action required AlongWY:patch-1
March 28, 2025 06:56 Action required
[bugfix] update results of state_dict loading, embedding resizing to secondary partitions (hpz)
nv-accelerate-v100 #13704: Pull request #7130 synchronize by cyr0930
March 28, 2025 06:38 Action required cyr0930:bug/2nd_part
March 28, 2025 06:38 Action required
[XPU] Support XCCL on deepspeed side
nv-accelerate-v100 #13703: Pull request #7113 synchronize by hwchen2017
March 28, 2025 06:01 8m 17s ys950902:sy/xccl_enable
March 28, 2025 06:01 8m 17s
gather output layout support for column parallel
nv-accelerate-v100 #13702: Pull request #7181 synchronize by hwchen2017
March 28, 2025 05:56 8m 14s inkcherry:gather_output
March 28, 2025 05:56 8m 14s
async tp allreduce
nv-accelerate-v100 #13701: Pull request #7115 synchronize by hwchen2017
March 28, 2025 05:45 8m 14s inkcherry:async_tp
March 28, 2025 05:45 8m 14s
fixed: Modified the topkgating function and modified the test_moe file for testing
nv-accelerate-v100 #13700: Pull request #7163 synchronize by xiongjyu
March 28, 2025 04:14 Action required xiongjyu:master
March 28, 2025 04:14 Action required