Skip to content

Actions: deepspeedai/DeepSpeed

nv-torch-latest-v100

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
4,365 workflow runs
4,365 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Reland perf fix for nan inf check
nv-torch-latest-v100 #13879: Pull request #7184 synchronize by nelyahu
March 30, 2025 08:05 In progress nelyahu:reland_perf_fix_for_nan_inf_check
March 30, 2025 08:05 In progress
Reland perf fix for nan inf check
nv-torch-latest-v100 #13878: Pull request #7184 opened by nelyahu
March 30, 2025 08:04 Action required nelyahu:reland_perf_fix_for_nan_inf_check
March 30, 2025 08:04 Action required
nv-torch-latest-v100
nv-torch-latest-v100 #13877: Scheduled
March 30, 2025 00:24 6h 0m 21s master
March 30, 2025 00:24 6h 0m 21s
nv-torch-latest-v100
nv-torch-latest-v100 #13876: Merge group checks requested
March 29, 2025 00:37 1h 20m 28s
March 29, 2025 00:37 1h 20m 28s
nv-torch-latest-v100
nv-torch-latest-v100 #13875: Scheduled
March 29, 2025 00:21 6h 0m 20s master
March 29, 2025 00:21 6h 0m 20s
nv-torch-latest-v100
nv-torch-latest-v100 #13874: Merge group checks requested
March 28, 2025 22:48 6h 0m 19s
March 28, 2025 22:48 6h 0m 19s
nv-torch-latest-v100
nv-torch-latest-v100 #13873: Merge group checks requested
March 28, 2025 21:10 6h 1m 26s
March 28, 2025 21:10 6h 1m 26s
nv-torch-latest-v100
nv-torch-latest-v100 #13872: Merge group checks requested
March 28, 2025 21:02 6h 0m 21s
March 28, 2025 21:02 6h 0m 21s
nv-torch-latest-v100
nv-torch-latest-v100 #13871: Merge group checks requested
March 28, 2025 21:02 1h 18m 33s
March 28, 2025 21:02 1h 18m 33s
Add DataStates-LLM: Asynchronous Checkpointing Engine Support
nv-torch-latest-v100 #13870: Pull request #7166 synchronize by mauryaavinash95
March 28, 2025 20:30 Action required DataStates:dev
March 28, 2025 20:30 Action required
Add DataStates-LLM: Asynchronous Checkpointing Engine Support
nv-torch-latest-v100 #13869: Pull request #7166 synchronize by mauryaavinash95
March 28, 2025 20:22 Action required DataStates:dev
March 28, 2025 20:22 Action required
Add DataStates-LLM: Asynchronous Checkpointing Engine Support
nv-torch-latest-v100 #13868: Pull request #7166 synchronize by mauryaavinash95
March 28, 2025 20:13 Action required DataStates:dev
March 28, 2025 20:13 Action required
Add pyproject.toml with legacy build backend to keep most logic in setup.py
nv-torch-latest-v100 #13867: Pull request #7033 synchronize by loadams
March 28, 2025 18:12 1h 14m 25s loadams/pyproject-toml
March 28, 2025 18:12 1h 14m 25s
nv-torch-latest-v100
nv-torch-latest-v100 #13866: Merge group checks requested
March 28, 2025 18:03 2h 58m 20s
March 28, 2025 18:03 2h 58m 20s
nv-torch-latest-v100
nv-torch-latest-v100 #13865: Merge group checks requested
March 28, 2025 17:59 3h 2m 32s
March 28, 2025 17:59 3h 2m 32s
nv-torch-latest-v100
nv-torch-latest-v100 #13864: Manually run by loadams
March 28, 2025 17:56 7m 21s loadams/update-torch-27
March 28, 2025 17:56 7m 21s
Add pyproject.toml with legacy build backend to keep most logic in setup.py
nv-torch-latest-v100 #13863: Pull request #7033 synchronize by loadams
March 28, 2025 17:55 17m 2s loadams/pyproject-toml
March 28, 2025 17:55 17m 2s
nv-torch-latest-v100
nv-torch-latest-v100 #13862: Merge group checks requested
March 28, 2025 16:30 4h 32m 3s
March 28, 2025 16:30 4h 32m 3s
Use transformers latest on v100 tests
nv-torch-latest-v100 #13861: Pull request #7088 synchronize by loadams
March 28, 2025 16:00 6h 0m 21s loadams/unpin-transformers-latest
March 28, 2025 16:00 6h 0m 21s
nv-torch-latest-v100
nv-torch-latest-v100 #13860: Merge group checks requested
March 28, 2025 15:03 56m 43s
March 28, 2025 15:03 56m 43s
Fix pre-compile on cpu-only machines
nv-torch-latest-v100 #13859: Pull request #7168 synchronize by AlongWY
March 28, 2025 06:56 Action required AlongWY:patch-1
March 28, 2025 06:56 Action required
[bugfix] update results of state_dict loading, embedding resizing to secondary partitions (hpz)
nv-torch-latest-v100 #13858: Pull request #7130 synchronize by cyr0930
March 28, 2025 06:38 Action required cyr0930:bug/2nd_part
March 28, 2025 06:38 Action required
[XPU] Support XCCL on deepspeed side
nv-torch-latest-v100 #13857: Pull request #7113 synchronize by hwchen2017
March 28, 2025 06:01 6h 0m 24s ys950902:sy/xccl_enable
March 28, 2025 06:01 6h 0m 24s
gather output layout support for column parallel
nv-torch-latest-v100 #13856: Pull request #7181 synchronize by hwchen2017
March 28, 2025 05:56 6h 0m 21s inkcherry:gather_output
March 28, 2025 05:56 6h 0m 21s
async tp allreduce
nv-torch-latest-v100 #13855: Pull request #7115 synchronize by hwchen2017
March 28, 2025 05:45 1h 10m 55s inkcherry:async_tp
March 28, 2025 05:45 1h 10m 55s