Actions: deepspeedai/DeepSpeed

nv-lightning-v100

4,330 workflow runs

Reland perf fix for nan inf check
nv-lightning-v100 #14981: Pull request #7184 synchronize by nelyahu
March 30, 2025 08:05 4m 32s nelyahu:reland_perf_fix_for_nan_inf_check
Reland perf fix for nan inf check
nv-lightning-v100 #14980: Pull request #7184 opened by nelyahu
March 30, 2025 08:04 Action required nelyahu:reland_perf_fix_for_nan_inf_check
nv-lightning-v100
nv-lightning-v100 #14979: Scheduled
March 30, 2025 00:25 4m 26s master
nv-lightning-v100
nv-lightning-v100 #14978: Merge group checks requested
March 29, 2025 00:37 8m 11s
nv-lightning-v100
nv-lightning-v100 #14977: Scheduled
March 29, 2025 00:22 4m 31s master
nv-lightning-v100
nv-lightning-v100 #14976: Merge group checks requested
March 28, 2025 22:48 4m 26s
nv-lightning-v100
nv-lightning-v100 #14975: Merge group checks requested
March 28, 2025 21:10 13m 43s
nv-lightning-v100
nv-lightning-v100 #14974: Merge group checks requested
March 28, 2025 21:02 15m 39s
nv-lightning-v100
nv-lightning-v100 #14973: Merge group checks requested
March 28, 2025 21:02 11m 11s
Add DataStates-LLM: Asynchronous Checkpointing Engine Support
nv-lightning-v100 #14972: Pull request #7166 synchronize by mauryaavinash95
March 28, 2025 20:30 Action required DataStates:dev
Add DataStates-LLM: Asynchronous Checkpointing Engine Support
nv-lightning-v100 #14971: Pull request #7166 synchronize by mauryaavinash95
March 28, 2025 20:22 Action required DataStates:dev
Add DataStates-LLM: Asynchronous Checkpointing Engine Support
nv-lightning-v100 #14970: Pull request #7166 synchronize by mauryaavinash95
March 28, 2025 20:13 Action required DataStates:dev
Add pyproject.toml with legacy build backend to keep most logic in setup.py
nv-lightning-v100 #14969: Pull request #7033 synchronize by loadams
March 28, 2025 18:12 12m 21s loadams/pyproject-toml
nv-lightning-v100
nv-lightning-v100 #14968: Merge group checks requested
March 28, 2025 18:03 21m 59s
nv-lightning-v100
nv-lightning-v100 #14967: Merge group checks requested
March 28, 2025 17:59 7m 34s
Add pyproject.toml with legacy build backend to keep most logic in setup.py
nv-lightning-v100 #14966: Pull request #7033 synchronize by loadams
March 28, 2025 17:55 5m 34s loadams/pyproject-toml
nv-lightning-v100
nv-lightning-v100 #14965: Merge group checks requested
March 28, 2025 16:30 4m 27s
Use transformers latest on v100 tests
nv-lightning-v100 #14964: Pull request #7088 synchronize by loadams
March 28, 2025 16:00 4m 32s loadams/unpin-transformers-latest
nv-lightning-v100
nv-lightning-v100 #14963: Merge group checks requested
March 28, 2025 15:03 4m 26s
Fix pre-compile on cpu-only machines
nv-lightning-v100 #14962: Pull request #7168 synchronize by AlongWY
March 28, 2025 06:56 Action required AlongWY:patch-1
[bugfix] update results of state_dict loading, embedding resizing to secondary partitions (hpz)
nv-lightning-v100 #14961: Pull request #7130 synchronize by cyr0930
March 28, 2025 06:38 Action required cyr0930:bug/2nd_part
[XPU] Support XCCL on deepspeed side
nv-lightning-v100 #14960: Pull request #7113 synchronize by hwchen2017
March 28, 2025 06:01 4m 33s ys950902:sy/xccl_enable
gather output layout support for column parallel
nv-lightning-v100 #14959: Pull request #7181 synchronize by hwchen2017
March 28, 2025 05:56 4m 29s inkcherry:gather_output
async tp allreduce
nv-lightning-v100 #14958: Pull request #7115 synchronize by hwchen2017
March 28, 2025 05:45 4m 29s inkcherry:async_tp
fixed: Modified the topkgating function and modified the test_moe file for testing
nv-lightning-v100 #14957: Pull request #7163 synchronize by xiongjyu
March 28, 2025 04:14 Action required xiongjyu:master