Actions: deepspeedai/DeepSpeed

hpu-gaudi2

1,653 workflow runs

Reland perf fix for nan inf check
hpu-gaudi2 #1990: Pull request #7184 opened by nelyahu
March 30, 2025 08:04 Action required nelyahu:reland_perf_fix_for_nan_inf_check
hpu-gaudi2
hpu-gaudi2 #1989: Scheduled
March 30, 2025 00:13 2h 17m 58s master
hpu-gaudi2
hpu-gaudi2 #1988: Scheduled
March 29, 2025 00:11 6h 50m 59s master
Add DataStates-LLM: Asynchronous Checkpointing Engine Support
hpu-gaudi2 #1987: Pull request #7166 synchronize by mauryaavinash95
March 28, 2025 20:30 -1s DataStates:dev
Add DataStates-LLM: Asynchronous Checkpointing Engine Support
hpu-gaudi2 #1986: Pull request #7166 synchronize by mauryaavinash95
March 28, 2025 20:22 Action required DataStates:dev
Add DataStates-LLM: Asynchronous Checkpointing Engine Support
hpu-gaudi2 #1985: Pull request #7166 synchronize by mauryaavinash95
March 28, 2025 20:13 Action required DataStates:dev
[bugfix] update results of state_dict loading, embedding resizing to secondary partitions (hpz)
hpu-gaudi2 #1984: Pull request #7130 synchronize by cyr0930
March 28, 2025 06:38 Action required cyr0930:bug/2nd_part
gather output layout support for column parallel
hpu-gaudi2 #1983: Pull request #7181 synchronize by hwchen2017
March 28, 2025 05:56 1h 42m 48s inkcherry:gather_output
async tp allreduce
hpu-gaudi2 #1982: Pull request #7115 synchronize by hwchen2017
March 28, 2025 05:45 57m 0s inkcherry:async_tp
gather output layout support for column parallel
hpu-gaudi2 #1981: Pull request #7181 synchronize by inkcherry
March 28, 2025 03:21 Action required inkcherry:gather_output
gather output layout support for column parallel
hpu-gaudi2 #1980: Pull request #7181 synchronize by inkcherry
March 28, 2025 03:18 Action required inkcherry:gather_output
gather output layout support for column parallel
hpu-gaudi2 #1979: Pull request #7181 opened by inkcherry
March 28, 2025 03:18 Action required inkcherry:gather_output
hpu-gaudi2
hpu-gaudi2 #1977: Scheduled
March 28, 2025 00:11 2h 18m 5s master
async tp allreduce
hpu-gaudi2 #1976: Pull request #7115 synchronize by hwchen2017
March 27, 2025 21:14 57m 43s inkcherry:async_tp
DeepCompile for enhanced compiler integration
hpu-gaudi2 #1975: Pull request #7154 synchronize by tohtana
March 27, 2025 18:52 1h 55m 32s tohtana/deepcompile
Enable ZeRO set/get APIs for NVMe offload
hpu-gaudi2 #1974: Pull request #7046 synchronize by loadams
March 27, 2025 16:55 2h 55m 23s olruwase/update_nvme_offload_states
DeepCompile for enhanced compiler integration
hpu-gaudi2 #1973: Pull request #7154 synchronize by loadams
March 27, 2025 16:16 2h 36m 21s tohtana/deepcompile
Fix issue #5242 grad_norm and loss is nan
hpu-gaudi2 #1971: Pull request #7171 synchronize by hwchen2017
March 27, 2025 04:02 5h 53m 26s Glaceon-Hyy:fix_grad_norm
Fix issue #5242 grad_norm and loss is nan
hpu-gaudi2 #1969: Pull request #7171 synchronize by Glaceon-Hyy
March 27, 2025 03:24 Action required Glaceon-Hyy:fix_grad_norm
Variable batch size and LR scheduler
hpu-gaudi2 #1968: Pull request #7104 synchronize by bm-synth
March 27, 2025 01:58 4h 58m 11s bm-synth:variable_batch_size_and_lr_2
hpu-gaudi2
hpu-gaudi2 #1964: Scheduled
March 27, 2025 00:11 7h 50m 31s master
DeepCompile for enhanced compiler integration
hpu-gaudi2 #1963: Pull request #7154 synchronize by tohtana
March 27, 2025 00:06 6h 57m 47s tohtana/deepcompile
DeepCompile for enhanced compiler integration
hpu-gaudi2 #1962: Pull request #7154 synchronize by tohtana
March 26, 2025 23:59 7m 23s tohtana/deepcompile