Skip to content

Actions: microsoft/onnxruntime

ONNX Runtime DirectML Builds

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
705 workflow runs
705 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[webgpu] Flash attention for generation
ONNX Runtime DirectML Builds #707: Pull request #23808 synchronize by qjia7
April 1, 2025 07:29 1h 13m 53s attention_generate_fa
April 1, 2025 07:29 1h 13m 53s
Expose TRT preview features as EP option
ONNX Runtime DirectML Builds #706: Pull request #24212 synchronize by toothache
April 1, 2025 07:24 1h 23m 37s toothache:preview_features
April 1, 2025 07:24 1h 23m 37s
[WIP] Support export of Llama with DynamicCache and transformers>=4.48
ONNX Runtime DirectML Builds #705: Pull request #24231 synchronize by xadupre
April 1, 2025 07:21 1h 23m 34s xadupre:llama2
April 1, 2025 07:21 1h 23m 34s
Expose TRT preview features as EP option
ONNX Runtime DirectML Builds #704: Pull request #24212 synchronize by toothache
April 1, 2025 06:59 24m 30s toothache:preview_features
April 1, 2025 06:59 24m 30s
[WIP] Support export of Llama with DynamicCache and transformers>=4.48
ONNX Runtime DirectML Builds #703: Pull request #24231 synchronize by xadupre
April 1, 2025 06:49 31m 43s xadupre:llama2
April 1, 2025 06:49 31m 43s
[WIP][Native WebGPU] Add Conv, ConTranspose and FusedConv
ONNX Runtime DirectML Builds #702: Pull request #24186 synchronize by satyajandhyala
April 1, 2025 06:42 1h 12m 53s sajandhy/webgpu-ep-add-conv
April 1, 2025 06:42 1h 12m 53s
[webgpu] optimize SkipLayerNormalization operator
ONNX Runtime DirectML Builds #701: Pull request #24164 synchronize by xhcao
April 1, 2025 04:08 Action required xhcao:skip-norm-layer
April 1, 2025 04:08 Action required
[webgpu] Optimize MatMulNBits for f16 Block32 prefill performance
ONNX Runtime DirectML Builds #700: Pull request #23908 synchronize by daijh
April 1, 2025 04:04 Action required daijh:matmul-f16-block32-prefill
April 1, 2025 04:04 Action required
MlasTranspose multi-threads support.
ONNX Runtime DirectML Builds #699: Pull request #24261 synchronize by msy-kato
April 1, 2025 03:16 Action required msy-kato:feature-mlastranspose-multithread-v2
April 1, 2025 03:16 Action required
[WebGPU EP] fixes bugs in split implementation (#24259)
ONNX Runtime DirectML Builds #698: Commit 5068ab9 pushed by prathikr
April 1, 2025 03:04 1h 14m 0s main
April 1, 2025 03:04 1h 14m 0s
[webgpu] fix the reflect mode issue of Pad
ONNX Runtime DirectML Builds #697: Pull request #24202 synchronize by xhcao
April 1, 2025 02:20 Action required xhcao:fix-pad-reflect
April 1, 2025 02:20 Action required
Bump vite from 6.2.3 to 6.2.4 in /js/web/test/e2e/exports/testcases/v…
ONNX Runtime DirectML Builds #696: Commit e227415 pushed by fs-eire
April 1, 2025 00:29 1h 51m 14s main
April 1, 2025 00:29 1h 51m 14s
Adding build-system to pyproject.toml
ONNX Runtime DirectML Builds #695: Pull request #24216 synchronize by jchen351
April 1, 2025 00:22 1h 38m 2s Cjian/py12-st
April 1, 2025 00:22 1h 38m 2s
MlasTranspose multi-threads support.
ONNX Runtime DirectML Builds #694: Pull request #24261 opened by msy-kato
April 1, 2025 00:01 1h 40m 41s msy-kato:feature-mlastranspose-multithread-v2
April 1, 2025 00:01 1h 40m 41s
Support Gemma3 with Clip fused attention
ONNX Runtime DirectML Builds #693: Pull request #24187 synchronize by titaiwangms
March 31, 2025 23:54 1h 31m 52s titaiwangms:titaiwang/gemma3-vision
March 31, 2025 23:54 1h 31m 52s
Implement load cancellation ability
ONNX Runtime DirectML Builds #692: Pull request #24257 synchronize by yuslepukhin
March 31, 2025 23:44 1h 27m 56s yuslepukhin/load_cancelletion
March 31, 2025 23:44 1h 27m 56s
[TEST] depthtospace
ONNX Runtime DirectML Builds #691: Pull request #23929 synchronize by prathikr
March 31, 2025 23:27 1h 37m 19s prathikrao/depth-to-space-webgpu-ep-test
March 31, 2025 23:27 1h 37m 19s
[WIP][Native WebGPU] Add Conv, ConTranspose and FusedConv
ONNX Runtime DirectML Builds #690: Pull request #24186 synchronize by satyajandhyala
March 31, 2025 23:03 1h 53m 55s sajandhy/webgpu-ep-add-conv
March 31, 2025 23:03 1h 53m 55s
Implement load cancellation ability
ONNX Runtime DirectML Builds #689: Pull request #24257 synchronize by yuslepukhin
March 31, 2025 22:55 48m 57s yuslepukhin/load_cancelletion
March 31, 2025 22:55 48m 57s
[WebGPU EP] fixes bugs in split implementation
ONNX Runtime DirectML Builds #688: Pull request #24259 synchronize by prathikr
March 31, 2025 22:20 1h 55m 46s prathikrao/split-webgpu-ep-bugfix
March 31, 2025 22:20 1h 55m 46s
Update xcode and iphoneSimulatorVersion after MacOS-14
ONNX Runtime DirectML Builds #687: Pull request #24260 opened by jchen351
March 31, 2025 22:18 1h 58m 35s Cjian/xcode
March 31, 2025 22:18 1h 58m 35s
[WebGPU EP] fixes bugs in split implementation
ONNX Runtime DirectML Builds #686: Pull request #24259 opened by prathikr
March 31, 2025 22:13 7m 31s prathikrao/split-webgpu-ep-bugfix
March 31, 2025 22:13 7m 31s
Add support for uint8_t as data type for GatherBlockQuantized
ONNX Runtime DirectML Builds #685: Pull request #24239 synchronize by sushraja-msft
March 31, 2025 22:05 2h 0m 41s user/sushraja/gather_dequantize
March 31, 2025 22:05 2h 0m 41s
Enabling c++20 on linux
ONNX Runtime DirectML Builds #684: Pull request #17816 synchronize by jchen351
March 31, 2025 22:04 1h 22m 54s Cjian/linux_c++20
March 31, 2025 22:04 1h 22m 54s
Exclude onnxruntime-inference-examples directory from Component Gover…
ONNX Runtime DirectML Builds #683: Pull request #24258 opened by jchen351
March 31, 2025 21:58 1h 15m 32s Cjian/npm_next
March 31, 2025 21:58 1h 15m 32s