Skip to content

Actions: microsoft/onnxruntime

ONNX Runtime WebGPU Builds

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
710 workflow runs
710 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[webgpu] Flash attention for generation
ONNX Runtime WebGPU Builds #712: Pull request #23808 synchronize by qjia7
April 1, 2025 07:29 1h 28m 57s attention_generate_fa
April 1, 2025 07:29 1h 28m 57s
Expose TRT preview features as EP option
ONNX Runtime WebGPU Builds #711: Pull request #24212 synchronize by toothache
April 1, 2025 07:24 1h 34m 28s toothache:preview_features
April 1, 2025 07:24 1h 34m 28s
[WIP] Support export of Llama with DynamicCache and transformers>=4.48
ONNX Runtime WebGPU Builds #710: Pull request #24231 synchronize by xadupre
April 1, 2025 07:21 1h 30m 8s xadupre:llama2
April 1, 2025 07:21 1h 30m 8s
Expose TRT preview features as EP option
ONNX Runtime WebGPU Builds #709: Pull request #24212 synchronize by toothache
April 1, 2025 06:59 24m 31s toothache:preview_features
April 1, 2025 06:59 24m 31s
[WIP] Support export of Llama with DynamicCache and transformers>=4.48
ONNX Runtime WebGPU Builds #708: Pull request #24231 synchronize by xadupre
April 1, 2025 06:49 31m 42s xadupre:llama2
April 1, 2025 06:49 31m 42s
[WIP][Native WebGPU] Add Conv, ConTranspose and FusedConv
ONNX Runtime WebGPU Builds #707: Pull request #24186 synchronize by satyajandhyala
April 1, 2025 06:42 1h 31m 42s sajandhy/webgpu-ep-add-conv
April 1, 2025 06:42 1h 31m 42s
[webgpu] optimize SkipLayerNormalization operator
ONNX Runtime WebGPU Builds #706: Pull request #24164 synchronize by xhcao
April 1, 2025 04:08 Action required xhcao:skip-norm-layer
April 1, 2025 04:08 Action required
[webgpu] Optimize MatMulNBits for f16 Block32 prefill performance
ONNX Runtime WebGPU Builds #705: Pull request #23908 synchronize by daijh
April 1, 2025 04:04 Action required daijh:matmul-f16-block32-prefill
April 1, 2025 04:04 Action required
MlasTranspose multi-threads support.
ONNX Runtime WebGPU Builds #704: Pull request #24261 synchronize by msy-kato
April 1, 2025 03:16 Action required msy-kato:feature-mlastranspose-multithread-v2
April 1, 2025 03:16 Action required
[WebGPU EP] fixes bugs in split implementation (#24259)
ONNX Runtime WebGPU Builds #703: Commit 5068ab9 pushed by prathikr
April 1, 2025 03:04 1h 30m 19s main
April 1, 2025 03:04 1h 30m 19s
[webgpu] fix the reflect mode issue of Pad
ONNX Runtime WebGPU Builds #702: Pull request #24202 synchronize by xhcao
April 1, 2025 02:20 Action required xhcao:fix-pad-reflect
April 1, 2025 02:20 Action required
Bump vite from 6.2.3 to 6.2.4 in /js/web/test/e2e/exports/testcases/v…
ONNX Runtime WebGPU Builds #701: Commit e227415 pushed by fs-eire
April 1, 2025 00:29 1h 57m 45s main
April 1, 2025 00:29 1h 57m 45s
Adding build-system to pyproject.toml
ONNX Runtime WebGPU Builds #700: Pull request #24216 synchronize by jchen351
April 1, 2025 00:22 1h 41m 55s Cjian/py12-st
April 1, 2025 00:22 1h 41m 55s
MlasTranspose multi-threads support.
ONNX Runtime WebGPU Builds #699: Pull request #24261 opened by msy-kato
April 1, 2025 00:01 1h 50m 29s msy-kato:feature-mlastranspose-multithread-v2
April 1, 2025 00:01 1h 50m 29s
Support Gemma3 with Clip fused attention
ONNX Runtime WebGPU Builds #698: Pull request #24187 synchronize by titaiwangms
March 31, 2025 23:54 1h 49m 22s titaiwangms:titaiwang/gemma3-vision
March 31, 2025 23:54 1h 49m 22s
Implement load cancellation ability
ONNX Runtime WebGPU Builds #697: Pull request #24257 synchronize by yuslepukhin
March 31, 2025 23:44 1h 39m 8s yuslepukhin/load_cancelletion
March 31, 2025 23:44 1h 39m 8s
[TEST] depthtospace
ONNX Runtime WebGPU Builds #696: Pull request #23929 synchronize by prathikr
March 31, 2025 23:27 1h 26m 21s prathikrao/depth-to-space-webgpu-ep-test
March 31, 2025 23:27 1h 26m 21s
[WIP][Native WebGPU] Add Conv, ConTranspose and FusedConv
ONNX Runtime WebGPU Builds #695: Pull request #24186 synchronize by satyajandhyala
March 31, 2025 23:03 1h 58m 21s sajandhy/webgpu-ep-add-conv
March 31, 2025 23:03 1h 58m 21s
Implement load cancellation ability
ONNX Runtime WebGPU Builds #694: Pull request #24257 synchronize by yuslepukhin
March 31, 2025 22:55 48m 56s yuslepukhin/load_cancelletion
March 31, 2025 22:55 48m 56s
[WebGPU EP] fixes bugs in split implementation
ONNX Runtime WebGPU Builds #693: Pull request #24259 synchronize by prathikr
March 31, 2025 22:20 2h 21m 12s prathikrao/split-webgpu-ep-bugfix
March 31, 2025 22:20 2h 21m 12s
Update xcode and iphoneSimulatorVersion after MacOS-14
ONNX Runtime WebGPU Builds #692: Pull request #24260 opened by jchen351
March 31, 2025 22:18 2h 4m 58s Cjian/xcode
March 31, 2025 22:18 2h 4m 58s
[WebGPU EP] fixes bugs in split implementation
ONNX Runtime WebGPU Builds #691: Pull request #24259 opened by prathikr
March 31, 2025 22:13 7m 32s prathikrao/split-webgpu-ep-bugfix
March 31, 2025 22:13 7m 32s
Add support for uint8_t as data type for GatherBlockQuantized
ONNX Runtime WebGPU Builds #690: Pull request #24239 synchronize by sushraja-msft
March 31, 2025 22:05 2h 6m 28s user/sushraja/gather_dequantize
March 31, 2025 22:05 2h 6m 28s
Enabling c++20 on linux
ONNX Runtime WebGPU Builds #689: Pull request #17816 synchronize by jchen351
March 31, 2025 22:04 1h 44m 38s Cjian/linux_c++20
March 31, 2025 22:04 1h 44m 38s
Exclude onnxruntime-inference-examples directory from Component Gover…
ONNX Runtime WebGPU Builds #688: Pull request #24258 opened by jchen351
March 31, 2025 21:58 1h 32m 6s Cjian/npm_next
March 31, 2025 21:58 1h 32m 6s