Skip to content

Actions: microsoft/onnxruntime

windows_x64_release_xnnpack

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
891 workflow runs
891 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[webgpu] Flash attention for generation
windows_x64_release_xnnpack #893: Pull request #23808 synchronize by qjia7
April 1, 2025 07:29 44m 57s attention_generate_fa
April 1, 2025 07:29 44m 57s
Expose TRT preview features as EP option
windows_x64_release_xnnpack #892: Pull request #24212 synchronize by toothache
April 1, 2025 07:24 36m 46s toothache:preview_features
April 1, 2025 07:24 36m 46s
[WIP] Support export of Llama with DynamicCache and transformers>=4.48
windows_x64_release_xnnpack #891: Pull request #24231 synchronize by xadupre
April 1, 2025 07:21 32m 16s xadupre:llama2
April 1, 2025 07:21 32m 16s
Expose TRT preview features as EP option
windows_x64_release_xnnpack #890: Pull request #24212 synchronize by toothache
April 1, 2025 06:59 24m 31s toothache:preview_features
April 1, 2025 06:59 24m 31s
[WIP] Support export of Llama with DynamicCache and transformers>=4.48
windows_x64_release_xnnpack #889: Pull request #24231 synchronize by xadupre
April 1, 2025 06:49 31m 40s xadupre:llama2
April 1, 2025 06:49 31m 40s
[WIP][Native WebGPU] Add Conv, ConTranspose and FusedConv
windows_x64_release_xnnpack #888: Pull request #24186 synchronize by satyajandhyala
April 1, 2025 06:42 33m 26s sajandhy/webgpu-ep-add-conv
April 1, 2025 06:42 33m 26s
[webgpu] optimize SkipLayerNormalization operator
windows_x64_release_xnnpack #887: Pull request #24164 synchronize by xhcao
April 1, 2025 04:08 Action required xhcao:skip-norm-layer
April 1, 2025 04:08 Action required
[webgpu] Optimize MatMulNBits for f16 Block32 prefill performance
windows_x64_release_xnnpack #886: Pull request #23908 synchronize by daijh
April 1, 2025 04:04 Action required daijh:matmul-f16-block32-prefill
April 1, 2025 04:04 Action required
MlasTranspose multi-threads support.
windows_x64_release_xnnpack #885: Pull request #24261 synchronize by msy-kato
April 1, 2025 03:16 Action required msy-kato:feature-mlastranspose-multithread-v2
April 1, 2025 03:16 Action required
[WebGPU EP] fixes bugs in split implementation (#24259)
windows_x64_release_xnnpack #884: Commit 5068ab9 pushed by prathikr
April 1, 2025 03:04 33m 58s main
April 1, 2025 03:04 33m 58s
[webgpu] fix the reflect mode issue of Pad
windows_x64_release_xnnpack #883: Pull request #24202 synchronize by xhcao
April 1, 2025 02:20 Action required xhcao:fix-pad-reflect
April 1, 2025 02:20 Action required
Bump vite from 6.2.3 to 6.2.4 in /js/web/test/e2e/exports/testcases/v…
windows_x64_release_xnnpack #882: Commit e227415 pushed by fs-eire
April 1, 2025 00:29 28m 55s main
April 1, 2025 00:29 28m 55s
Adding build-system to pyproject.toml
windows_x64_release_xnnpack #881: Pull request #24216 synchronize by jchen351
April 1, 2025 00:22 48m 27s Cjian/py12-st
April 1, 2025 00:22 48m 27s
Support Gemma3 with Clip fused attention
windows_x64_release_xnnpack #879: Pull request #24187 synchronize by titaiwangms
March 31, 2025 23:54 45m 32s titaiwangms:titaiwang/gemma3-vision
March 31, 2025 23:54 45m 32s
Implement load cancellation ability
windows_x64_release_xnnpack #878: Pull request #24257 synchronize by yuslepukhin
March 31, 2025 23:44 36m 44s yuslepukhin/load_cancelletion
March 31, 2025 23:44 36m 44s
[TEST] depthtospace
windows_x64_release_xnnpack #877: Pull request #23929 synchronize by prathikr
March 31, 2025 23:27 30m 34s prathikrao/depth-to-space-webgpu-ep-test
March 31, 2025 23:27 30m 34s
[WIP][Native WebGPU] Add Conv, ConTranspose and FusedConv
windows_x64_release_xnnpack #876: Pull request #24186 synchronize by satyajandhyala
March 31, 2025 23:03 37m 56s sajandhy/webgpu-ep-add-conv
March 31, 2025 23:03 37m 56s
Implement load cancellation ability
windows_x64_release_xnnpack #875: Pull request #24257 synchronize by yuslepukhin
March 31, 2025 22:55 31m 46s yuslepukhin/load_cancelletion
March 31, 2025 22:55 31m 46s
[WebGPU EP] fixes bugs in split implementation
windows_x64_release_xnnpack #874: Pull request #24259 synchronize by prathikr
March 31, 2025 22:20 33m 48s prathikrao/split-webgpu-ep-bugfix
March 31, 2025 22:20 33m 48s
Update xcode and iphoneSimulatorVersion after MacOS-14
windows_x64_release_xnnpack #873: Pull request #24260 opened by jchen351
March 31, 2025 22:18 41m 12s Cjian/xcode
March 31, 2025 22:18 41m 12s
[WebGPU EP] fixes bugs in split implementation
windows_x64_release_xnnpack #872: Pull request #24259 opened by prathikr
March 31, 2025 22:13 7m 38s prathikrao/split-webgpu-ep-bugfix
March 31, 2025 22:13 7m 38s
Add support for uint8_t as data type for GatherBlockQuantized
windows_x64_release_xnnpack #871: Pull request #24239 synchronize by sushraja-msft
March 31, 2025 22:05 44m 50s user/sushraja/gather_dequantize
March 31, 2025 22:05 44m 50s
Enabling c++20 on linux
windows_x64_release_xnnpack #870: Pull request #17816 synchronize by jchen351
March 31, 2025 22:04 34m 11s Cjian/linux_c++20
March 31, 2025 22:04 34m 11s
Exclude onnxruntime-inference-examples directory from Component Gover…
windows_x64_release_xnnpack #869: Pull request #24258 opened by jchen351
March 31, 2025 21:58 29m 40s Cjian/npm_next
March 31, 2025 21:58 29m 40s