Pull requests: microsoft/onnxruntime
Pull requests list
[WIP] Stable Diffusion 3.x and Flux Optimization
#22986
opened Dec 2, 2024 by
tianleiwu
3 of 4 tasks
[js/node] fix CUDA artifact installation script for Linux/x64
#22984
opened Dec 2, 2024 by
fs-eire
flatten webgpu implementation
ep:WebGPU
ort-web webgpu provider
#22964
opened Nov 27, 2024 by
prathikr
[WebNN] Update usage of MLTensor to align with latest spec
#22959
opened Nov 27, 2024 by
Honry
free staging buffer early
ep:WebGPU
ort-web webgpu provider
#22943
opened Nov 26, 2024 by
guschmue
[Test only] BFloat16 test for SkipSimplifiedLayerNormalization
#22941
opened Nov 25, 2024 by
jiafatom
[WebNN] Improve the util function of creating WebNN constant MLOperand
#22935
opened Nov 25, 2024 by
Honry
Implementation of flash attention for native webgpu ep
#22932
opened Nov 24, 2024 by
sushraja-msft
3 tasks done
Bump onnx from 1.16.1 to 1.17.0 in /onnxruntime/python/tools/transformers/models/phi2
dependencies
Pull requests that update a dependency file
python
Pull requests that update Python code
#22928
opened Nov 22, 2024 by
dependabot
bot
[TensorRT EP] Use TRT/CUDA/ORT version from runtime instead of build time to generate hash value
#22921
opened Nov 21, 2024 by
chilo-ms
[js/webgpu] support FlashAttention-2 for attention operator
ep:WebGPU
ort-web webgpu provider
#22915
opened Nov 21, 2024 by
xhcao
[QNN EP] [DRAFT] Support Conv float weight/bias.
#22906
opened Nov 20, 2024 by
adrianlizarraga
Draft