
[Web] 1.20.0 breaks SkipSimplifiedLayerNormalization backwards compatibility. Missing Input: model.layers.0.input_layernorm.weight #22704

Closed
xenova opened this issue Nov 3, 2024 · 16 comments
Labels: platform:web (issues related to ONNX Runtime web; typically submitted using template), release:1.20.0


xenova commented Nov 3, 2024

Describe the issue

After upgrading to onnxruntime-node 1.20.0, I get the following error when running models that were previously exported (and working) with earlier versions of onnx/onnxruntime:

Non-zero status code returned while running SkipSimplifiedLayerNormalization node. Name:'/model/layers.0/post_attention_layernorm/SkipLayerNorm' Status Message: /onnxruntime_src/include/onnxruntime/core/framework/op_kernel_context.h:42 const T* onnxruntime::OpKernelContext::Input(int) const [with T = onnxruntime::Tensor] Missing Input: model.layers.0.input_layernorm.weight

To reproduce

Attempt to run one of the following models:
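For context, the failure can be reproduced with a minimal onnxruntime-node session pinned to the CPU execution provider. The model path and feed name below are placeholders (the issue's model list is not reproduced here), so treat this as a sketch rather than an exact repro:

```javascript
// Minimal repro sketch: run an ONNX model on the CPU execution provider,
// where the 1.20.0 [Skip][Simplified]LayerNormalization regression appears.
// 'model.onnx' and the feed name 'input_ids' are placeholders, not from the issue.
let ort = null;
try {
  ort = require('onnxruntime-node');
} catch {
  // onnxruntime-node not installed; leave ort as null so the sketch degrades gracefully
}

// Session options pinning the CPU execution provider.
function cpuSessionOptions() {
  return { executionProviders: ['cpu'] };
}

async function run(modelPath) {
  if (!ort) return null;
  const session = await ort.InferenceSession.create(modelPath, cpuSessionOptions());
  const inputIds = new ort.Tensor('int64', BigInt64Array.from([1n]), [1, 1]);
  // On 1.20.0 this throws: "Missing Input: model.layers.0.input_layernorm.weight"
  return session.run({ input_ids: inputIds });
}
```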

Urgency

Blocks upgrading transformers.js to use onnxruntime-node v1.20.0

ONNX Runtime Installation

Released Package

ONNX Runtime Version or Commit ID

1.20.0

Execution Provider

'wasm'/'cpu' (WebAssembly CPU)

xenova added the platform:web label on Nov 3, 2024

xenova commented Nov 3, 2024

It also breaks for WASM EP (WebGPU still works): https://jsfiddle.net/9v4fa3gw/


fs-eire commented Nov 4, 2024

@xenova Thank you for the issue report! I did some investigation and identified that the issue is in the CPU implementation of f16 [Skip][Simplified]LayerNormalization.

This issue is not web specific. All language bindings may run into this issue when using CPU/f16 with any of the 4 operators.

We will do the fix ASAP and will publish a dev build of onnxruntime-web once it's done. Will also work on a patch release to include the fix.
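For readers unfamiliar with the operator in question: SkipSimplifiedLayerNormalization adds a skip (residual) tensor to the input and then applies simplified layer normalization (RMS normalization, with no mean subtraction) scaled by gamma. A plain-JavaScript reference of that math on a single row, sketched from the ONNX contrib-op description rather than the actual ORT kernel (which is C++, and f16 in the buggy path):

```javascript
// Reference math for SkipSimplifiedLayerNormalization on one row:
// y = rmsnorm(input + skip) * gamma, where rmsnorm divides by
// sqrt(mean(x^2) + epsilon). Sketch only, not the real kernel.
function skipSimplifiedLayerNorm(input, skip, gamma, epsilon = 1e-6) {
  const sum = input.map((x, i) => x + skip[i]);
  const meanSq = sum.reduce((acc, x) => acc + x * x, 0) / sum.length;
  const invRms = 1 / Math.sqrt(meanSq + epsilon);
  return sum.map((x, i) => x * invRms * gamma[i]);
}
```

With gamma = [1, 1], input = [3, 0], and skip = [1, 0], the row sums to [4, 0], the mean square is 8, and the output is roughly [1.414, 0].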


xenova commented Nov 4, 2024

@fs-eire Amazing - thanks so much! 🥳 I'll upgrade the build when you're ready 👍 Will this include a dev version of onnxruntime-node too?


oyazdanb commented Nov 4, 2024

> @xenova Thank you for the issue report! I did some investigation and identified that the issue is in the CPU implementation of f16 [Skip][Simplified]LayerNormalization.
>
> This issue is not web specific. All language bindings may run into this issue when using CPU/f16 with any of the 4 operators.
>
> We will do the fix ASAP and will publish a dev build of onnxruntime-web once it's done. Will also work on a patch release to include the fix.

I see this issue in Onnxruntime-DirectML as well; does this fix help with ort-dml?


fs-eire commented Nov 4, 2024

Will investigate this issue. It looks like the problem is in the CPU EP.

> @fs-eire Amazing - thanks so much! 🥳 I'll upgrade the build when you're ready 👍 Will this include a dev version of onnxruntime-node too?

Currently the pipeline does not support this, but I can do a manual publish if necessary.


fs-eire commented Nov 4, 2024

>> @xenova Thank you for the issue report! I did some investigation and identified that the issue is in the CPU implementation of f16 [Skip][Simplified]LayerNormalization. This issue is not web specific. All language bindings may run into this issue when using CPU/f16 with any of the 4 operators. We will do the fix ASAP and will publish a dev build of onnxruntime-web once it's done. Will also work on a patch release to include the fix.
>
> I see this issue in Onnxruntime-DirectML as well; does this fix help with ort-dml?

I am not sure if the problem you saw is caused by exactly this. If it is, the fix should help.


xenova commented Nov 5, 2024

> Currently the pipeline does not support this but I can do a manual publish if necessary.

Yes please! 😇 Transformers.js v3.1.0 will include this fix.


fs-eire commented Nov 6, 2024

The fix is being worked on, and we want to make sure the change fixes the problem before it's merged.

@xenova could you please help verify whether the fix works? (replace the dist folder -> dist.zip)


xenova commented Nov 6, 2024

Here is the new error message I get:

failed to inference ONNX model: Error: failed to call OrtRun(). ERROR_CODE: 2, ERROR_MESSAGE: Non-zero status code returned while running SkipSimplifiedLayerNormalization node. Name:'/model/layers.0/post_attention_layernorm/SkipLayerNorm' Status Message: gamma is expected to have 1 dimension, got 0.


fs-eire commented Nov 8, 2024

Made some updates; this is the latest fix -> dist.zip


xenova commented Nov 10, 2024

Great! That fixed it @fs-eire 🥳 Please let me know when you put a dev build out 👍


xenova commented Nov 14, 2024

@fs-eire I see it was merged in f0ac5e0. Could you put out a dev build for onnxruntime-node? 😇

jywu-msft (Member) commented:

> @fs-eire I see it was merged in f0ac5e0. Could you put out a dev build for onnxruntime-node? 😇

+@guschmue


ulgens commented Nov 26, 2024

👀


fs-eire commented Dec 2, 2024

onnxruntime-node 1.20.1 is released and should include the fixes.


xenova commented Dec 2, 2024

I can confirm that fixes it. Thanks!

@xenova xenova closed this as completed Dec 2, 2024