
tf.aliasing support #1026

Open

steeve opened this issue Nov 4, 2024 · 4 comments
Labels
bug Something isn't working

Comments

@steeve

steeve commented Nov 4, 2024

Hi,

We (@zml) found that tf.aliasing support doesn't seem to work as expected: with the attributes enabled, the model produces garbage output, in our case Llama 3.1 8B.
This is problematic for transformer models because we rely on buffer donation for the KvCache (see the sketch at the end of this comment).

For now we're not emitting those attributes when targeting Neuron, but we're not sure that's the right fix: if the SDK doesn't support these attributes, shouldn't it just ignore them?

The llama implementation is attached.

Thank you!

llama.aliasing.mlir.txt
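
For context, here's a minimal JAX sketch of the donation pattern we mean (a hypothetical example, not our actual code: zml is Zig and emits the MLIR attributes directly). Donating the KvCache lets the runtime reuse the input buffer for the updated cache instead of allocating a fresh copy on every decoding step:

```python
import jax
import jax.numpy as jnp
from functools import partial

# Donate the cache argument so the output may alias its buffer.
@partial(jax.jit, donate_argnums=(0,))
def append_to_cache(kv_cache, new_kv, pos):
    # Write new_kv at position `pos` along the sequence axis.
    return jax.lax.dynamic_update_index_in_dim(kv_cache, new_kv, pos, 0)

cache = jnp.zeros((128, 8, 64), jnp.float32)  # (seq_len, heads, head_dim), toy sizes
new_kv = jnp.ones((8, 64), jnp.float32)
cache = append_to_cache(cache, new_kv, 3)
```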

@nalwayaakshay

Can you try using jax.buffer_donor rather than tf.aliasing_output to annotate donated buffers?

For example:
%arg2: tensor<...> {jax.buffer_donor = true, mhlo.layout_mode = "default", mhlo.sharding = "{devices=[1,1,32,1]<=[32]}"} loc("state.kv_cache[0]['cached_key']")

From your .txt file:
%arg291: tensor<256xi32> {mhlo.layout_mode = "default", mhlo.sharding = "{replicated}", tf.aliasing_output = 0 : i32}
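
As a hedged sketch (plain JAX, independent of zml), you can check which attribute the lowering emits for a donated argument by dumping the StableHLO text of a jitted function:

```python
import jax
import jax.numpy as jnp

# Toy function: the first argument is donated, the second is not.
def f(cache, x):
    return cache + x, x * 2.0

lowered = jax.jit(f, donate_argnums=(0,)).lower(
    jnp.zeros((4, 4), jnp.float32), jnp.ones((4, 4), jnp.float32))

# The donation-related argument attributes (jax.buffer_donor / tf.aliasing_output)
# appear on the entry function's arguments in the textual StableHLO.
print(lowered.as_text())
```

Comparing that output with your attached MLIR should show which of the two attributes the Neuron compiler is actually seeing.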

@steeve
Author

steeve commented Nov 8, 2024

TIL jax.buffer_donor. Unfortunately, it doesn't work either.
That being said, while the output is wrong, the tok/s doesn't change when donation is enabled, which is weird.

@aws-taylor added the bug label Nov 8, 2024
@devesr-amzn
Contributor

Can you provide steps to reproduce the issue, along with the versions of the dependencies in use (neuronx-cc, libneuronxla)?

@steeve
Author

steeve commented Nov 12, 2024

Packages:

neuronx-cc==2.15.141.0+d3cfc8ca
libneuronxla==2.0.4986.0

Check out this branch: https://github.com/zml/zml/tree/steeve/synapse

Run the Llama example on Neuron:

$ cd zml/examples
$ ./bazel.sh run -c opt //llama:Llama-3.1-8B-Instruct --@zml//runtimes:cpu=false --@zml//runtimes:neuron=true

You can re-enable donations by commenting out these lines: https://github.com/zml/zml/blob/steeve/synapse/zml/module.zig#L301-L303
