
Refactor the positional embedding config code #4920

Merged
5 commits merged into master from arashb/fix-falcon on Jan 10, 2024
Conversation

arashb (Contributor) commented Jan 8, 2024

The Mixtral PR #4828 introduced the positional embedding config class as a required argument of the `make_attn_layer()` function. This forced users to override and duplicate the `make_attn_layer()` call for every new model implementation that uses RoPE (and it also broke the Falcon model implementation). This PR:

  • refactors the inference transformer base class to avoid code duplication by adding a new abstract `positional_embedding_config` property (see the sketch below)
  • fixes the Falcon model implementation to use the positional embedding config

The models llama_v2, OPT, Mistral 7B, Mixtral, Falcon, and Phi-2 were tested with this PR.
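
Here is a minimal sketch of the pattern the PR describes. All names below (`TransformerBase`, `PositionalEmbeddingConfig`, `rotate_dim`, the `FalconModel` stub) are illustrative stand-ins, not DeepSpeed's actual API; the point is only how hoisting the config into an abstract property removes the need to override `make_attn_layer()` in each model:

```python
# Illustrative sketch, not DeepSpeed's real classes or signatures.
from abc import ABC, abstractmethod
from dataclasses import dataclass
from typing import Optional


@dataclass
class PositionalEmbeddingConfig:
    """Hypothetical config; real fields depend on the embedding type."""
    embedding_type: str = "none"      # e.g. a RoPE variant
    rotate_dim: Optional[int] = None  # rotation dimension for RoPE, if any


class TransformerBase(ABC):
    """Before the refactor, each RoPE model had to override make_attn_layer()
    just to pass its own positional embedding config. Exposing the config as
    an abstract property keeps a single make_attn_layer() in the base class."""

    @property
    @abstractmethod
    def positional_embedding_config(self) -> PositionalEmbeddingConfig:
        """Each model implementation declares its positional embedding here."""
        ...

    def make_attn_layer(self):
        # One shared implementation: the per-model config is looked up
        # through the property instead of being a required argument.
        cfg = self.positional_embedding_config
        print(f"building attention layer with {cfg}")


class FalconModel(TransformerBase):
    """Falcon uses RoPE, so it overrides only the property, not the builder."""

    @property
    def positional_embedding_config(self) -> PositionalEmbeddingConfig:
        return PositionalEmbeddingConfig(embedding_type="rotary", rotate_dim=64)


FalconModel().make_attn_layer()
```

With this shape, adding a new RoPE model means implementing one property rather than re-declaring the layer-construction call, which is the duplication the PR removes.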

mrwyattii merged commit c1e0205 into master on Jan 10, 2024
9 checks passed
mrwyattii deleted the arashb/fix-falcon branch on January 10, 2024, 17:33
loadams added a commit that referenced this pull request Jan 10, 2024
mrwyattii added a commit that referenced this pull request Jan 23, 2024
follow PR #4920 on Qwen inference code

Co-authored-by: Michael Wyatt <[email protected]>
mauryaavinash95 pushed a commit to mauryaavinash95/DeepSpeed that referenced this pull request Feb 17, 2024
mauryaavinash95 pushed a commit to mauryaavinash95/DeepSpeed that referenced this pull request Feb 17, 2024
rraminen pushed a commit to ROCm/DeepSpeed that referenced this pull request May 9, 2024