
📦 Update transformers to min of 4.36.0 to get new caching module #310

Merged
merged 8 commits into caikit:release-0.3 from add_granite_modeling_llama
Feb 1, 2024

Conversation

gkumbhat
Collaborator

@gkumbhat gkumbhat commented Jan 29, 2024

Changes

  • Update peft so it works with llama models; this PR takes care of Bump peft from 0.6.0 to 0.7 #305
  • Update the transformers library so it works with the new caching module required by llama models
  • Add a granite_modeling_llama module that declares, defines, and registers a new transformers model type (gpt_megatron) in the transformers causal LM registry, allowing us to load sphinx / granite models with AutoModelForCausalLM and prompt tune them (see the registration sketch after this list)
  • Integrate the granite_modeling_llama script into the causal-lm resource
  • Fix a bug so that the model config attribute is deleted safely
  • Fix a bug so that torch_dtype is used at load time for prompt-tuning local inference (see the loading sketch after this list)
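For context, here is a minimal sketch of the kind of registration granite_modeling_llama performs, assuming the module subclasses the existing llama implementation; the GptMegatronConfig / GptMegatronForCausalLM names are illustrative, not the actual class names in the module:

```python
# Hypothetical sketch: register a custom "gpt_megatron" model type with the
# transformers auto classes so AutoModelForCausalLM can load it directly.
from transformers import AutoConfig, AutoModelForCausalLM, LlamaConfig, LlamaForCausalLM


class GptMegatronConfig(LlamaConfig):
    # Must match the "model_type" field stored in the checkpoint's config.json
    model_type = "gpt_megatron"


class GptMegatronForCausalLM(LlamaForCausalLM):
    config_class = GptMegatronConfig


# Add the new type to the causal LM registry so AutoConfig / AutoModelForCausalLM resolve it
AutoConfig.register("gpt_megatron", GptMegatronConfig)
AutoModelForCausalLM.register(GptMegatronConfig, GptMegatronForCausalLM)
```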
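And a hedged sketch of honoring torch_dtype at load time for local prompt-tuning inference; the model path and dtype value here are placeholders, not values taken from this PR:

```python
# Hypothetical sketch: pass the configured dtype through to from_pretrained
# instead of loading weights in the library default dtype.
import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "path/to/model",            # placeholder path
    torch_dtype=torch.float16,  # load weights in the requested dtype
)
```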

Collaborator

@evaline-ju evaline-ju left a comment

LGTM!

pyproject.toml (review conversation resolved)
@gkumbhat gkumbhat merged commit aa066f8 into caikit:release-0.3 Feb 1, 2024
4 checks passed
@gkumbhat gkumbhat deleted the add_granite_modeling_llama branch February 1, 2024 19:56