
rebase #6

Merged: 30 commits into zamba2 on Nov 19, 2024

Conversation

pglorio
Collaborator

@pglorio pglorio commented Nov 19, 2024

No description provided.

faaany and others added 30 commits November 11, 2024 07:09
…ic (huggingface#33079)

* Add docs/source/ar/torchscript.md to Add_docs_source_ar_torchscript.md

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Merge troubleshooting.md with this branch

* Update _toctree.yml

* Update torchscript.md

* Update troubleshooting.md

---------

Co-authored-by: Abdullah Mohammed <[email protected]>
…4549)

* Better support transformers.agents in gradio: small fixes and additional tests
* initial translation

* removed English

* Fixed trivial typos, updated _toctree.yml
* add XPU path

* use accelerate API

* Update docs/source/en/tasks/semantic_segmentation.md

Co-authored-by: Steven Liu <[email protected]>

* update more places with accelerate API

---------

Co-authored-by: Steven Liu <[email protected]>
…uggingface#34253)

* Retain newlines in chat template when

* Add try/except

* Add regression test

* Simplify test

* Apply suggestions from code review

Co-authored-by: Matt <[email protected]>

---------

Co-authored-by: Matt <[email protected]>
* add xpu path for awq

* update readme
* add gradient accumulation steps tests for fsdp

* invert no_sync context to fix training for fsdp (see the sketch after this list)
* Remove FSDP wrapping from sub-models.

* solve conflict trainer.py

* make fixup

* add unit test for fsdp_auto_wrap_policy when using auto_find_batch_size

* put back extract_model_from_parallel

* use transformers unwrap_model
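
The no_sync inversion above is subtle: under FSDP, gradient synchronization must be suppressed on every micro-step except the last one of an accumulation window, and flipping that condition breaks training. A minimal sketch of the intended pattern, assuming a generic FSDP-wrapped `model` (names are illustrative, not the Trainer's actual code):

```python
import contextlib

def train_with_accumulation(model, batches, optimizer, accumulation_steps):
    """Gradient accumulation under FSDP: sync only on the last micro-step."""
    for step, batch in enumerate(batches):
        is_sync_step = (step + 1) % accumulation_steps == 0
        # model.no_sync() defers the gradient reduce-scatter; it must wrap
        # the NON-final micro-steps only.
        ctx = contextlib.nullcontext() if is_sync_step else model.no_sync()
        with ctx:
            loss = model(**batch).loss / accumulation_steps
            loss.backward()
        if is_sync_step:
            optimizer.step()
            optimizer.zero_grad()
```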
* remove v4.44 deprecations

* PR comments

* deprecations scheduled for v4.50

* hub version update

* make fixup

---------

Co-authored-by: Marc Sun <[email protected]>
Co-authored-by: Arthur <[email protected]>
* Add model skeleton with transformers-cli add-new-model-like

* Convert config to modular, add rms_norm_eps, delete clip_qkv

* Convert model to modular, add RMSNorm

* Add flash attention with qk norm and no qkv clipping

* Add decoder layer with RMSNorm after attention/feedforward layers (see the sketch after this list)

* Add base and causal model

* Add converter improvements from OLMo repo

* Update weight loading in OLMo to HF converter

* Set correct default for rms_norm_eps

* Set correct pipeline_model_mapping in test

* Run make fixup

* Fix model type

* Re-run modular conversion

* Manually set config docs to fix build errors

* Convert olmo-1124 to olmo_1124 to fix flash attention docs errors

* Start updating tests

* Update tests

* Copy upstream test_eager_matches_sdpa_inference_1_bfloat16 changes to olmo_1124

* Rename input_layernorm and post_attention_layernorm to reflect their ops better

* Use correct tokenizer

* Remove test unsupported by GPT2 tokenizer

* Create GenerationConfig outside of from_pretrained call

* Use simpler init file structure

* Add explicit __all__ to support simplified init

* Make safetensor serialization the default

* Update OLMo November 2024 docs
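
For context on the RMSNorm placement mentioned above, here is a minimal RMSNorm sketch under the usual formulation; it is not the exact OLMo implementation:

```python
import torch
from torch import nn

class RMSNorm(nn.Module):
    """Root-mean-square norm: scale x by 1/RMS(x), then a learned weight."""
    def __init__(self, hidden_size: int, eps: float = 1e-5):  # eps ~ rms_norm_eps
        super().__init__()
        self.weight = nn.Parameter(torch.ones(hidden_size))
        self.eps = eps

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        variance = x.pow(2).mean(-1, keepdim=True)
        return self.weight * x * torch.rsqrt(variance + self.eps)
```

Applying this norm to the output of the attention/feedforward blocks, rather than to their input, is what motivates the later rename of input_layernorm and post_attention_layernorm in this list.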
…3424)

* use num additional tokens

* fix copies + docs

* another fix copies :)

* add docs

* move order for BC
…when reading config (huggingface#34637)

fix a bug where 'id2label' was incorrectly written as 'i2label' when reading the config from a pretrained config
19d58d3 introduced a context manager to manage subtests of
test_training_gradient_checkpointing. However, the test body was not
moved under the "with" statement, so while the tests were correctly
marked as skipped, their bodies were still executed. In some cases,
as with llama, this caused attribute errors.

Fixes: huggingface#34722
Fixes: 19d58d3 ("Add MLLama (huggingface#33703)")

Signed-off-by: Dmitry Rogozhkin <[email protected]>
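
The pitfall is easy to reproduce. A hypothetical sketch (not the actual transformers test): a skipTest raised inside a subTest context only skips the code indented under the with statement, so a body left outside it still runs:

```python
import unittest

def heavy_test_body():
    # Stand-in for the real test body; in the transformers tests this is
    # where the llama AttributeError surfaced.
    print("test body executed despite the skip")

class ExampleTest(unittest.TestCase):
    def test_training_gradient_checkpointing(self):
        with self.subTest("unsupported configuration"):
            self.skipTest("gradient checkpointing not supported here")
        # BUG: this call sits outside the `with` block, so it still executes
        # even though the subtest above is reported as skipped. The fix is
        # to indent the body under the `with` statement.
        heavy_test_body()

if __name__ == "__main__":
    unittest.main()
```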
add XPU part to testing

Signed-off-by: Lin, Fanli <[email protected]>
…34184)

* Simplify Tensor Parallel implementation with PyTorch TP

* Move tp_plan to config

* Lint

* Format and warning

* Disable copy-from check

* Conditionally get attr from config

* make fix-copies

* Move base_model_tp_plan to PretrainedConfig

* Move TP into from_pretrained (see the usage sketch after this list)

* Add device context for load

* Do not serialize

* Move _tp_plan setting to post_init

* Add has_tp_plan

* Add test_tp

* Add 'Multi-gpu inference' doc

* Add backward support for device type identification

* Auto-detect accelerator

* supports_tp_plan

* copyright year

* Fix copy
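
Taken together, the commits above expose tensor parallelism through from_pretrained. A usage sketch, assuming a checkpoint that defines a base_model_tp_plan and a launch via torchrun (the model id is illustrative):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Run with: torchrun --nproc-per-node 4 demo.py
model_id = "meta-llama/Meta-Llama-3-8B-Instruct"  # illustrative checkpoint
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    tp_plan="auto",  # shard weights per the model's base_model_tp_plan
)
tokenizer = AutoTokenizer.from_pretrained(model_id)
inputs = tokenizer("The capital of France is", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=10)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```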
…uggingface#34687)

* Allow handling files as args for a tool created with `Tool.from_space`
* Revert "Revert "Fix Whisper CI" (huggingface#34605)"

This reverts commit 74d3824.

* update

---------

Co-authored-by: ydshieh <[email protected]>
Co-authored-by: Arthur <[email protected]>
@pglorio pglorio merged commit 4725983 into zamba2 Nov 19, 2024
24 of 42 checks passed
pglorio pushed a commit that referenced this pull request Jan 16, 2025
* gptqmodel

Signed-off-by: jiqing-feng <[email protected]>

* fix format

Signed-off-by: jiqing-feng <[email protected]>

* update readme

Signed-off-by: jiqing-feng <[email protected]>

* gptqmodel needs to use checkpoint_format (#1)

* gptqmodel needs to use checkpoint_format

* fix quantize

* Update quantization_config.py

* Update quantization_config.py

* Update quantization_config.py

---------

Co-authored-by: ZX-ModelCloud <[email protected]>
Co-authored-by: Qubitium-ModelCloud <[email protected]>

* Revert quantizer_gptq.py (#2)

* revert quantizer_gptq.py change

* pass **kwargs

* limit gptqmodel and optimum version

Signed-off-by: jiqing-feng <[email protected]>

* fix format

Signed-off-by: jiqing-feng <[email protected]>

* fix warning

Signed-off-by: jiqing-feng <[email protected]>

* fix version check

Signed-off-by: jiqing-feng <[email protected]>

* revert unrelated changes

Signed-off-by: jiqing-feng <[email protected]>

* enable gptqmodel tests

Signed-off-by: jiqing-feng <[email protected]>

* fix requires gptq

Signed-off-by: jiqing-feng <[email protected]>

* Fix Transformer compat (#3)

* revert quantizer_gptq.py change

* pass **kwargs

* add meta info

* cleanup

* cleanup

* Update quantization_config.py

* hf_select_quant_linear pass checkpoint_format and meta

* fix GPTQTestCUDA

* Update test_gptq.py

* gptqmodel.hf_select_quant_linear() no longer selects ExllamaV2

* cleanup

* add backend

* cleanup

* cleanup

* no need to check exllama version

* Update quantization_config.py

* lower checkpoint_format and backend

* check none

* cleanup

* Update quantization_config.py

* fix self.use_exllama == False

* spell

* fix unittest

* fix unittest

---------

Co-authored-by: LRL <[email protected]>
Co-authored-by: Qubitium-ModelCloud <[email protected]>

* fix format

Signed-off-by: jiqing-feng <[email protected]>

* fix format again

Signed-off-by: jiqing-feng <[email protected]>

* update gptqmodel version (#6)

* update gptqmodel version

* update gptqmodel version

* fix unit test (#5)

* update gptqmodel version

* update gptqmodel version

* "not self.use_exllama" is not equivalent to "self.use_exllama==False"

* fix unittest

* update gptqmodel version

* backend is loading_attributes (huggingface#7)

* fix format and tests

Signed-off-by: jiqing-feng <[email protected]>

* fix memory check

Signed-off-by: jiqing-feng <[email protected]>

* fix device mismatch

Signed-off-by: jiqing-feng <[email protected]>

* fix result check

Signed-off-by: jiqing-feng <[email protected]>

* Update src/transformers/quantizers/quantizer_gptq.py

Co-authored-by: Marc Sun <[email protected]>

* Update src/transformers/quantizers/quantizer_gptq.py

Co-authored-by: Marc Sun <[email protected]>

* Update src/transformers/quantizers/quantizer_gptq.py

Co-authored-by: Marc Sun <[email protected]>

* update tests

Signed-off-by: jiqing-feng <[email protected]>

* review: update docs (huggingface#10)

* review: update docs (huggingface#12)

* review: update docs

* fix typo

* update tests for gptqmodel

Signed-off-by: jiqing-feng <[email protected]>

* update document (huggingface#9)

* update overview.md

* cleanup

* Update overview.md

* Update overview.md

* Update overview.md

* update gptq.md

* Update gptq.md

* Update gptq.md

* Update gptq.md

* Update gptq.md

* Update gptq.md

* Update gptq.md

---------

Co-authored-by: Qubitium-ModelCloud <[email protected]>

* typo

* doc note for asymmetric quant

* typo with apple silicon(e)

* typo for marlin

* column name revert: review

* doc rocm support

* Update docs/source/en/quantization/gptq.md

Co-authored-by: Steven Liu <[email protected]>

* Update docs/source/en/quantization/gptq.md

Co-authored-by: Steven Liu <[email protected]>

* Update docs/source/en/quantization/gptq.md

Co-authored-by: Steven Liu <[email protected]>

* Update docs/source/en/quantization/gptq.md

Co-authored-by: Steven Liu <[email protected]>

* Update docs/source/en/quantization/overview.md

Co-authored-by: Steven Liu <[email protected]>

* Update docs/source/en/quantization/overview.md

Co-authored-by: Steven Liu <[email protected]>

---------

Signed-off-by: jiqing-feng <[email protected]>
Co-authored-by: LRL-ModelCloud <[email protected]>
Co-authored-by: ZX-ModelCloud <[email protected]>
Co-authored-by: Qubitium-ModelCloud <[email protected]>
Co-authored-by: ZX-ModelCloud <[email protected]>
Co-authored-by: LRL <[email protected]>
Co-authored-by: Marc Sun <[email protected]>
Co-authored-by: Mohamed Mekkouri <[email protected]>
Co-authored-by: Steven Liu <[email protected]>
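
As flagged in the unit-test fix above (the use_exllama bullet), the two spellings diverge because the flag is tri-state. A short demonstration:

```python
# use_exllama can be True, False, or None ("unset"). `not None` is truthy,
# while `None == False` is not, so the two checks disagree exactly when
# the flag is unset.
for use_exllama in (True, False, None):
    print(use_exllama, "-> not:", not use_exllama, "| == False:", use_exllama == False)
```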