Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support xpu for ipex static quant #1916

Merged
merged 14 commits into from
Jul 17, 2024
Merged

Support xpu for ipex static quant #1916

merged 14 commits into from
Jul 17, 2024

Conversation

violetch24
Copy link
Contributor

@violetch24 violetch24 commented Jul 12, 2024

Type of Change

feature and example

Description

Support xpu for ipex static quant

  • support quantized on xpu in current API
  • save & load using config mapping
  • enable op_name and op_type fallback
  • add xpu related ut

Expected Behavior & Potential Risk

How has this PR been tested?

ut pass

Dependency Change?

@violetch24 violetch24 requested review from xin3he and changwangss July 12, 2024 06:16
@violetch24 violetch24 marked this pull request as ready for review July 12, 2024 06:17
Copy link

github-actions bot commented Jul 12, 2024

⚡ Required checks status: All passing 🟢

Groups summary

🟢 Code Scan Tests workflow
Check ID Status Error details
Code-Scan success
Code-Scan (Bandit Code Scan Bandit) success
Code-Scan (DocStyle Code Scan DocStyle) success
Code-Scan (Pylint Code Scan Pylint) success

These checks are required after the changes to neural_compressor/torch/algorithms/static_quant/save_load.py, neural_compressor/torch/algorithms/static_quant/static_quant.py, neural_compressor/torch/algorithms/static_quant/utility.py, neural_compressor/torch/quantization/config.py.

🟢 Model Tests 3x workflow
Check ID Status Error details
Model-Test-3x success
Model-Test-3x (Generate Report GenerateReport) success
Model-Test-3x (Run PyTorch Model opt_125m_woq_gptq_int4) success
Model-Test-3x (Run PyTorch Model opt_125m_woq_gptq_int4_dq_bnb) success
Model-Test-3x (Run PyTorch Model opt_125m_woq_gptq_int4_dq_ggml) success

These checks are required after the changes to neural_compressor/torch/algorithms/static_quant/save_load.py, neural_compressor/torch/algorithms/static_quant/static_quant.py, neural_compressor/torch/algorithms/static_quant/utility.py, neural_compressor/torch/quantization/config.py.

🟢 Unit Tests 3x-PyTorch workflow
Check ID Status Error details
UT-3x-Torch success
UT-3x-Torch (Coverage Compare CollectDatafiles) success
UT-3x-Torch (Unit Test 3x Torch Unit Test 3x Torch) success
UT-3x-Torch (Unit Test 3x Torch baseline Unit Test 3x Torch baseline) success

These checks are required after the changes to neural_compressor/torch/algorithms/static_quant/save_load.py, neural_compressor/torch/algorithms/static_quant/static_quant.py, neural_compressor/torch/algorithms/static_quant/utility.py, neural_compressor/torch/quantization/config.py, test/3x/torch/quantization/test_static_quant.py.


Thank you for your contribution! 💜

Note
This comment is automatically generated and will be updates every 180 seconds within the next 6 hours. If you have any other questions, contact chensuyue or XuehaoSun for help.

@chensuyue chensuyue added this to the v3.0 milestone Jul 16, 2024
@chensuyue chensuyue merged commit 53e6ee6 into master Jul 17, 2024
25 checks passed
@chensuyue chensuyue deleted the zixuan/ipex_xpu branch July 17, 2024 12:35
xin3he pushed a commit that referenced this pull request Oct 11, 2024
commit 2bb257e
Author: Kaihui-intel <[email protected]>
Date:   Thu Oct 10 19:27:11 2024 +0800

    Add woq examples (#1982)

    Signed-off-by: Kaihui-intel <[email protected]>
    Signed-off-by: Sun, Xuehao <[email protected]>
    Co-authored-by: Sun, Xuehao <[email protected]>

commit 586eb88
Author: Huang, Tai <[email protected]>
Date:   Wed Oct 9 09:22:39 2024 +0800

    add transformers-like api link in readme (#2022)

    Signed-off-by: Huang, Tai <[email protected]>

commit 4e9c764
Author: Kaihui-intel <[email protected]>
Date:   Tue Oct 8 13:13:45 2024 +0800

    Remove itrex dependency for 3x example (#2016)

    Signed-off-by: Kaihui-intel <[email protected]>
    Co-authored-by: Sun, Xuehao <[email protected]>

commit a0066d4
Author: Kaihui-intel <[email protected]>
Date:   Mon Sep 30 18:17:32 2024 +0800

    Fix transformers rtn layer-wise quant (#2008)

    Signed-off-by: Kaihui-intel <[email protected]>
    Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

commit 802a5af
Author: Huang, Tai <[email protected]>
Date:   Mon Sep 30 17:02:52 2024 +0800

    add autoround EMNLP24 to pub list (#2014)

    Signed-off-by: Huang, Tai <[email protected]>

commit 44795a1
Author: Kaihui-intel <[email protected]>
Date:   Mon Sep 30 16:55:22 2024 +0800

    Adapt transformers 4.45.1 (#2019)

    Signed-off-by: Kaihui-intel <[email protected]>
    Co-authored-by: changwangss <[email protected]>
    Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

commit d4662ad
Author: Kaihui-intel <[email protected]>
Date:   Mon Sep 30 15:52:17 2024 +0800

    Add transformers-like api doc (#2018)

    Signed-off-by: Kaihui-intel <[email protected]>

commit 72398b6
Author: Wang, Chang <[email protected]>
Date:   Fri Sep 27 15:11:04 2024 +0800

    fix xpu device set weight and bias (#2010)

    Signed-off-by: changwangss <[email protected]>
    Co-authored-by: Sun, Xuehao <[email protected]>

commit 9d27743
Author: Sun, Xuehao <[email protected]>
Date:   Fri Sep 27 14:17:24 2024 +0800

    Update model accuracy (#2006)

    Signed-off-by: Sun, Xuehao <[email protected]>

commit 7bbc473
Author: xinhe <[email protected]>
Date:   Fri Sep 27 11:47:00 2024 +0800

    add pad_to_buckets in evaluation for hpu performance (#2011)

    * add pad_to_buckets in evaluation for hpu performance
    ---------

    Signed-off-by: xin3he <[email protected]>

commit b6b7d7c
Author: Kaihui-intel <[email protected]>
Date:   Thu Sep 26 17:21:54 2024 +0800

    Update auto_round requirements for transformers example (#2013)

    Signed-off-by: Kaihui-intel <[email protected]>

commit ee600ba
Author: Wang, Chang <[email protected]>
Date:   Fri Sep 20 13:54:06 2024 +0800

    add repack_awq_to_optimum_format function (#1998)

    Signed-off-by: changwangss <[email protected]>

commit 4ee6861
Author: Sun, Xuehao <[email protected]>
Date:   Thu Sep 19 22:27:05 2024 +0800

    remove accelerate version in unit test (#2007)

    Signed-off-by: Sun, Xuehao <[email protected]>

commit 2445811
Author: WeiweiZhang1 <[email protected]>
Date:   Sat Sep 14 18:13:30 2024 +0800

    enable auto_round format export (#2002)

    Signed-off-by: Zhang, Weiwei1 <[email protected]>

commit 906333a
Author: Kaihui-intel <[email protected]>
Date:   Sat Sep 14 16:17:46 2024 +0800

    Replace FORCE_DEVICE with INC_TARGET_DEVICE [transformers] (#2005)

    Signed-off-by: Kaihui-intel <[email protected]>

commit 443d007
Author: xinhe <[email protected]>
Date:   Fri Sep 13 21:35:32 2024 +0800

    add INC_FORCE_DEVICE introduction (#1988)

    * add INC_FORCE_DEVICE introduction

    Signed-off-by: xin3he <[email protected]>

    * Update PyTorch.md

    * Update PyTorch.md

    * Update docs/source/3x/PyTorch.md

    Co-authored-by: Yi Liu <[email protected]>

    * rename to INC_TARGET_DEVICE

    Signed-off-by: xin3he <[email protected]>

    ---------

    Signed-off-by: xin3he <[email protected]>
    Co-authored-by: Yi Liu <[email protected]>

commit 5de9a4f
Author: Kaihui-intel <[email protected]>
Date:   Fri Sep 13 20:48:22 2024 +0800

    Support transformers-like api for woq quantization (#1987)

    Signed-off-by: Kaihui-intel <[email protected]>
    Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
    Co-authored-by: Wang, Chang <[email protected]>

commit 9c39b42
Author: chen, suyue <[email protected]>
Date:   Thu Sep 12 14:34:49 2024 +0800

    update docker image prune rules (#2003)

    Signed-off-by: chensuyue <[email protected]>

commit 09d4f2d
Author: Huang, Tai <[email protected]>
Date:   Mon Sep 9 09:24:35 2024 +0800

    Add recent publications (#1995)

    * add recent publications

    Signed-off-by: Huang, Tai <[email protected]>

    * update total count

    Signed-off-by: Huang, Tai <[email protected]>

    ---------

    Signed-off-by: Huang, Tai <[email protected]>

commit 399cd44
Author: Kaihui-intel <[email protected]>
Date:   Tue Sep 3 16:37:09 2024 +0800

     Remove the save of gptq config (#1993)

    Signed-off-by: Kaihui-intel <[email protected]>

commit 05272c4
Author: Yi Liu <[email protected]>
Date:   Tue Sep 3 10:21:51 2024 +0800

    add per_channel_minmax (#1990)

    Signed-off-by: yiliu30 <[email protected]>

commit 82d8c06
Author: chen, suyue <[email protected]>
Date:   Fri Aug 30 21:21:00 2024 +0800

    update 3x pt binary build (#1992)

    Signed-off-by: chensuyue <[email protected]>

commit e9f06af
Author: Huang, Tai <[email protected]>
Date:   Fri Aug 30 17:49:48 2024 +0800

    Update installation_guide.md (#1989)

    Correct typo in installation doc

commit 093c966
Author: Wang, Chang <[email protected]>
Date:   Fri Aug 30 17:45:54 2024 +0800

    add quantize, save, load function for transformers-like api (#1986)

    Signed-off-by: changwangss <[email protected]>

commit 4dd49a4
Author: xinhe <[email protected]>
Date:   Thu Aug 29 17:23:18 2024 +0800

    add hasattr check for torch fp8 dtype (#1985)

    Signed-off-by: xin3he <[email protected]>

commit f2c454f
Author: chen, suyue <[email protected]>
Date:   Thu Aug 29 13:45:39 2024 +0800

    update installation and ci test for 3x api (#1991)

    Signed-off-by: chensuyue <[email protected]>

commit 7ba9fdc
Author: Kaihui-intel <[email protected]>
Date:   Mon Aug 19 14:50:50 2024 +0800

    support gptq `true_sequential` and `quant_lm_head` (#1977)

    Signed-off-by: Kaihui-intel <[email protected]>

commit 68b1f8b
Author: Sun, Xuehao <[email protected]>
Date:   Fri Aug 16 09:43:46 2024 +0800

    Fix UT env and upgrade torch to 2.4.0 (#1978)

    Signed-off-by: Sun, Xuehao <[email protected]>

commit f9dfd54
Author: Yi Liu <[email protected]>
Date:   Thu Aug 15 14:13:26 2024 +0800

    Skip some tests for torch 2.4 (#1981)

    Signed-off-by: yiliu30 <[email protected]>

commit 46d9192
Author: xinhe <[email protected]>
Date:   Thu Aug 15 09:57:22 2024 +0800

    update readme for fp8 (#1979)

    Signed-off-by: xinhe3 <[email protected]>

commit 842b715
Author: chen, suyue <[email protected]>
Date:   Tue Aug 13 12:09:25 2024 +0800

    bump main version into v3.1 (#1974)

    Signed-off-by: chensuyue <[email protected]>

commit 3845cdc
Author: Neo Zhang Jianyu <[email protected]>
Date:   Tue Aug 13 12:09:09 2024 +0800

    fix online doc search issue (#1975)

    Co-authored-by: ZhangJianyu <[email protected]>

commit 7056720
Author: chen, suyue <[email protected]>
Date:   Sun Aug 11 20:58:34 2024 +0800

    update main page (#1973)

    Signed-off-by: chensuyue <[email protected]>

commit 95197d1
Author: xinhe <[email protected]>
Date:   Sat Aug 10 23:28:43 2024 +0800

    Cherry pick v1.17.0 (#1964)

    * [SW-184941] INC CI, CD and Promotion

    Change-Id: I60c420f9776e1bdab7bb9e02e5bcbdb6891bfe52

    * [SW-183320]updated setup.py

    Change-Id: I592af89486cb1d9e0b5197521c428920197a9103

    * [SW-177474] add HQT FP8 porting code

    Change-Id: I4676f13a5ed43c444f2ec68675cc41335e7234dd
    Signed-off-by: Zhou Yuwen <[email protected]>

    * [SW-189361] Fix white list extend

    Change-Id: Ic2021c248798fce37710d28014a6d59259c868a3

    * [SW-191317] Raise exception according to hqt config object

    Change-Id: I06ba8fa912c811c88912987c11e5c12ef328348a

    * [SW-184714] Port HQT code into INC

    HQT lib content was copied as is under fp8_quant

    Tests were copied to 3.x torch location

    Change-Id: Iec6e1fa7ac4bf1df1c95b429524c40e32bc13ac9

    * [SW-184714] Add internal folder to fp8 quant

    This is a folder used for experiments,
    not to be used by users

    Change-Id: I9e221ae582794e304e95392c0f37638f7bce69bc

    * [SW-177468] Removed unused code + cleanup

    Change-Id: I4d27c067e87c1a30eb1da9df16a16c46d092c638

    * Fix errors in regression_detection

    Change-Id: Iee5318bd5593ba349812516eb5641958ece3c438

    * [SW-187731] Save orig module as member of patched module

    This allows direct usage of the original module methods,
    which solves torch compile issue

    Change-Id: I464d8bd1bacdfc3cd1f128a67114e1e43f092632

    * [SW-190899] Install packages according to configuration

    Change-Id: I570b490658f5d2c5399ba1db93f8f52f56449525

    * [SW-184689] use finalize_calibration intrenaly for one step flow

    Change-Id: Ie0b8b426c951cf57ed7e6e678c86813fb2d05c89

    * [SW-191945] align requirement_pt.txt in gerrit INC with Github INC

    Change-Id: If5c0dbf21bf989af37a8e29246e4f8760cd215ef
    Signed-off-by: xinhe3 <[email protected]>

    * [SW-192358] Remove HQT reference in INC

    Change-Id: Ic25f9323486596fa2dc6d909cd568a37ab84dd5e

    * [SW-191415] update fp8 maxAbs observer  using torch.copy_

    Change-Id: I3923c832f9a8a2b14e392f3f4719d233a457702f

    * [SW-184943] Enhance INC WOQ model loading

    - Support loading huggingface WOQ model
    - Abstract WeightOnlyLinear base class. Add INCWeightOnlyLinear and HPUWeighOnlyLinear subclasses
    - Load woq linear weight module by module
    - Save hpu format tensor to reuse it once load it again

    Change-Id: I679a42759b49e1f45f52bbb0bdae8580a23d0bcf

    * [SW-190303] Implement HPUWeightOnlyLinear class in INC

    Change-Id: Ie05c8787e708e2c3559dce24ef0758d6c498ac41

    * [SW-192809] fix json_file bug when instantiating FP8Config class

    Change-Id: I4a715d0a706efe20ccdb49033755cabbc729ccdc
    Signed-off-by: Zhou Yuwen <[email protected]>

    * [SW-192931] align setup.py with github INC and remove fp8_convert

    Change-Id: Ibbc157646cfcfad64b323ecfd96b9bbda5ba9e2f
    Signed-off-by: xinhe3 <[email protected]>

    * [SW-192917] Update all HQT logic files with pre-commit check

    Change-Id: I119dc8578cb10932fd1a8a674a8bdbf61f978e42
    Signed-off-by: xinhe3 <[email protected]>

    * update docstring

    Signed-off-by: yuwenzho <[email protected]>

    * add fp8 example and document (#1639)

    Signed-off-by: xinhe3 <[email protected]>

    * Update settings to be compatible with gerrit

    * enhance ut

    Signed-off-by: yuwenzho <[email protected]>

    * move fp8 sample to helloworld folder

    Signed-off-by: yuwenzho <[email protected]>

    * update torch version of habana docker

    Signed-off-by: xinhe3 <[email protected]>

    * [pre-commit.ci] auto fixes from pre-commit.com hooks

    for more information, see https://pre-commit.ci

    * update readme demo

    Signed-off-by: xinhe3 <[email protected]>

    * update WeightOnlyLinear to INCWeightOnlyLinear

    Signed-off-by: xinhe3 <[email protected]>

    * [pre-commit.ci] auto fixes from pre-commit.com hooks

    for more information, see https://pre-commit.ci

    * add docstring for FP8Config

    Signed-off-by: xinhe3 <[email protected]>

    * fix pylint

    Signed-off-by: xinhe3 <[email protected]>

    * update fp8 test scripts

    Signed-off-by: chensuyue <[email protected]>

    * delete deps

    Signed-off-by: chensuyue <[email protected]>

    * update container into v1.17.0

    Signed-off-by: chensuyue <[email protected]>

    * update docker version

    Signed-off-by: xinhe3 <[email protected]>

    * update pt ut

    Signed-off-by: chensuyue <[email protected]>

    * add lib path

    Signed-off-by: chensuyue <[email protected]>

    * fix dir issue

    Signed-off-by: xinhe3 <[email protected]>

    * [pre-commit.ci] auto fixes from pre-commit.com hooks

    for more information, see https://pre-commit.ci

    * update fp8 test scope

    Signed-off-by: chensuyue <[email protected]>

    * fix typo

    Signed-off-by: xinhe3 <[email protected]>

    * update fp8 test scope

    Signed-off-by: chensuyue <[email protected]>

    * update pre-commit-ci

    Signed-off-by: chensuyue <[email protected]>

    * work around for hpu

    Signed-off-by: xinhe3 <[email protected]>

    * fix UT

    Signed-off-by: xinhe3 <[email protected]>

    * fix parameter

    Signed-off-by: chensuyue <[email protected]>

    * omit some test

    Signed-off-by: chensuyue <[email protected]>

    * update main page example to llm loading

    Signed-off-by: xinhe3 <[email protected]>

    * [pre-commit.ci] auto fixes from pre-commit.com hooks

    for more information, see https://pre-commit.ci

    * fix autotune

    Signed-off-by: xinhe3 <[email protected]>

    ---------

    Signed-off-by: Zhou Yuwen <[email protected]>
    Signed-off-by: xinhe3 <[email protected]>
    Signed-off-by: yuwenzho <[email protected]>
    Signed-off-by: chensuyue <[email protected]>
    Co-authored-by: yan tomsinsky <[email protected]>
    Co-authored-by: Ron Ben Moshe <[email protected]>
    Co-authored-by: Uri Livne <[email protected]>
    Co-authored-by: Danny Semiat <[email protected]>
    Co-authored-by: smarkovichgolan <[email protected]>
    Co-authored-by: Dudi Lester <[email protected]>

commit de0fa21
Author: Huang, Tai <[email protected]>
Date:   Fri Aug 9 22:32:37 2024 +0800

    Fix broken link in docs (#1969)

    Signed-off-by: Huang, Tai <[email protected]>

commit 385da7c
Author: Sun, Xuehao <[email protected]>
Date:   Fri Aug 9 21:53:51 2024 +0800

    Add 3.x readme (#1971)

    Signed-off-by: Sun, Xuehao <[email protected]>

commit acd8f4f
Author: Huang, Tai <[email protected]>
Date:   Fri Aug 9 15:24:14 2024 +0800

    Add version mapping between INC and Gaudi SW Stack (#1967)

    Signed-off-by: Huang, Tai <[email protected]>

commit 74a4641
Author: Sun, Xuehao <[email protected]>
Date:   Fri Aug 9 10:23:59 2024 +0800

    remove unnecessary CI (#1966)

    Signed-off-by: Sun, Xuehao <[email protected]>

commit b99abae
Author: Kaihui-intel <[email protected]>
Date:   Tue Aug 6 16:02:03 2024 +0800

    Fix `opt_125m_woq_gptq_int4_dq_ggml` issue (#1965)

    Signed-off-by: Kaihui-intel <[email protected]>

commit b35ff8f
Author: Zixuan Cheng <[email protected]>
Date:   Fri Aug 2 09:06:35 2024 +0800

    example update for 3.x ipex sq (#1902)

    Signed-off-by: violetch24 <[email protected]>

commit 000946f
Author: Zixuan Cheng <[email protected]>
Date:   Thu Aug 1 10:19:32 2024 +0800

    add SDXL model example to INC 3.x (#1887)

    * add SDXL model example to INC 3.x

    Signed-off-by: Cheng, Zixuan <[email protected]>

    * add evaluation script

    Signed-off-by: violetch24 <[email protected]>

    * add test script

    Signed-off-by: violetch24 <[email protected]>

    * minor fix

    Signed-off-by: violetch24 <[email protected]>

    * Update run_quant.sh

    * add iter limit

    Signed-off-by: violetch24 <[email protected]>

    * modify test script

    Signed-off-by: violetch24 <[email protected]>

    * update json

    Signed-off-by: chensuyue <[email protected]>

    * add requirements

    Signed-off-by: violetch24 <[email protected]>

    * Update run_benchmark.sh

    * Update sdxl_smooth_quant.py

    * minor fix

    Signed-off-by: violetch24 <[email protected]>

    ---------

    Signed-off-by: Cheng, Zixuan <[email protected]>
    Signed-off-by: violetch24 <[email protected]>
    Signed-off-by: chensuyue <[email protected]>
    Co-authored-by: violetch24 <[email protected]>
    Co-authored-by: chensuyue <[email protected]>

commit aa42e5e
Author: xinhe <[email protected]>
Date:   Wed Jul 31 15:36:06 2024 +0800

    replenish docstring (#1955)

    * replenish docstring

    Signed-off-by: xin3he <[email protected]>

    * update  Quantizer API docstring

    Signed-off-by: xin3he <[email protected]>

    * Add docstring for auto accelerator (#1956)

    Signed-off-by: yiliu30 <[email protected]>

    * temporary remove torch/quantization and add it back after fp8 code is updated.

    * Update config.py

    ---------

    Signed-off-by: xin3he <[email protected]>
    Signed-off-by: yiliu30 <[email protected]>
    Co-authored-by: Yi Liu <[email protected]>

commit 81a076d
Author: Neo Zhang Jianyu <[email protected]>
Date:   Wed Jul 31 13:51:33 2024 +0800

    fix welcome.html link issue (#1962)

    Co-authored-by: ZhangJianyu <[email protected]>

commit 87f02c1
Author: chen, suyue <[email protected]>
Date:   Wed Jul 31 10:09:47 2024 +0800

    fix docs link (#1959)

    Signed-off-by: chensuyue <[email protected]>

commit 03813e2
Author: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Date:   Wed Jul 31 10:09:29 2024 +0800

    Bump tensorflow version (#1961)

    Signed-off-by: dependabot[bot] <[email protected]>

commit 3b5dbf6
Author: Kaihui-intel <[email protected]>
Date:   Tue Jul 30 17:27:21 2024 +0800

    Set low_gpu_mem_usage=False for AutoRound

    Signed-off-by: Kaihui-intel <[email protected]>

commit 41244d3
Author: chen, suyue <[email protected]>
Date:   Mon Jul 29 23:05:36 2024 +0800

    new previous results could not find all raise issues in CI model test (#1958)

    Signed-off-by: chensuyue <[email protected]>

commit 190e6b2
Author: Kaihui-intel <[email protected]>
Date:   Mon Jul 29 19:39:57 2024 +0800

    Fix itrex qbits nf4/int8 training core dumped issue (#1954)

    Signed-off-by: Kaihui-intel <[email protected]>
    Signed-off-by: chensuyue <[email protected]>

commit 0e724a4
Author: Kaihui-intel <[email protected]>
Date:   Mon Jul 29 16:22:13 2024 +0800

    Add save/load for pt2e example (#1927)

    Signed-off-by: Kaihui-intel <[email protected]>

commit 50eb6fb
Author: chen, suyue <[email protected]>
Date:   Mon Jul 29 13:40:36 2024 +0800

    update 3x torch installation (#1957)

    Signed-off-by: chensuyue <[email protected]>

commit 6e1b1da
Author: Zixuan Cheng <[email protected]>
Date:   Fri Jul 26 15:58:00 2024 +0800

    add ipex xpu example to 3x API (#1948)

    Signed-off-by: violetch24 <[email protected]>

commit 19024b3
Author: zehao-intel <[email protected]>
Date:   Fri Jul 26 14:52:01 2024 +0800

    Enable yolov5 Example for TF 3x API  (#1943)

    Signed-off-by: zehao-intel <[email protected]>

commit d84a93f
Author: zehao-intel <[email protected]>
Date:   Thu Jul 25 14:45:19 2024 +0800

    Complement UT of calibration function for TF 3x API (#1945)

    Signed-off-by: zehao-intel <[email protected]>

commit fb85779
Author: zehao-intel <[email protected]>
Date:   Thu Jul 25 14:04:25 2024 +0800

    Update Examples for TF 3x API (#1901)

    Signed-off-by: zehao-intel <[email protected]>

commit 6b30207
Author: zehao-intel <[email protected]>
Date:   Thu Jul 25 13:39:06 2024 +0800

    Add Docstring for TF 3x API and Torch 3x Mixed Precision (#1944)

    Signed-off-by: zehao-intel <[email protected]>

commit d254d50
Author: Yi Liu <[email protected]>
Date:   Wed Jul 24 21:50:44 2024 +0800

    Update doc for client-usage and LWQ (#1947)

    Signed-off-by: yiliu30 <[email protected]>

commit f253d35
Author: Neo Zhang Jianyu <[email protected]>
Date:   Wed Jul 24 17:48:05 2024 +0800

    Update publish.yml (#1950)

commit 6cda338
Author: Neo Zhang Jianyu <[email protected]>
Date:   Wed Jul 24 17:31:19 2024 +0800

    Update publish.yml (#1949)

    * Update publish.yml

    * Update publish.yml

commit c80b68a
Author: Kaihui-intel <[email protected]>
Date:   Tue Jul 23 21:26:53 2024 +0800

    Update AutoRound commit version (#1941)

    Signed-off-by: Kaihui-intel <[email protected]>

commit 9077b38
Author: zehao-intel <[email protected]>
Date:   Tue Jul 23 17:04:37 2024 +0800

    Refine Pytorch 3x Mixed Precision Example (#1946)

    Signed-off-by: zehao-intel <[email protected]>

commit efcb293
Author: Neo Zhang Jianyu <[email protected]>
Date:   Tue Jul 23 10:15:41 2024 +0800

    Update for API 3.0 online doc (#1940)

    Co-authored-by: ZhangJianyu <[email protected]>

commit b787940
Author: Wang, Mengni <[email protected]>
Date:   Tue Jul 23 10:12:34 2024 +0800

    add docstring for mx quant (#1932)

    Signed-off-by: Mengni Wang <[email protected]>
    Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
    Co-authored-by: xinhe <[email protected]>

commit 0c52e12
Author: Kaihui-intel <[email protected]>
Date:   Tue Jul 23 09:59:17 2024 +0800

    Add docstring for WOQ&LayerWise (#1938)

    Signed-off-by: Kaihui-intel <[email protected]>
    Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
    Co-authored-by: xinhe <[email protected]>

commit 08914d6
Author: Huang, Tai <[email protected]>
Date:   Mon Jul 22 11:14:44 2024 +0800

    add read permission token (#1942)

    Signed-off-by: Huang, Tai <[email protected]>

commit e106dea
Author: zehao-intel <[email protected]>
Date:   Sun Jul 21 21:48:51 2024 +0800

    Update Example for Pytorch 3x Mixed Precision (#1882)

    Signed-off-by: zehao-intel <[email protected]>

commit 1ebf698
Author: Zixuan Cheng <[email protected]>
Date:   Fri Jul 19 15:56:09 2024 +0800

    add docstring for static quant and smooth quant (#1936)

    * add docstring for static quant and smooth quant

    Signed-off-by: violetch24 <[email protected]>

    * format fix

    Signed-off-by: violetch24 <[email protected]>

    * update scan path

    Signed-off-by: violetch24 <[email protected]>

    * Update utility.py

    ---------

    Signed-off-by: violetch24 <[email protected]>
    Co-authored-by: violetch24 <[email protected]>

commit 296c5d4
Author: Yi Liu <[email protected]>
Date:   Fri Jul 19 15:08:05 2024 +0800

    Add docstring for PT2E and HQQ (#1937)

    Signed-off-by: yiliu30 <[email protected]>

commit 437c8e7
Author: Kaihui-intel <[email protected]>
Date:   Thu Jul 18 10:00:41 2024 +0800

    Fix unused pkgs  import (#1931)

    Signed-off-by: Kaihui-intel <[email protected]>

commit ff37401
Author: chen, suyue <[email protected]>
Date:   Wed Jul 17 23:11:15 2024 +0800

    3.X API installation update (#1935)

    Signed-off-by: chensuyue <[email protected]>

commit 6c27c19
Author: zehao-intel <[email protected]>
Date:   Wed Jul 17 20:35:42 2024 +0800

    Support calib_func on TF 3x API (#1934)

    Signed-off-by: zehao-intel <[email protected]>

commit 53e6ee6
Author: Zixuan Cheng <[email protected]>
Date:   Wed Jul 17 20:35:03 2024 +0800

    Support xpu for ipex static quant (#1916)

    Signed-off-by: violetch24 <[email protected]>

commit a1cc618
Author: chen, suyue <[email protected]>
Date:   Wed Jul 17 17:29:49 2024 +0800

    remove peft version limit (#1933)

    Signed-off-by: chensuyue <[email protected]>

commit 3058388
Author: Yi Liu <[email protected]>
Date:   Wed Jul 17 15:31:38 2024 +0800

    Add doc for client usage (#1914)

    Signed-off-by: yiliu30 <[email protected]>

commit 29471df
Author: Kaihui-intel <[email protected]>
Date:   Wed Jul 17 12:12:40 2024 +0800

    Enhance load_empty_model import (#1930)

    Signed-off-by: Kaihui-intel <[email protected]>

commit fd96851
Author: Kaihui-intel <[email protected]>
Date:   Wed Jul 17 12:05:32 2024 +0800

    Integrate AutoRound v0.3 to 2x (#1926)

    Signed-off-by: Kaihui-intel <[email protected]>

commit bfa27e4
Author: Kaihui-intel <[email protected]>
Date:   Wed Jul 17 09:33:13 2024 +0800

    Integrate AutoRound v0.3 (#1925)

    Signed-off-by: Kaihui-intel <[email protected]>

commit 5767aed
Author: xinhe <[email protected]>
Date:   Wed Jul 17 09:16:37 2024 +0800

    add docstring for torch.quantization and torch.utils (#1928)

    Signed-off-by: xin3he <[email protected]>

commit f909bca
Author: chen, suyue <[email protected]>
Date:   Tue Jul 16 21:12:54 2024 +0800

    update itrex ut test (#1929)

    Signed-off-by: chensuyue <[email protected]>

commit 649e6b1
Author: Kaihui-intel <[email protected]>
Date:   Tue Jul 16 21:05:55 2024 +0800

    Support LayerWise for RTN/GPTQ (#1883)

    Signed-off-by: Kaihui-intel <[email protected]>
    Co-authored-by: chensuyue <[email protected]>

commit de43d85
Author: Kaihui-intel <[email protected]>
Date:   Tue Jul 16 17:18:12 2024 +0800

    Support absorb dict for awq (#1920)

    Signed-off-by: Kaihui-intel <[email protected]>

commit e976595
Author: Kaihui-intel <[email protected]>
Date:   Tue Jul 16 17:17:56 2024 +0800

    Support woq Autotune (#1921)

    Signed-off-by: Kaihui-intel <[email protected]>

commit d56075c
Author: Huang, Tai <[email protected]>
Date:   Tue Jul 16 15:21:06 2024 +0800

    fix typo in architecture diagram (#1924)

    Signed-off-by: Huang, Tai <[email protected]>

commit 0a54239
Author: chen, suyue <[email protected]>
Date:   Tue Jul 16 15:12:43 2024 +0800

    update documentation for 3x API (#1923)

    Signed-off-by: chensuyue <[email protected]>
    Signed-off-by: xin3he <[email protected]>
    Signed-off-by: yiliu30 <[email protected]>

commit be42d03
Author: xinhe <[email protected]>
Date:   Tue Jul 16 09:48:48 2024 +0800

    implement TorchBaseConfig (#1911)

    Signed-off-by: xin3he <[email protected]>

commit 7a4715c
Author: Kaihui-intel <[email protected]>
Date:   Mon Jul 15 14:59:03 2024 +0800

    Support PT2E save and load (#1918)

    Signed-off-by: Kaihui-intel <[email protected]>

commit 34f0a9f
Author: Yi Liu <[email protected]>
Date:   Mon Jul 15 09:10:14 2024 +0800

    Add `save`/`load` support for HQQ (#1913)

    Signed-off-by: yiliu30 <[email protected]>
    Co-authored-by: chen, suyue <[email protected]>

commit d320460
Author: Yi Liu <[email protected]>
Date:   Fri Jul 12 14:48:12 2024 +0800

    remove 1x docs (#1900)

    Signed-off-by: yiliu30 <[email protected]>

commit 6c547f7
Author: chen, suyue <[email protected]>
Date:   Fri Jul 12 14:42:04 2024 +0800

    fix CI docker container clean up issue (#1917)

    Signed-off-by: chensuyue <[email protected]>

commit 1703658
Author: chen, suyue <[email protected]>
Date:   Fri Jul 12 11:14:48 2024 +0800

    Remove deprecated modules (#1872)

    Signed-off-by: chensuyue <[email protected]>

commit f698c96
Author: chen, suyue <[email protected]>
Date:   Thu Jul 11 18:00:28 2024 +0800

    update Gaudi CI baseline artifacts name (#1912)

    Signed-off-by: chensuyue <[email protected]>

commit 4a45093
Author: Yi Liu <[email protected]>
Date:   Thu Jul 11 17:47:47 2024 +0800

    Add export support for TEQ (#1910)

    Signed-off-by: yiliu30 <[email protected]>

commit 16a7b11
Author: Yi Liu <[email protected]>
Date:   Thu Jul 11 17:13:24 2024 +0800

    Get default config based on the auto-detect CPU type (#1904)

    Signed-off-by: yiliu30 <[email protected]>

commit 2fc7255
Author: xinhe <[email protected]>
Date:   Thu Jul 11 13:22:52 2024 +0800

    implement `incbench` command for ease-of-use benchmark (#1884)
     implement incbench command as entrypoint for ease-of-use benchmark
     automatically check numa/socket info and dump it with table for ease-of-understand
     supports both Linux and Windows platform
     add benchmark documents
     dump benchmark summary
     add benchmark UTs
    incbench main.py: run 1 instance on NUMA:0.
    incbench --num_i 2 main.py: run 2 instances on NUMA:0.
    incbench --num_c 2 main.py: run multi-instances with 2 cores per instance on NUMA:0.
    incbench -C 24-47 main.py: run 1 instance on COREs:24-47.
    incbench -C 24-47 --num_c 4 main.py: run multi-instances with 4 COREs per instance on COREs:24-47.

    ---------

    Signed-off-by: xin3he <[email protected]>
    Co-authored-by: chen, suyue <[email protected]>

commit de8577e
Author: chen, suyue <[email protected]>
Date:   Wed Jul 10 17:21:45 2024 +0800

    bump version into 3.0 (#1908)

    Signed-off-by: chensuyue <[email protected]>

commit 01f16c4
Author: chen, suyue <[email protected]>
Date:   Wed Jul 10 17:19:57 2024 +0800

    support habana fp8 UT test in CI (#1909)

    Signed-off-by: chensuyue <[email protected]>

commit 28578b9
Author: Yi Liu <[email protected]>
Date:   Wed Jul 10 13:19:27 2024 +0800

    Add docstring for `common` module (#1905)

    Signed-off-by: yiliu30 <[email protected]>

commit 5fde50f
Author: Wang, Chang <[email protected]>
Date:   Wed Jul 10 10:34:46 2024 +0800

    update fp4_e2m1 mapping list (#1906)

    * update fp4_e2m1 mapping list

    * Update utility.py

    * [pre-commit.ci] auto fixes from pre-commit.com hooks

    for more information, see https://pre-commit.ci

    ---------

    Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

commit 3fe2fd9
Author: xinhe <[email protected]>
Date:   Tue Jul 9 15:01:25 2024 +0800

    fix bf16 symbolic_trace bug (#1892)

    Description: fix bf16 symbolic_trace bug,

    - cause abnormal recursive calling.
    - missing necessary attributes
    - By moving BF16 fallback ahead of quantization and removing bf16_symbolic_trace, we fix it.

    ---------

    Signed-off-by: xin3he <[email protected]>
    Co-authored-by: Sun, Xuehao <[email protected]>

commit e080e06
Author: Sun, Xuehao <[email protected]>
Date:   Tue Jul 9 11:04:30 2024 +0800

    remove neural insight CI (#1903)

    Signed-off-by: Sun, Xuehao <[email protected]>

commit f28fcee
Author: Yi Liu <[email protected]>
Date:   Fri Jul 5 15:47:37 2024 +0800

    Remove 1x API (#1865)

    Signed-off-by: yiliu30 <[email protected]>
    Co-authored-by: chen, suyue <[email protected]>

commit 1386ac5
Author: Yi Liu <[email protected]>
Date:   Thu Jul 4 12:18:03 2024 +0800

    Port auto-detect absorb layers for TEQ (#1895)

    Signed-off-by: yiliu30 <[email protected]>

commit 856118e
Author: Wang, Chang <[email protected]>
Date:   Wed Jul 3 13:50:00 2024 +0800

    remove import pdb (#1897)

    Signed-off-by: changwangss <[email protected]>

commit f75ff40
Author: xinhe <[email protected]>
Date:   Wed Jul 3 13:07:48 2024 +0800

    support auto_host2device on RTN and GPTQ(#1894)

    Signed-off-by: He, Xin3 <[email protected]>

commit b9e73f5
Author: chen, suyue <[email protected]>
Date:   Wed Jul 3 11:10:45 2024 +0800

    tmp fix nas deps issue (#1896)

    Signed-off-by: chensuyue <[email protected]>

commit 63b2912
Author: Yi Liu <[email protected]>
Date:   Tue Jul 2 14:46:02 2024 +0800

    Refine HQQ UTs (#1888)

    Signed-off-by: yiliu30 <[email protected]>

commit 5592acc
Author: zehao-intel <[email protected]>
Date:   Tue Jul 2 14:18:51 2024 +0800

    Remove Gelu Fusion for TF Newapi (#1886)

    Signed-off-by: zehao-intel <[email protected]>

commit 4372a76
Author: Kaihui-intel <[email protected]>
Date:   Fri Jun 28 14:55:10 2024 +0800

    Fix sql injection for Neural Solution gRPC (#1879)

    Signed-off-by: Kaihui-intel <[email protected]>

commit 4ae2e87
Author: xinhe <[email protected]>
Date:   Thu Jun 27 09:56:52 2024 +0800

    support quant_lm_head arg in all WOQ configs (#1881)

    Signed-off-by: xin3he <[email protected]>

commit cc763f5
Author: Dina Suehiro Jones <[email protected]>
Date:   Wed Jun 26 18:29:06 2024 -0700

    Update the Gaudi container example in the README (#1885)

commit 1f58f02
Author: Yi Liu <[email protected]>
Date:   Thu Jun 20 22:03:45 2024 +0800

    Add `set_local` support for static quant with pt2e (#1870)

    Signed-off-by: yiliu30 <[email protected]>

commit 0341295
Author: Yi Liu <[email protected]>
Date:   Wed Jun 19 09:40:11 2024 +0800

    rm cov (#1878)

    Signed-off-by: yiliu30 <[email protected]>

commit 503d9ef
Author: Kaihui-intel <[email protected]>
Date:   Tue Jun 18 17:12:12 2024 +0800

    Add op statistics dump for woq (#1876)

    Signed-off-by: Kaihui-intel <[email protected]>

commit 5a0374e
Author: Yi Liu <[email protected]>
Date:   Tue Jun 18 16:21:05 2024 +0800

    Enhance autotune to return the best `q_model` directly (#1875)

    Signed-off-by: yiliu30 <[email protected]>

commit 90fb431
Author: Kaihui-intel <[email protected]>
Date:   Tue Jun 18 16:06:04 2024 +0800

    fix layer match (#1873)

    Signed-off-by: Kaihui-intel <[email protected]>
    Co-authored-by: Sun, Xuehao <[email protected]>

commit f4eb660
Author: Sun, Xuehao <[email protected]>
Date:   Mon Jun 17 16:12:06 2024 +0800

    Limit numpy versions (#1874)

    Signed-off-by: Sun, Xuehao <[email protected]>

commit 2928d85
Author: chen, suyue <[email protected]>
Date:   Fri Jun 14 21:51:13 2024 +0800

    update v2.6 release readme (#1871)

    Signed-off-by: chensuyue <[email protected]>

commit 48c5e3a
Author: Kaihui-intel <[email protected]>
Date:   Fri Jun 14 21:10:14 2024 +0800

    Modify WOQ examples structure (#1866)

    Signed-off-by: Kaihui-intel <[email protected]>
    Signed-off-by: chensuyue <[email protected]>

commit 498af74
Author: Sun, Xuehao <[email protected]>
Date:   Fri Jun 14 21:09:36 2024 +0800

    Update SQ/WOQ status (#1869)

    Signed-off-by: Sun, Xuehao <[email protected]>
    Co-authored-by: chen, suyue <[email protected]>

commit b401b02
Author: Kaihui-intel <[email protected]>
Date:   Fri Jun 14 17:48:03 2024 +0800

    Add PT2E cv&llm example (#1853)

    Signed-off-by: Kaihui-intel <[email protected]>

commit e470f6c
Author: xinhe <[email protected]>
Date:   Fri Jun 14 17:34:26 2024 +0800

    [3x] add recommendation examples (#1844)

    Signed-off-by: xin3he <[email protected]>

commit a141512
Author: zehao-intel <[email protected]>
Date:   Fri Jun 14 14:56:30 2024 +0800

    Improve UT Branch Coverage for TF 3x (#1867)

    Signed-off-by: zehao-intel <[email protected]>

commit b99a79d
Author: Zixuan Cheng <[email protected]>
Date:   Fri Jun 14 14:10:49 2024 +0800

    modify 3.x ipex example structure (#1858)

    * modify 3.x ipex example structure

    Signed-off-by: Cheng, Zixuan <[email protected]>

    * add json path

    Signed-off-by: Cheng, Zixuan <[email protected]>

    * fix for sq

    Signed-off-by: Cheng, Zixuan <[email protected]>

    * minor fix

    Signed-off-by: Cheng, Zixuan <[email protected]>

    * Update run_clm_no_trainer.py

    * Update run_clm_no_trainer.py

    * Update run_clm_no_trainer.py

    * minor fix

    Signed-off-by: Cheng, Zixuan <[email protected]>

    * remove old files

    Signed-off-by: Cheng, Zixuan <[email protected]>

    * fix act_algo

    Signed-off-by: Cheng, Zixuan <[email protected]>

    ---------

    Signed-off-by: Cheng, Zixuan <[email protected]>
    Co-authored-by: xinhe <[email protected]>

commit 922b247
Author: zehao-intel <[email protected]>
Date:   Fri Jun 14 12:33:39 2024 +0800

    Add TF 3x Examples (#1839)

    Signed-off-by: zehao-intel <[email protected]>

commit 70a1d50
Author: Zixuan Cheng <[email protected]>
Date:   Fri Jun 14 10:17:33 2024 +0800

    fix 3x ipex static quant regression (#1864)

    Description
    fix 3x ipex static quant regression
    cannot fallback with op type name ('linear')
    dump wrong op stats (no 'Linear&relu' op type)
    ---------

    Signed-off-by: Cheng, Zixuan <[email protected]>

commit 4e45f8f
Author: zehao-intel <[email protected]>
Date:   Fri Jun 14 10:04:11 2024 +0800

    Improve UT Coverage for TF 3x  (#1852)

    Signed-off-by: zehao-intel <[email protected]>
    Signed-off-by: chensuyue <[email protected]>

commit 794b276
Author: xinhe <[email protected]>
Date:   Thu Jun 13 18:02:04 2024 +0800

    migrate export to 2x and 3x from deprecated (#1845)

    Signed-off-by: xin3he <[email protected]>

commit 0eced14
Author: yuwenzho <[email protected]>
Date:   Wed Jun 12 18:49:17 2024 -0700

    Enhance INC WOQ model loading & support Huggingface WOQ model loading (#1826)

    Signed-off-by: yuwenzho <[email protected]>

commit 6733dab
Author: Wang, Mengni <[email protected]>
Date:   Wed Jun 12 17:08:31 2024 +0800

    update mx script (#1838)

    Signed-off-by: Mengni Wang <[email protected]>

commit a0dee94
Author: Wang, Chang <[email protected]>
Date:   Wed Jun 12 15:01:25 2024 +0800

    Remove export_compressed_model in AWQConfig (#1831)

commit 2c3556d
Author: Huang, Tai <[email protected]>
Date:   Wed Jun 12 14:46:14 2024 +0800

    Add 3x architecture diagram (#1849)

    Signed-off-by: Huang, Tai <[email protected]>

commit 0e2cade
Author: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Date:   Wed Jun 12 14:20:06 2024 +0800

    Bump braces from 3.0.2 to 3.0.3 in /neural_insights/gui (#1862)

    Signed-off-by: dependabot[bot] <[email protected]>

commit 5b5579b
Author: Kaihui-intel <[email protected]>
Date:   Wed Jun 12 14:12:00 2024 +0800

    Fix Neural Solution security issue (#1856)

    Signed-off-by: Kaihui-intel <[email protected]>

commit e9cb48c
Author: xinhe <[email protected]>
Date:   Wed Jun 12 11:19:47 2024 +0800

    improve UT coverage of PT Utils and Quantization (#1842)

    * update UTs

    ---------

    Signed-off-by: xin3he <[email protected]>
    Signed-off-by: xinhe3 <[email protected]>

commit 6b27383
Author: Yi Liu <[email protected]>
Date:   Wed Jun 12 11:11:50 2024 +0800

    Fix config expansion with empty options (#1861)

    Signed-off-by: yiliu30 <[email protected]>

commit 25c71aa
Author: WenjiaoYue <[email protected]>
Date:   Tue Jun 11 17:54:31 2024 +0800

    Delete the static resources of the JupyterLab extension after packaging (#1860)

    Signed-off-by: Yue, Wenjiao <[email protected]>

commit 455f1e1
Author: Wang, Mengni <[email protected]>
Date:   Tue Jun 11 15:28:40 2024 +0800

    Add UT and remove unused code for torch MX quant (#1854)

    * Add UT and remove unused code for torch MX quant
    ---------

    Signed-off-by: Mengni Wang <[email protected]>

Signed-off-by: xinhe3 <[email protected]>
changwangss pushed a commit that referenced this pull request Nov 14, 2024
Merged from INC public master branch

Squashed commit of the following:
commit 27f3e2657b2667e8bca8fb9c02a50d55f404a7e6
Author: Kaihui-intel <[email protected]>
Date:   Tue Oct 22 11:51:37 2024 +0800

    Adapt autoround format (#2038)

    Signed-off-by: Kaihui-intel <[email protected]>

commit 7775768245ec8beea3210910b166e95e2e730586
Author: Sun, Xuehao <[email protected]>
Date:   Sun Oct 20 19:41:36 2024 +0800

    remove autoround limit (#2036)

    Signed-off-by: Sun, Xuehao <[email protected]>

commit 795aeb5bb6950a357da5e339ed97277bc73b4c5c
Author: WeiweiZhang1 <[email protected]>
Date:   Fri Oct 18 17:39:10 2024 +0800

    Add vlm examples, bugfix (#2012)

    * add VLM examples

    Signed-off-by: Zhang, Weiwei1 <[email protected]>

    * bugfix, add utils

    Signed-off-by: Zhang, Weiwei1 <[email protected]>

    * [pre-commit.ci] auto fixes from pre-commit.com hooks

    for more information, see https://pre-commit.ci

    * fix docstring issues

    Signed-off-by: Zhang, Weiwei1 <[email protected]>

    * [pre-commit.ci] auto fixes from pre-commit.com hooks

    for more information, see https://pre-commit.ci

    * bugfix

    Signed-off-by: Zhang, Weiwei1 <[email protected]>

    * [pre-commit.ci] auto fixes from pre-commit.com hooks

    for more information, see https://pre-commit.ci

    * refine examples

    Signed-off-by: Zhang, Weiwei1 <[email protected]>

    * fix scan issue

    Signed-off-by: Zhang, Weiwei1 <[email protected]>

    * [pre-commit.ci] auto fixes from pre-commit.com hooks

    for more information, see https://pre-commit.ci

    * refine shell

    Signed-off-by: Zhang, Weiwei1 <[email protected]>

    * refine scripts & requirements

    Signed-off-by: Zhang, Weiwei1 <[email protected]>

    * typofix

    Signed-off-by: Zhang, Weiwei1 <[email protected]>

    * refine docs

    Signed-off-by: Zhang, Weiwei1 <[email protected]>

    * set attn_implementation for Phi3-vision

    Signed-off-by: Zhang, Weiwei1 <[email protected]>

    * refine phi3 example

    Signed-off-by: Zhang, Weiwei1 <[email protected]>

    * [pre-commit.ci] auto fixes from pre-commit.com hooks

    for more information, see https://pre-commit.ci

    * fix code coverage

    Signed-off-by: Zhang, Weiwei1 <[email protected]>

    * [pre-commit.ci] auto fixes from pre-commit.com hooks

    for more information, see https://pre-commit.ci

    * update config

    Signed-off-by: Sun, Xuehao <[email protected]>

    * refine shells, docs and example. enable qwen2-vl quantization

    Signed-off-by: Zhang, Weiwei1 <[email protected]>

    * [pre-commit.ci] auto fixes from pre-commit.com hooks

    for more information, see https://pre-commit.ci

    * fix ci

    Signed-off-by: Zhang, Weiwei1 <[email protected]>

    * fix EOF error

    Signed-off-by: Sun, Xuehao <[email protected]>

    * update qwen dir

    Signed-off-by: Sun, Xuehao <[email protected]>

    * refine shell, add llama3.2 inference to doc

    Signed-off-by: Zhang, Weiwei1 <[email protected]>

    * bugfix

    Signed-off-by: Zhang, Weiwei1 <[email protected]>

    * [pre-commit.ci] auto fixes from pre-commit.com hooks

    for more information, see https://pre-commit.ci

    * bugfix

    Signed-off-by: Zhang, Weiwei1 <[email protected]>

    * bugfix

    Signed-off-by: Zhang, Weiwei1 <[email protected]>

    * refine eval shell

    Signed-off-by: Zhang, Weiwei1 <[email protected]>

    * fix eval device issue

    Signed-off-by: Zhang, Weiwei1 <[email protected]>

    * refine eval dtype

    Signed-off-by: Zhang, Weiwei1 <[email protected]>

    ---------

    Signed-off-by: Zhang, Weiwei1 <[email protected]>
    Signed-off-by: Sun, Xuehao <[email protected]>
    Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
    Co-authored-by: Sun, Xuehao <[email protected]>

commit b5f3eb7ea604fc1bd235cce15dda87569b70390e
Author: xinhe <[email protected]>
Date:   Fri Oct 18 15:03:53 2024 +0800

    add back missing image (#2035)

    Signed-off-by: xin3he <[email protected]>

commit 45b29d46a2e958b103c6f8a5539fead25809a89a
Author: Huang, Tai <[email protected]>
Date:   Thu Oct 17 15:23:26 2024 +0800

    fix broken link to FP8 example (#2034)

    Signed-off-by: Huang, Tai <[email protected]>

commit 01bf4b2b3a0f12434b5f44f07a9c26abf96fb5f0
Author: Huang, Tai <[email protected]>
Date:   Thu Oct 17 15:22:23 2024 +0800

    update gaudi version mapping table for v3.1 (#2030)

    Signed-off-by: Huang, Tai <[email protected]>
    Co-authored-by: chen, suyue <[email protected]>

commit 5fb21847e12acc51ed4f197eb86b066e6578934b
Author: xinhe <[email protected]>
Date:   Thu Oct 17 15:21:18 2024 +0800

    Cherry pick Habana software 1.18.0 update (#2025)

    Signed-off-by: xinhe3 <[email protected]>
    Signed-off-by: Yi Liu <[email protected]>
    Signed-off-by: Sun, Xuehao <[email protected]>
    Signed-off-by: chensuyue <[email protected]>
    Co-authored-by: yan tomsinsky <[email protected]>
    Co-authored-by: Uri Livne <[email protected]>
    Co-authored-by: Dudi Lester <[email protected]>
    Co-authored-by: Danny <[email protected]>
    Co-authored-by: Tomer Gafni <[email protected]>
    Co-authored-by: Eran Geva <[email protected]>
    Co-authored-by: Daniel Ohayon <[email protected]>
    Co-authored-by: Roi Tiefenbrunn <[email protected]>
    Co-authored-by: Kamil Felskowski <[email protected]>
    Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

commit d6149aac01c8142f6f5ffc18c03433c82f44150c
Author: Yi Liu <[email protected]>
Date:   Wed Oct 16 14:02:29 2024 +0800

    Update the PT2E CV example (#2032)

    Signed-off-by: yiliu30 <[email protected]>

commit 08ec90866f9cbd770bed3d93c35aaaf0087d4fe9
Author: Kaihui-intel <[email protected]>
Date:   Wed Oct 16 09:20:33 2024 +0800

    Remove itrex dependency for 2x example  (#2024)

    Signed-off-by: Kaihui-intel <[email protected]>
    Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

commit d9377b826d24b2e1c206632bc40f39aab02b3d43
Author: Kaihui-intel <[email protected]>
Date:   Tue Oct 15 15:28:37 2024 +0800

    Support generation search for transformers examples (#2029)

    Signed-off-by: Kaihui-intel <[email protected]>

commit 61f1e393b6374703b6516fe9406bafb0cc088009
Author: Kaihui-intel <[email protected]>
Date:   Fri Oct 11 17:07:14 2024 +0800

    Support quant procedure on XPU (#2026)

    Signed-off-by: Kaihui-intel <[email protected]>

commit dfa6aabab3d280085fb166822b7d849a2dc9b36e
Author: Sun, Xuehao <[email protected]>
Date:   Fri Oct 11 16:36:42 2024 +0800

    remove ITREX unit test CI (#2021)

    Signed-off-by: Sun, Xuehao <[email protected]>

commit 2bb257e71353d87414ff7e410ca35bce5cc3dbc7
Author: Kaihui-intel <[email protected]>
Date:   Thu Oct 10 19:27:11 2024 +0800

    Add woq examples (#1982)

    Signed-off-by: Kaihui-intel <[email protected]>
    Signed-off-by: Sun, Xuehao <[email protected]>
    Co-authored-by: Sun, Xuehao <[email protected]>

commit 586eb88fc7b4bfe87bf8fed9f77951623e48bd88
Author: Huang, Tai <[email protected]>
Date:   Wed Oct 9 09:22:39 2024 +0800

    add transformers-like api link in readme (#2022)

    Signed-off-by: Huang, Tai <[email protected]>

commit 4e9c7641589c5f3eec20972f9a16022b7eb7e941
Author: Kaihui-intel <[email protected]>
Date:   Tue Oct 8 13:13:45 2024 +0800

    Remove itrex dependency for 3x example (#2016)

    Signed-off-by: Kaihui-intel <[email protected]>
    Co-authored-by: Sun, Xuehao <[email protected]>

commit a0066d4e55f3dc03a2e0b992286d8806509cf368
Author: Kaihui-intel <[email protected]>
Date:   Mon Sep 30 18:17:32 2024 +0800

    Fix transformers rtn layer-wise quant (#2008)

    Signed-off-by: Kaihui-intel <[email protected]>
    Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

commit 802a5af3c3093941b665e6e9a92c706da1aeccdc
Author: Huang, Tai <[email protected]>
Date:   Mon Sep 30 17:02:52 2024 +0800

    add autoround EMNLP24 to pub list (#2014)

    Signed-off-by: Huang, Tai <[email protected]>

commit 44795a1ae93f3676a595063cf0e6f680c41989b2
Author: Kaihui-intel <[email protected]>
Date:   Mon Sep 30 16:55:22 2024 +0800

    Adapt transformers 4.45.1 (#2019)

    Signed-off-by: Kaihui-intel <[email protected]>
    Co-authored-by: changwangss <[email protected]>
    Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

commit d4662ad47a4af11a9ed8b45429aff007d8c1b605
Author: Kaihui-intel <[email protected]>
Date:   Mon Sep 30 15:52:17 2024 +0800

    Add transformers-like api doc (#2018)

    Signed-off-by: Kaihui-intel <[email protected]>

commit 72398b69334d90cdd7664ac12a025cd36695b55c
Author: Wang, Chang <[email protected]>
Date:   Fri Sep 27 15:11:04 2024 +0800

    fix xpu device set weight and bias (#2010)

    Signed-off-by: changwangss <[email protected]>
    Co-authored-by: Sun, Xuehao <[email protected]>

commit 9d27743705af66a66aafae7fb1d19e2ffad6b2a2
Author: Sun, Xuehao <[email protected]>
Date:   Fri Sep 27 14:17:24 2024 +0800

    Update model accuracy (#2006)

    Signed-off-by: Sun, Xuehao <[email protected]>

commit 7bbc47373033a46fca92ce5ec312d4e051092eee
Author: xinhe <[email protected]>
Date:   Fri Sep 27 11:47:00 2024 +0800

    add pad_to_buckets in evaluation for hpu performance (#2011)

    * add pad_to_buckets in evaluation for hpu performance
    ---------

    Signed-off-by: xin3he <[email protected]>

commit b6b7d7c3c415d67976e054ab5ad5be6b5d5b460d
Author: Kaihui-intel <[email protected]>
Date:   Thu Sep 26 17:21:54 2024 +0800

    Update auto_round requirements for transformers example (#2013)

    Signed-off-by: Kaihui-intel <[email protected]>

commit ee600ba79c5197908ca119446b377be59a5a19e7
Author: Wang, Chang <[email protected]>
Date:   Fri Sep 20 13:54:06 2024 +0800

    add repack_awq_to_optimum_format function (#1998)

    Signed-off-by: changwangss <[email protected]>

commit 4ee6861d666a15c26bb796547d446879e17e6b11
Author: Sun, Xuehao <[email protected]>
Date:   Thu Sep 19 22:27:05 2024 +0800

    remove accelerate version in unit test (#2007)

    Signed-off-by: Sun, Xuehao <[email protected]>

commit 24458114c0765e177b3f4dfbb73d7cfda6b196ab
Author: WeiweiZhang1 <[email protected]>
Date:   Sat Sep 14 18:13:30 2024 +0800

    enable auto_round format export (#2002)

    Signed-off-by: Zhang, Weiwei1 <[email protected]>

commit 906333abd41c8be8a6f097da42c1931ea3bb37d5
Author: Kaihui-intel <[email protected]>
Date:   Sat Sep 14 16:17:46 2024 +0800

    Replace FORCE_DEVICE with INC_TARGET_DEVICE [transformers] (#2005)

    Signed-off-by: Kaihui-intel <[email protected]>

commit 443d00779acac739c3a185f384b78236eaac9643
Author: xinhe <[email protected]>
Date:   Fri Sep 13 21:35:32 2024 +0800

    add INC_FORCE_DEVICE introduction (#1988)

    * add INC_FORCE_DEVICE introduction

    Signed-off-by: xin3he <[email protected]>

    * Update PyTorch.md

    * Update PyTorch.md

    * Update docs/source/3x/PyTorch.md

    Co-authored-by: Yi Liu <[email protected]>

    * rename to INC_TARGET_DEVICE

    Signed-off-by: xin3he <[email protected]>

    ---------

    Signed-off-by: xin3he <[email protected]>
    Co-authored-by: Yi Liu <[email protected]>

commit 5de9a4f56c4cf3901b8ca75d56677255c4e8c833
Author: Kaihui-intel <[email protected]>
Date:   Fri Sep 13 20:48:22 2024 +0800

    Support transformers-like api for woq quantization (#1987)

    Signed-off-by: Kaihui-intel <[email protected]>
    Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
    Co-authored-by: Wang, Chang <[email protected]>

commit 9c39b429baa16591d4fe883e1a7279761f7f86a5
Author: chen, suyue <[email protected]>
Date:   Thu Sep 12 14:34:49 2024 +0800

    update docker image prune rules (#2003)

    Signed-off-by: chensuyue <[email protected]>

commit 09d4f2d6fb1a6aa91874a0b87a967067800462cb
Author: Huang, Tai <[email protected]>
Date:   Mon Sep 9 09:24:35 2024 +0800

    Add recent publications (#1995)

    * add recent publications

    Signed-off-by: Huang, Tai <[email protected]>

    * update total count

    Signed-off-by: Huang, Tai <[email protected]>

    ---------

    Signed-off-by: Huang, Tai <[email protected]>

commit 399cd44a35583bd96701bee58107c6969be0201e
Author: Kaihui-intel <[email protected]>
Date:   Tue Sep 3 16:37:09 2024 +0800

     Remove the save of gptq config (#1993)

    Signed-off-by: Kaihui-intel <[email protected]>

commit 05272c48591567d0a1d36fe6cfe5c697d836887b
Author: Yi Liu <[email protected]>
Date:   Tue Sep 3 10:21:51 2024 +0800

    add per_channel_minmax (#1990)

    Signed-off-by: yiliu30 <[email protected]>

commit 82d8c06c6b535d8db21a6c848a2e374b3b16288e
Author: chen, suyue <[email protected]>
Date:   Fri Aug 30 21:21:00 2024 +0800

    update 3x pt binary build (#1992)

    Signed-off-by: chensuyue <[email protected]>

commit e9f06af240065fd48066d32ec4d856c0b7a62f14
Author: Huang, Tai <[email protected]>
Date:   Fri Aug 30 17:49:48 2024 +0800

    Update installation_guide.md (#1989)

    Correct typo in installation doc

commit 093c9669692c8b9263cfbc16d7299da4170c8201
Author: Wang, Chang <[email protected]>
Date:   Fri Aug 30 17:45:54 2024 +0800

    add quantize, save, load function for transformers-like api (#1986)

    Signed-off-by: changwangss <[email protected]>

commit 4dd49a43dec86aea581db4f29c7ca36b0baf1f7c
Author: xinhe <[email protected]>
Date:   Thu Aug 29 17:23:18 2024 +0800

    add hasattr check for torch fp8 dtype (#1985)

    Signed-off-by: xin3he <[email protected]>

commit f2c454f88c0ffbb4d30d66eedaa6fc56ad47f804
Author: chen, suyue <[email protected]>
Date:   Thu Aug 29 13:45:39 2024 +0800

    update installation and ci test for 3x api (#1991)

    Signed-off-by: chensuyue <[email protected]>

commit 7ba9fdcb24a8ea1c1efc27844f39d0c128f83517
Author: Kaihui-intel <[email protected]>
Date:   Mon Aug 19 14:50:50 2024 +0800

    support gptq `true_sequential` and `quant_lm_head` (#1977)

    Signed-off-by: Kaihui-intel <[email protected]>

commit 68b1f8b734bff723dd4962da08ecdc0d22c5faab
Author: Sun, Xuehao <[email protected]>
Date:   Fri Aug 16 09:43:46 2024 +0800

    Fix UT env and upgrade torch to 2.4.0 (#1978)

    Signed-off-by: Sun, Xuehao <[email protected]>

commit f9dfd54272348483037cc70802cd85a085fec39c
Author: Yi Liu <[email protected]>
Date:   Thu Aug 15 14:13:26 2024 +0800

    Skip some tests for torch 2.4 (#1981)

    Signed-off-by: yiliu30 <[email protected]>

commit 46d9192659f1c0dcf488e2e69f0f7dd7bd0b2f2e
Author: xinhe <[email protected]>
Date:   Thu Aug 15 09:57:22 2024 +0800

    update readme for fp8 (#1979)

    Signed-off-by: xinhe3 <[email protected]>

commit 842b7159fafa09300bc0e745c802910a2d60502e
Author: chen, suyue <[email protected]>
Date:   Tue Aug 13 12:09:25 2024 +0800

    bump main version into v3.1 (#1974)

    Signed-off-by: chensuyue <[email protected]>

commit 3845cdc4837e7f0ede12b9de0906b7d01899fc00
Author: Neo Zhang Jianyu <[email protected]>
Date:   Tue Aug 13 12:09:09 2024 +0800

    fix online doc search issue (#1975)

    Co-authored-by: ZhangJianyu <[email protected]>

commit 7056720df96f17c706522bc6b0530df534d22ee7
Author: chen, suyue <[email protected]>
Date:   Sun Aug 11 20:58:34 2024 +0800

    update main page (#1973)

    Signed-off-by: chensuyue <[email protected]>

commit 95197d1697e19323b124c2a32bdef7425d4d1c3e
Author: xinhe <[email protected]>
Date:   Sat Aug 10 23:28:43 2024 +0800

    Cherry pick v1.17.0 (#1964)

    * [SW-184941] INC CI, CD and Promotion

    Change-Id: I60c420f9776e1bdab7bb9e02e5bcbdb6891bfe52

    * [SW-183320]updated setup.py

    Change-Id: I592af89486cb1d9e0b5197521c428920197a9103

    * [SW-177474] add HQT FP8 porting code

    Change-Id: I4676f13a5ed43c444f2ec68675cc41335e7234dd
    Signed-off-by: Zhou Yuwen <[email protected]>

    * [SW-189361] Fix white list extend

    Change-Id: Ic2021c248798fce37710d28014a6d59259c868a3

    * [SW-191317] Raise exception according to hqt config object

    Change-Id: I06ba8fa912c811c88912987c11e5c12ef328348a

    * [SW-184714] Port HQT code into INC

    HQT lib content was copied as is under fp8_quant

    Tests were copied to 3.x torch location

    Change-Id: Iec6e1fa7ac4bf1df1c95b429524c40e32bc13ac9

    * [SW-184714] Add internal folder to fp8 quant

    This is a folder used for experiments,
    not to be used by users

    Change-Id: I9e221ae582794e304e95392c0f37638f7bce69bc

    * [SW-177468] Removed unused code + cleanup

    Change-Id: I4d27c067e87c1a30eb1da9df16a16c46d092c638

    * Fix errors in regression_detection

    Change-Id: Iee5318bd5593ba349812516eb5641958ece3c438

    * [SW-187731] Save orig module as member of patched module

    This allows direct usage of the original module methods,
    which solves torch compile issue

    Change-Id: I464d8bd1bacdfc3cd1f128a67114e1e43f092632

    * [SW-190899] Install packages according to configuration

    Change-Id: I570b490658f5d2c5399ba1db93f8f52f56449525

    * [SW-184689] use finalize_calibration intrenaly for one step flow

    Change-Id: Ie0b8b426c951cf57ed7e6e678c86813fb2d05c89

    * [SW-191945] align requirement_pt.txt in gerrit INC with Github INC

    Change-Id: If5c0dbf21bf989af37a8e29246e4f8760cd215ef
    Signed-off-by: xinhe3 <[email protected]>

    * [SW-192358] Remove HQT reference in INC

    Change-Id: Ic25f9323486596fa2dc6d909cd568a37ab84dd5e

    * [SW-191415] update fp8 maxAbs observer  using torch.copy_

    Change-Id: I3923c832f9a8a2b14e392f3f4719d233a457702f

    * [SW-184943] Enhance INC WOQ model loading

    - Support loading huggingface WOQ model
    - Abstract WeightOnlyLinear base class. Add INCWeightOnlyLinear and HPUWeighOnlyLinear subclasses
    - Load woq linear weight module by module
    - Save hpu format tensor to reuse it once load it again

    Change-Id: I679a42759b49e1f45f52bbb0bdae8580a23d0bcf

    * [SW-190303] Implement HPUWeightOnlyLinear class in INC

    Change-Id: Ie05c8787e708e2c3559dce24ef0758d6c498ac41

    * [SW-192809] fix json_file bug when instantiating FP8Config class

    Change-Id: I4a715d0a706efe20ccdb49033755cabbc729ccdc
    Signed-off-by: Zhou Yuwen <[email protected]>

    * [SW-192931] align setup.py with github INC and remove fp8_convert

    Change-Id: Ibbc157646cfcfad64b323ecfd96b9bbda5ba9e2f
    Signed-off-by: xinhe3 <[email protected]>

    * [SW-192917] Update all HQT logic files with pre-commit check

    Change-Id: I119dc8578cb10932fd1a8a674a8bdbf61f978e42
    Signed-off-by: xinhe3 <[email protected]>

    * update docstring

    Signed-off-by: yuwenzho <[email protected]>

    * add fp8 example and document (#1639)

    Signed-off-by: xinhe3 <[email protected]>

    * Update settings to be compatible with gerrit

    * enhance ut

    Signed-off-by: yuwenzho <[email protected]>

    * move fp8 sample to helloworld folder

    Signed-off-by: yuwenzho <[email protected]>

    * update torch version of habana docker

    Signed-off-by: xinhe3 <[email protected]>

    * [pre-commit.ci] auto fixes from pre-commit.com hooks

    for more information, see https://pre-commit.ci

    * update readme demo

    Signed-off-by: xinhe3 <[email protected]>

    * update WeightOnlyLinear to INCWeightOnlyLinear

    Signed-off-by: xinhe3 <[email protected]>

    * [pre-commit.ci] auto fixes from pre-commit.com hooks

    for more information, see https://pre-commit.ci

    * add docstring for FP8Config

    Signed-off-by: xinhe3 <[email protected]>

    * fix pylint

    Signed-off-by: xinhe3 <[email protected]>

    * update fp8 test scripts

    Signed-off-by: chensuyue <[email protected]>

    * delete deps

    Signed-off-by: chensuyue <[email protected]>

    * update container into v1.17.0

    Signed-off-by: chensuyue <[email protected]>

    * update docker version

    Signed-off-by: xinhe3 <[email protected]>

    * update pt ut

    Signed-off-by: chensuyue <[email protected]>

    * add lib path

    Signed-off-by: chensuyue <[email protected]>

    * fix dir issue

    Signed-off-by: xinhe3 <[email protected]>

    * [pre-commit.ci] auto fixes from pre-commit.com hooks

    for more information, see https://pre-commit.ci

    * update fp8 test scope

    Signed-off-by: chensuyue <[email protected]>

    * fix typo

    Signed-off-by: xinhe3 <[email protected]>

    * update fp8 test scope

    Signed-off-by: chensuyue <[email protected]>

    * update pre-commit-ci

    Signed-off-by: chensuyue <[email protected]>

    * work around for hpu

    Signed-off-by: xinhe3 <[email protected]>

    * fix UT

    Signed-off-by: xinhe3 <[email protected]>

    * fix parameter

    Signed-off-by: chensuyue <[email protected]>

    * omit some test

    Signed-off-by: chensuyue <[email protected]>

    * update main page example to llm loading

    Signed-off-by: xinhe3 <[email protected]>

    * [pre-commit.ci] auto fixes from pre-commit.com hooks

    for more information, see https://pre-commit.ci

    * fix autotune

    Signed-off-by: xinhe3 <[email protected]>

    ---------

    Signed-off-by: Zhou Yuwen <[email protected]>
    Signed-off-by: xinhe3 <[email protected]>
    Signed-off-by: yuwenzho <[email protected]>
    Signed-off-by: chensuyue <[email protected]>
    Co-authored-by: yan tomsinsky <[email protected]>
    Co-authored-by: Ron Ben Moshe <[email protected]>
    Co-authored-by: Uri Livne <[email protected]>
    Co-authored-by: Danny Semiat <[email protected]>
    Co-authored-by: smarkovichgolan <[email protected]>
    Co-authored-by: Dudi Lester <[email protected]>

commit de0fa21cd9d6291b521281b2b5fc8f6519cb84ae
Author: Huang, Tai <[email protected]>
Date:   Fri Aug 9 22:32:37 2024 +0800

    Fix broken link in docs (#1969)

    Signed-off-by: Huang, Tai <[email protected]>

commit 385da7c7ed018a66fcba6e28658d1a5eea2e52e4
Author: Sun, Xuehao <[email protected]>
Date:   Fri Aug 9 21:53:51 2024 +0800

    Add 3.x readme (#1971)

    Signed-off-by: Sun, Xuehao <[email protected]>

commit acd8f4f182eaccf03b221f765ec0ddb451be3415
Author: Huang, Tai <[email protected]>
Date:   Fri Aug 9 15:24:14 2024 +0800

    Add version mapping between INC and Gaudi SW Stack (#1967)

    Signed-off-by: Huang, Tai <[email protected]>

commit 74a4641390b4d8c11dc66ff8ef40df92c298b996
Author: Sun, Xuehao <[email protected]>
Date:   Fri Aug 9 10:23:59 2024 +0800

    remove unnecessary CI (#1966)

    Signed-off-by: Sun, Xuehao <[email protected]>

commit b99abae5d937380cf9df80c9050fce18bddfb72d
Author: Kaihui-intel <[email protected]>
Date:   Tue Aug 6 16:02:03 2024 +0800

    Fix `opt_125m_woq_gptq_int4_dq_ggml` issue (#1965)

    Signed-off-by: Kaihui-intel <[email protected]>

commit b35ff8f0044bdf12da87647d0404b62ae5ff7d3d
Author: Zixuan Cheng <[email protected]>
Date:   Fri Aug 2 09:06:35 2024 +0800

    example update for 3.x ipex sq (#1902)

    Signed-off-by: violetch24 <[email protected]>

commit 000946fce147a02ad6662538e337570c0a56329d
Author: Zixuan Cheng <[email protected]>
Date:   Thu Aug 1 10:19:32 2024 +0800

    add SDXL model example to INC 3.x (#1887)

    * add SDXL model example to INC 3.x

    Signed-off-by: Cheng, Zixuan <[email protected]>

    * add evaluation script

    Signed-off-by: violetch24 <[email protected]>

    * add test script

    Signed-off-by: violetch24 <[email protected]>

    * minor fix

    Signed-off-by: violetch24 <[email protected]>

    * Update run_quant.sh

    * add iter limit

    Signed-off-by: violetch24 <[email protected]>

    * modify test script

    Signed-off-by: violetch24 <[email protected]>

    * update json

    Signed-off-by: chensuyue <[email protected]>

    * add requirements

    Signed-off-by: violetch24 <[email protected]>

    * Update run_benchmark.sh

    * Update sdxl_smooth_quant.py

    * minor fix

    Signed-off-by: violetch24 <[email protected]>

    ---------

    Signed-off-by: Cheng, Zixuan <[email protected]>
    Signed-off-by: violetch24 <[email protected]>
    Signed-off-by: chensuyue <[email protected]>
    Co-authored-by: violetch24 <[email protected]>
    Co-authored-by: chensuyue <[email protected]>

commit aa42e5edcd0b5196a21ee7bb68a7965125601fea
Author: xinhe <[email protected]>
Date:   Wed Jul 31 15:36:06 2024 +0800

    replenish docstring (#1955)

    * replenish docstring

    Signed-off-by: xin3he <[email protected]>

    * update  Quantizer API docstring

    Signed-off-by: xin3he <[email protected]>

    * Add docstring for auto accelerator (#1956)

    Signed-off-by: yiliu30 <[email protected]>

    * temporary remove torch/quantization and add it back after fp8 code is updated.

    * Update config.py

    ---------

    Signed-off-by: xin3he <[email protected]>
    Signed-off-by: yiliu30 <[email protected]>
    Co-authored-by: Yi Liu <[email protected]>

commit 81a076d7c59609be666ddddf64a574cacf1a5c36
Author: Neo Zhang Jianyu <[email protected]>
Date:   Wed Jul 31 13:51:33 2024 +0800

    fix welcome.html link issue (#1962)

    Co-authored-by: ZhangJianyu <[email protected]>

commit 87f02c15a2f1047a8b4bcb5b7f443a4cecb4dfc7
Author: chen, suyue <[email protected]>
Date:   Wed Jul 31 10:09:47 2024 +0800

    fix docs link (#1959)

    Signed-off-by: chensuyue <[email protected]>

commit 03813e2894871fce7a95fb4ee584aab6c5bb18f7
Author: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Date:   Wed Jul 31 10:09:29 2024 +0800

    Bump tensorflow version (#1961)

    Signed-off-by: dependabot[bot] <[email protected]>

commit 3b5dbf681d8e9beb47eb0d1be4c5a58f4018d42a
Author: Kaihui-intel <[email protected]>
Date:   Tue Jul 30 17:27:21 2024 +0800

    Set low_gpu_mem_usage=False for AutoRound

    Signed-off-by: Kaihui-intel <[email protected]>

commit 41244d3bc65fd646d2d6d88ca2c6686f2ab65bc6
Author: chen, suyue <[email protected]>
Date:   Mon Jul 29 23:05:36 2024 +0800

    new previous results could not find all raise issues in CI model test (#1958)

    Signed-off-by: chensuyue <[email protected]>

commit 190e6b2be6b31158a1101729bcf621bc93e85531
Author: Kaihui-intel <[email protected]>
Date:   Mon Jul 29 19:39:57 2024 +0800

    Fix itrex qbits nf4/int8 training core dumped issue (#1954)

    Signed-off-by: Kaihui-intel <[email protected]>
    Signed-off-by: chensuyue <[email protected]>

commit 0e724a4d96ca0d6a170281688ca644b37fa340e0
Author: Kaihui-intel <[email protected]>
Date:   Mon Jul 29 16:22:13 2024 +0800

    Add save/load for pt2e example (#1927)

    Signed-off-by: Kaihui-intel <[email protected]>

commit 50eb6fb6f5924054b38d8ed99e78e0ebdab51f50
Author: chen, suyue <[email protected]>
Date:   Mon Jul 29 13:40:36 2024 +0800

    update 3x torch installation (#1957)

    Signed-off-by: chensuyue <[email protected]>

commit 6e1b1da712d20d9291e5932974bc3167b00dd214
Author: Zixuan Cheng <[email protected]>
Date:   Fri Jul 26 15:58:00 2024 +0800

    add ipex xpu example to 3x API (#1948)

    Signed-off-by: violetch24 <[email protected]>

commit 19024b351372ca76934db33b0d230552c13bff39
Author: zehao-intel <[email protected]>
Date:   Fri Jul 26 14:52:01 2024 +0800

    Enable yolov5 Example for TF 3x API  (#1943)

    Signed-off-by: zehao-intel <[email protected]>

commit d84a93f7db8eeb69099aa332a4c01a743c9f4090
Author: zehao-intel <[email protected]>
Date:   Thu Jul 25 14:45:19 2024 +0800

    Complement UT of calibration function for TF 3x API (#1945)

    Signed-off-by: zehao-intel <[email protected]>

commit fb8577931c11c3bdc55868e01576b73372d9912b
Author: zehao-intel <[email protected]>
Date:   Thu Jul 25 14:04:25 2024 +0800

    Update Examples for TF 3x API (#1901)

    Signed-off-by: zehao-intel <[email protected]>

commit 6b30207d0a3b6d6d497ecf8f6bb5891765d798ba
Author: zehao-intel <[email protected]>
Date:   Thu Jul 25 13:39:06 2024 +0800

    Add Docstring for TF 3x API and Torch 3x Mixed Precision (#1944)

    Signed-off-by: zehao-intel <[email protected]>

commit d254d508be9c6b14c474fd643ad448a4e261ca72
Author: Yi Liu <[email protected]>
Date:   Wed Jul 24 21:50:44 2024 +0800

    Update doc for client-usage and LWQ (#1947)

    Signed-off-by: yiliu30 <[email protected]>

commit f253d35a152b8003cfc8738fd3c6db7930149264
Author: Neo Zhang Jianyu <[email protected]>
Date:   Wed Jul 24 17:48:05 2024 +0800

    Update publish.yml (#1950)

commit 6cda338a042073aba61ba411a6fc563fc8731889
Author: Neo Zhang Jianyu <[email protected]>
Date:   Wed Jul 24 17:31:19 2024 +0800

    Update publish.yml (#1949)

    * Update publish.yml

    * Update publish.yml

commit c80b68afdba7a55b19898b1b9ff3e21d18b57427
Author: Kaihui-intel <[email protected]>
Date:   Tue Jul 23 21:26:53 2024 +0800

    Update AutoRound commit version (#1941)

    Signed-off-by: Kaihui-intel <[email protected]>

commit 9077b382259e2e56ff5796084a1f4275e4387537
Author: zehao-intel <[email protected]>
Date:   Tue Jul 23 17:04:37 2024 +0800

    Refine Pytorch 3x Mixed Precision Example (#1946)

    Signed-off-by: zehao-intel <[email protected]>

commit efcb2930be6b9d575b1fb8a6e86afdd6a09b5857
Author: Neo Zhang Jianyu <[email protected]>
Date:   Tue Jul 23 10:15:41 2024 +0800

    Update for API 3.0 online doc (#1940)

    Co-authored-by: ZhangJianyu <[email protected]>

commit b787940ea2868e1fc8a56a81b94d62d4ea3d8454
Author: Wang, Mengni <[email protected]>
Date:   Tue Jul 23 10:12:34 2024 +0800

    add docstring for mx quant (#1932)

    Signed-off-by: Mengni Wang <[email protected]>
    Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
    Co-authored-by: xinhe <[email protected]>

commit 0c52e1243b78734e95fc348834303bc3c3cfe369
Author: Kaihui-intel <[email protected]>
Date:   Tue Jul 23 09:59:17 2024 +0800

    Add docstring for WOQ&LayerWise (#1938)

    Signed-off-by: Kaihui-intel <[email protected]>
    Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
    Co-authored-by: xinhe <[email protected]>

commit 08914d6b0e365212fee6016d03dcdc087bd7e441
Author: Huang, Tai <[email protected]>
Date:   Mon Jul 22 11:14:44 2024 +0800

    add read permission token (#1942)

    Signed-off-by: Huang, Tai <[email protected]>

commit e106dea73471ddecdb1cfc702e90fcb1a5d41452
Author: zehao-intel <[email protected]>
Date:   Sun Jul 21 21:48:51 2024 +0800

    Update Example for Pytorch 3x Mixed Precision (#1882)

    Signed-off-by: zehao-intel <[email protected]>

commit 1ebf6987bd054b926d3cdd5630ae058c8d3a66c2
Author: Zixuan Cheng <[email protected]>
Date:   Fri Jul 19 15:56:09 2024 +0800

    add docstring for static quant and smooth quant (#1936)

    * add docstring for static quant and smooth quant

    Signed-off-by: violetch24 <[email protected]>

    * format fix

    Signed-off-by: violetch24 <[email protected]>

    * update scan path

    Signed-off-by: violetch24 <[email protected]>

    * Update utility.py

    ---------

    Signed-off-by: violetch24 <[email protected]>
    Co-authored-by: violetch24 <[email protected]>

commit 296c5d4f1138e5bf33584fb75cea0f6ca5080122
Author: Yi Liu <[email protected]>
Date:   Fri Jul 19 15:08:05 2024 +0800

    Add docstring for PT2E and HQQ (#1937)

    Signed-off-by: yiliu30 <[email protected]>

commit 437c8e75706cff1767dcde115e428654766b3f18
Author: Kaihui-intel <[email protected]>
Date:   Thu Jul 18 10:00:41 2024 +0800

    Fix unused pkgs  import (#1931)

    Signed-off-by: Kaihui-intel <[email protected]>

commit ff3740146a829e845d79266acf233b202843d3fd
Author: chen, suyue <[email protected]>
Date:   Wed Jul 17 23:11:15 2024 +0800

    3.X API installation update (#1935)

    Signed-off-by: chensuyue <[email protected]>

commit 6c27c19c3ec7a318455bd12d6e66ad9bb757ab93
Author: zehao-intel <[email protected]>
Date:   Wed Jul 17 20:35:42 2024 +0800

    Support calib_func on TF 3x API (#1934)

    Signed-off-by: zehao-intel <[email protected]>

commit 53e6ee6b75d476bae0382c7d6fb9aa1348c2ab5e
Author: Zixuan Cheng <[email protected]>
Date:   Wed Jul 17 20:35:03 2024 +0800

    Support xpu for ipex static quant (#1916)

    Signed-off-by: violetch24 <[email protected]>

commit a1cc618df6efa823bb1834ff2f8be83531f91178
Author: chen, suyue <[email protected]>
Date:   Wed Jul 17 17:29:49 2024 +0800

    remove peft version limit (#1933)

    Signed-off-by: chensuyue <[email protected]>

commit 30583882df76838ea3e4a719e25ddca7bb449b9b
Author: Yi Liu <[email protected]>
Date:   Wed Jul 17 15:31:38 2024 +0800

    Add doc for client usage (#1914)

    Signed-off-by: yiliu30 <[email protected]>

commit 29471df05a9e2c36c4ad8083c0b0b285011748d8
Author: Kaihui-intel <[email protected]>
Date:   Wed Jul 17 12:12:40 2024 +0800

    Enhance load_empty_model import (#1930)

    Signed-off-by: Kaihui-intel <[email protected]>

commit fd96851f7f8339ec8bfabd602cf494ac6c31d17b
Author: Kaihui-intel <[email protected]>
Date:   Wed Jul 17 12:05:32 2024 +0800

    Integrate AutoRound v0.3 to 2x (#1926)

    Signed-off-by: Kaihui-intel <[email protected]>

commit bfa27e422dc4760f6a9b1783eee7dae10fe5324f
Author: Kaihui-intel <[email protected]>
Date:   Wed Jul 17 09:33:13 2024 +0800

    Integrate AutoRound v0.3 (#1925)

    Signed-off-by: Kaihui-intel <[email protected]>

commit 5767aed4dbc9a400f65f74bdc9c09209f0a4c145
Author: xinhe <[email protected]>
Date:   Wed Jul 17 09:16:37 2024 +0800

    add docstring for torch.quantization and torch.utils (#1928)

    Signed-off-by: xin3he <[email protected]>

commit f909bca86cfe7881119b62c4e75ca1f330718764
Author: chen, suyue <[email protected]>
Date:   Tue Jul 16 21:12:54 2024 +0800

    update itrex ut test (#1929)

    Signed-off-by: chensuyue <[email protected]>

commit 649e6b148755bda737009bc323b735b92231c579
Author: Kaihui-intel <[email protected]>
Date:   Tue Jul 16 21:05:55 2024 +0800

    Support LayerWise for RTN/GPTQ (#1883)

    Signed-off-by: Kaihui-intel <[email protected]>
    Co-authored-by: chensuyue <[email protected]>

commit de43d851a24a5f4290fe148f7d3607cad6d8433f
Author: Kaihui-intel <[email protected]>
Date:   Tue Jul 16 17:18:12 2024 +0800

    Support absorb dict for awq (#1920)

    Signed-off-by: Kaihui-intel <[email protected]>

commit e9765955f991e1270e3b65635285f6b6cb8fc38c
Author: Kaihui-intel <[email protected]>
Date:   Tue Jul 16 17:17:56 2024 +0800

    Support woq Autotune (#1921)

    Signed-off-by: Kaihui-intel <[email protected]>

commit d56075c7e9f6e3e85385abbff9f1b0d07d157a04
Author: Huang, Tai <[email protected]>
Date:   Tue Jul 16 15:21:06 2024 +0800

    fix typo in architecture diagram (#1924)

    Signed-off-by: Huang, Tai <[email protected]>

commit 0a542397ac1ea8d6fe2edf04565d3cb673001b2c
Author: chen, suyue <[email protected]>
Date:   Tue Jul 16 15:12:43 2024 +0800

    update documentation for 3x API (#1923)

    Signed-off-by: chensuyue <[email protected]>
    Signed-off-by: xin3he <[email protected]>
    Signed-off-by: yiliu30 <[email protected]>

commit be42d033b25c6dd3bcac0ead964699f25f939014
Author: xinhe <[email protected]>
Date:   Tue Jul 16 09:48:48 2024 +0800

    implement TorchBaseConfig (#1911)

    Signed-off-by: xin3he <[email protected]>

commit 7a4715c1d488441e383b7c999fd1b574a3f6ceda
Author: Kaihui-intel <[email protected]>
Date:   Mon Jul 15 14:59:03 2024 +0800

    Support PT2E save and load (#1918)

    Signed-off-by: Kaihui-intel <[email protected]>

commit 34f0a9f450b385aa3227f7f34e8d0f16460080a9
Author: Yi Liu <[email protected]>
Date:   Mon Jul 15 09:10:14 2024 +0800

    Add `save`/`load` support for HQQ (#1913)

    Signed-off-by: yiliu30 <[email protected]>
    Co-authored-by: chen, suyue <[email protected]>

commit d3204604aad007f3db67c46dcb0575aa8f5cd584
Author: Yi Liu <[email protected]>
Date:   Fri Jul 12 14:48:12 2024 +0800

    remove 1x docs (#1900)

    Signed-off-by: yiliu30 <[email protected]>

commit 6c547f7c4cd71342e28a1b23f827a6aa7aa91bb8
Author: chen, suyue <[email protected]>
Date:   Fri Jul 12 14:42:04 2024 +0800

    fix CI docker container clean up issue (#1917)

    Signed-off-by: chensuyue <[email protected]>

commit 17036587d84d2b42e0e9eb501d175e78d552c063
Author: chen, suyue <[email protected]>
Date:   Fri Jul 12 11:14:48 2024 +0800

    Remove deprecated modules (#1872)

    Signed-off-by: chensuyue <[email protected]>

commit f698c96c817c56292a66aee07b3e1396e074b966
Author: chen, suyue <[email protected]>
Date:   Thu Jul 11 18:00:28 2024 +0800

    update Gaudi CI baseline artifacts name (#1912)

    Signed-off-by: chensuyue <[email protected]>

commit 4a45093c1418f34da2660a54052a2ff5c2b4edff
Author: Yi Liu <[email protected]>
Date:   Thu Jul 11 17:47:47 2024 +0800

    Add export support for TEQ (#1910)

    Signed-off-by: yiliu30 <[email protected]>

commit 16a7b11508c008d4d4180a0fe0e31c75b8e5d662
Author: Yi Liu <[email protected]>
Date:   Thu Jul 11 17:13:24 2024 +0800

    Get default config based on the auto-detect CPU type (#1904)

    Signed-off-by: yiliu30 <[email protected]>

commit 2fc72555c987dc7bce8476b389720e1a29159a43
Author: xinhe <[email protected]>
Date:   Thu Jul 11 13:22:52 2024 +0800

    implement `incbench` command for ease-of-use benchmark (#1884)
     implement incbench command as entrypoint for ease-of-use benchmark
     automatically check numa/socket info and dump it with table for ease-of-understand
     supports both Linux and Windows platform
     add benchmark documents
     dump benchmark summary
     add benchmark UTs
    incbench main.py: run 1 instance on NUMA:0.
    incbench --num_i 2 main.py: run 2 instances on NUMA:0.
    incbench --num_c 2 main.py: run multi-instances with 2 cores per instance on NUMA:0.
    incbench -C 24-47 main.py: run 1 instance on COREs:24-47.
    incbench -C 24-47 --num_c 4 main.py: run multi-instances with 4 COREs per instance on COREs:24-47.

    ---------

    Signed-off-by: xin3he <[email protected]>
    Co-authored-by: chen, suyue <[email protected]>

commit de8577ef5874f85d39c3b08f63c98f22c3ce25c6
Author: chen, suyue <[email protected]>
Date:   Wed Jul 10 17:21:45 2024 +0800

    bump version into 3.0 (#1908)

    Signed-off-by: chensuyue <[email protected]>

commit 01f16c4e816fec9d05d34f9d2bd7e425a59b803c
Author: chen, suyue <[email protected]>
Date:   Wed Jul 10 17:19:57 2024 +0800

    support habana fp8 UT test in CI (#1909)

    Signed-off-by: chensuyue <[email protected]>

commit 28578b96bf6217fa2b79699838e5a4af30843de4
Author: Yi Liu <[email protected]>
Date:   Wed Jul 10 13:19:27 2024 +0800

    Add docstring for `common` module (#1905)

    Signed-off-by: yiliu30 <[email protected]>

commit 5fde50f2c0476dbc08d59481b742515f5a210de1
Author: Wang, Chang <[email protected]>
Date:   Wed Jul 10 10:34:46 2024 +0800

    update fp4_e2m1 mapping list (#1906)

    * update fp4_e2m1 mapping list

    * Update utility.py

    * [pre-commit.ci] auto fixes from pre-commit.com hooks

    for more information, see https://pre-commit.ci

    ---------

    Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

commit 3fe2fd9aadda4991552d65fef09a75ba5127b5db
Author: xinhe <[email protected]>
Date:   Tue Jul 9 15:01:25 2024 +0800

    fix bf16 symbolic_trace bug (#1892)

    Description: fix bf16 symbolic_trace bug,

    - cause abnormal recursive calling.
    - missing necessary attributes
    - By moving BF16 fallback ahead of quantization and removing bf16_symbolic_trace, we fix it.

    ---------

    Signed-off-by: xin3he <[email protected]>
    Co-authored-by: Sun, Xuehao <[email protected]>

commit e080e06d38447d2ab869fe8265a04e464a732057
Author: Sun, Xuehao <[email protected]>
Date:   Tue Jul 9 11:04:30 2024 +0800

    remove neural insight CI (#1903)

    Signed-off-by: Sun, Xuehao <[email protected]>

commit f28fcee6cc7bd6b3e1642157744f38686b1b9a91
Author: Yi Liu <[email protected]>
Date:   Fri Jul 5 15:47:37 2024 +0800

    Remove 1x API (#1865)

    Signed-off-by: yiliu30 <[email protected]>
    Co-authored-by: chen, suyue <[email protected]>

commit 1386ac5ec7be40608dfac082d2275307b8e4d14e
Author: Yi Liu <[email protected]>
Date:   Thu Jul 4 12:18:03 2024 +0800

    Port auto-detect absorb layers for TEQ (#1895)

    Signed-off-by: yiliu30 <[email protected]>

commit 856118e36f7670136c8d83dfbc232010d13d72a6
Author: Wang, Chang <[email protected]>
Date:   Wed Jul 3 13:50:00 2024 +0800

    remove import pdb (#1897)

    Signed-off-by: changwangss <[email protected]>

commit f75ff4082bc7a22d9367d3e91a3ea2c7aaec2bd2
Author: xinhe <[email protected]>
Date:   Wed Jul 3 13:07:48 2024 +0800

    support auto_host2device on RTN and GPTQ(#1894)

    Signed-off-by: He, Xin3 <[email protected]>

commit b9e73f5cf34f824a9b84d74f725c6157dc6430a2
Author: chen, suyue <[email protected]>
Date:   Wed Jul 3 11:10:45 2024 +0800

    tmp fix nas deps issue (#1896)

    Signed-off-by: chensuyue <[email protected]>

commit 63b29126b7c1958939af388d48e56fcceb85db6f
Author: Yi Liu <[email protected]>
Date:   Tue Jul 2 14:46:02 2024 +0800

    Refine HQQ UTs (#1888)

    Signed-off-by: yiliu30 <[email protected]>

commit 5592acc60562b7fccb308af0eaaba9cad53004a5
Author: zehao-intel <[email protected]>
Date:   Tue Jul 2 14:18:51 2024 +0800

    Remove Gelu Fusion for TF Newapi (#1886)

    Signed-off-by: zehao-intel <[email protected]>

commit 4372a762585189accc65196e081a0a7a85f5af9e
Author: Kaihui-intel <[email protected]>
Date:   Fri Jun 28 14:55:10 2024 +0800

    Fix sql injection for Neural Solution gRPC (#1879)

    Signed-off-by: Kaihui-intel <[email protected]>

commit 4ae2e87d2f98eb34c2e523a76ffa6ff77bf767e1
Author: xinhe <[email protected]>
Date:   Thu Jun 27 09:56:52 2024 +0800

    support quant_lm_head arg in all WOQ configs (#1881)

    Signed-off-by: xin3he <[email protected]>

commit cc763f5134f5f84b3020a8ea1bee409a60d15218
Author: Dina Suehiro Jones <[email protected]>
Date:   Wed Jun 26 18:29:06 2024 -0700

    Update the Gaudi container example in the README (#1885)

commit 1f58f024d812b6c1f7f3430b62e61051599cd1b2
Author: Yi Liu <[email protected]>
Date:   Thu Jun 20 22:03:45 2024 +0800

    Add `set_local` support for static quant with pt2e (#1870)

    Signed-off-by: yiliu30 <[email protected]>

commit 0341295de95dce5d5c775fdba78de85e3d3a041d
Author: Yi Liu <[email protected]>
Date:   Wed Jun 19 09:40:11 2024 +0800

    rm cov (#1878)

    Signed-off-by: yiliu30 <[email protected]>

commit 503d9ef4136023f1952e397a2ab0f7f476040901
Author: Kaihui-intel <[email protected]>
Date:   Tue Jun 18 17:12:12 2024 +0800

    Add op statistics dump for woq (#1876)

    Signed-off-by: Kaihui-intel <[email protected]>

commit 5a0374e7db23cac209af78f1ace9b38d23bebbb0
Author: Yi Liu <[email protected]>
Date:   Tue Jun 18 16:21:05 2024 +0800

    Enhance autotune to return the best `q_model` directly (#1875)

    Signed-off-by: yiliu30 <[email protected]>

commit 90fb43135397a035968b5334eba21931c18a83c0
Author: Kaihui-intel <[email protected]>
Date:   Tue Jun 18 16:06:04 2024 +0800

    fix layer match (#1873)

    Signed-off-by: Kaihui-intel <[email protected]>
    Co-authored-by: Sun, Xuehao <[email protected]>

commit f4eb66073fc2c3f13d624c31056d94f2b6735076
Author: Sun, Xuehao <[email protected]>
Date:   Mon Jun 17 16:12:06 2024 +0800

    Limit numpy versions (#1874)

    Signed-off-by: Sun, Xuehao <[email protected]>

commit 2928d856336d3cd2db9068950e054ce4f7c7bbe0
Author: chen, suyue <[email protected]>
Date:   Fri Jun 14 21:51:13 2024 +0800

    update v2.6 release readme (#1871)

    Signed-off-by: chensuyue <[email protected]>

commit 48c5e3a9c22b8f16446a6849d63fed0cdf4a0a7a
Author: Kaihui-intel <[email protected]>
Date:   Fri Jun 14 21:10:14 2024 +0800

    Modify WOQ examples structure (#1866)

    Signed-off-by: Kaihui-intel <[email protected]>
    Signed-off-by: chensuyue <[email protected]>

commit 498af747839af0f54e8b1e946ac20fb52b0fbb89
Author: Sun, Xuehao <[email protected]>
Date:   Fri Jun 14 21:09:36 2024 +0800

    Update SQ/WOQ status (#1869)

    Signed-off-by: Sun, Xuehao <[email protected]>
    Co-authored-by: chen, suyue <[email protected]>

commit b401b02db2cc7d7f4f8412a815fa435e66e330a0
Author: Kaihui-intel <[email protected]>
Date:   Fri Jun 14 17:48:03 2024 +0800

    Add PT2E cv&llm example (#1853)

    Signed-off-by: Kaihui-intel <[email protected]>

commit e470f6cdfbbad32fcf17be56903e649a05059780
Author: xinhe <[email protected]>
Date:   Fri Jun 14 17:34:26 2024 +0800

    [3x] add recommendation examples (#1844)

    Signed-off-by: xin3he <[email protected]>

commit a1415128a8d63af7e1d2798521f11b137eccec81
Author: zehao-intel <[email protected]>
Date:   Fri Jun 14 14:56:30 2024 +0800

    Improve UT Branch Coverage for TF 3x (#1867)

    Signed-off-by: zehao-intel <[email protected]>

commit b99a79d029e8010d234d3b4259994e598bec1a06
Author: Zixuan Cheng <[email protected]>
Date:   Fri Jun 14 14:10:49 2024 +0800

    modify 3.x ipex example structure (#1858)

    * modify 3.x ipex example structure

    Signed-off-by: Cheng, Zixuan <[email protected]>

    * add json path

    Signed-off-by: Cheng, Zixuan <[email protected]>

    * fix for sq

    Signed-off-by: Cheng, Zixuan <[email protected]>

    * minor fix

    Signed-off-by: Cheng, Zixuan <[email protected]>

    * Update run_clm_no_trainer.py

    * Update run_clm_no_trainer.py

    * Update run_clm_no_trainer.py

    * minor fix

    Signed-off-by: Cheng, Zixuan <[email protected]>

    * remove old files

    Signed-off-by: Cheng, Zixuan <[email protected]>

    * fix act_algo

    Signed-off-by: Cheng, Zixuan <[email protected]>

    ---------

    Signed-off-by: Cheng, Zixuan <[email protected]>
    Co-authored-by: xinhe <[email protected]>

commit 922b2471e617cc4c56376866e991302d0beb0640
Author: zehao-intel <[email protected]>
Date:   Fri Jun 14 12:33:39 2024 +0800

    Add TF 3x Examples (#1839)

    Signed-off-by: zehao-intel <[email protected]>

commit 70a1d501fdfee16a10e34385bca9f15eba4366b4
Author: Zixuan Cheng <[email protected]>
Date:   Fri Jun 14 10:17:33 2024 +0800

    fix 3x ipex static quant regression (#1864)

    Description
    fix 3x ipex static quant regression
    cannot fallback with op type name ('linear')
    dump wrong op stats (no 'Linear&relu' op type)
    ---------

    Signed-off-by: Cheng, Zixuan <[email protected]>

commit 4e45f8f68bf126ca0c9dd655fce03b21a93ec151
Author: zehao-intel <[email protected]>
Date:   Fri Jun 14 10:04:11 2024 +0800

    Improve UT Coverage for TF 3x  (#1852)

    Signed-off-by: zehao-intel <[email protected]>
    Signed-off-by: chensuyue <[email protected]>

commit 794b2762c0bb2f076973e1fca5fdecd23efec774
Author: xinhe <[email protected]>
Date:   Thu Jun 13 18:02:04 2024 +0800

    migrate export to 2x and 3x from deprecated (#1845)

    Signed-off-by: xin3he <[email protected]>

commit 0eced1478c6796a5e2dcb254a65bbc96af4d1b8b
Author: yuwenzho <[email protected]>
Date:   Wed Jun 12 18:49:17 2024 -0700

    Enhance INC WOQ model loading & support Huggingface WOQ model loading (#1826)

    Signed-off-by: yuwenzho <[email protected]>

commit 6733dabc4d48a6625e184e4a29a754949f415097
Author: Wang, Mengni <[email protected]>
Date:   Wed Jun 12 17:08:31 2024 +0800

    update mx script (#1838)

    Signed-off-by: Mengni Wang <[email protected]>

commit a0dee94dab0920ba30de049e871b19a72ddb8996
Author: Wang, Chang <[email protected]>
Date:   Wed Jun 12 15:01:25 2024 +0800

    Remove export_compressed_model in AWQConfig (#1831)

commit 2c3556d441de2f0963167db71ecdee7353bd76bb
Author: Huang, Tai <[email protected]>
Date:   Wed Jun 12 14:46:14 2024 +0800

    Add 3x architecture diagram (#1849)

    Signed-off-by: Huang, Tai <[email protected]>

commit 0e2cade66f8c3951e6ce7de226421f6700d2ad85
Author: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Date:   Wed Jun 12 14:20:06 2024 +0800

    Bump braces from 3.0.2 to 3.0.3 in /neural_insights/gui (#1862)

    Signed-off-by: dependabot[bot] <[email protected]>

commit 5b5579bf953cb24607dc18b3a01ffe1071c3b604
Author: Kaihui-intel <[email protected]>
Date:   Wed Jun 12 14:12:00 2024 +0800

    Fix Neural Solution security issue (#1856)

    Signed-off-by: Kaihui-intel <[email protected]>

commit e9cb48c9462fdc671c523f93611b30b41b6cff98
Author: xinhe <[email protected]>
Date:   Wed Jun 12 11:19:47 2024 +0800

    improve UT coverage of PT Utils and Quantization (#1842)

    * update UTs

    ---------

    Signed-off-by: xin3he <[email protected]>
    Signed-off-by: xinhe3 <[email protected]>

commit 6b2738390dfdab543de1ccd9242fe541c78b6a2e
Author: Yi Liu <[email protected]>
Date:   Wed Jun 12 11:11:50 2024 +0800

    Fix config expansion with empty options (#1861)

    Signed-off-by: yiliu30 <[email protected]>

commit 25c71aad5a55210d87d371257344f21762e3bb0e
Author: WenjiaoYue <[email protected]>
Date:   Tue Jun 11 17:54:31 2024 +0800

    Delete the static resources of the JupyterLab extension after packaging (#1860)

    Signed-off-by: Yue, Wenjiao <[email protected]>

commit 455f1e1f0f0284e87b46d257b6d126ca76fe1748
Author: Wang, Mengni <[email protected]>
Date:   Tue Jun 11 15:28:40 2024 +0800

    Add UT and remove unused code for torch MX quant (#1854)

    * Add UT and remove unused code for torch MX quant
    ---------

    Signed-off-by: Mengni Wang <[email protected]>

Change-Id: I543550ffcc16143d3e612fac2f9ea3a31a1143e1
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants