Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add attr to MatMulNBits #1378

Merged
merged 9 commits into from
Nov 17, 2023
Merged

add attr to MatMulNBits #1378

merged 9 commits into from
Nov 17, 2023

Conversation

mengniwang95
Copy link
Contributor

@mengniwang95 mengniwang95 commented Nov 7, 2023

Type of Change

feature

Description

add attr "accuracy_level" to MatMulNBits to fit our jblas kernel

default is 0, means original mlas kernel
1 means jblas compute type is fp32
2 means jblas compute type is fp16
3 means jblas compute type is bf16
4 means jblas compute type is int8

Expected Behavior & Potential Risk

the expected behavior that triggered by this PR

How has this PR been tested?

how to reproduce the test (including hardware information)

Dependency Change?

any library dependency introduced or removed

@chensuyue
Copy link
Contributor

@yuwenzho please review.

@chensuyue
Copy link
Contributor

/azp run Model-Test

Copy link

Azure Pipelines successfully started running 1 pipeline(s).

@chensuyue chensuyue added the enhancement New feature or request label Nov 16, 2023
@chensuyue chensuyue merged commit 7057e3b into master Nov 17, 2023
45 of 47 checks passed
@chensuyue chensuyue deleted the mengni/4bit branch November 17, 2023 02:33
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants