cleanup unused --no-mul-mat-q,-nommq, -mmq, --mul-mat-q, mul_mat_q #5772

Merged: 3 commits into master on Mar 1, 2024

Conversation

@phymbert (Collaborator) commented Feb 28, 2024

Clean up the deprecated, unused params.mul_mat_q and the related docs. It can confuse users trying to optimize GPU inference performance: #3359 and #3412.

This is a breaking change for any upstream application still using it.

@slaren (Collaborator) commented Feb 28, 2024

This option is no longer supported, and should be removed completely instead.

@phymbert phymbert marked this pull request as draft February 28, 2024 17:15
@phymbert (Collaborator, Author) commented Feb 28, 2024

@slaren Thanks, I had not checked :/. I will clean it up then.

@phymbert phymbert changed the title from "server: docs: --no-mul-mat-q,-nommq" to "cleanup unused --no-mul-mat-q,-nommq, -mmq, --mul-mat-q, mul_mat_q" Feb 28, 2024
@phymbert phymbert added the breaking change Changes that break ABIs, APIs, file formats, or other forms of backwards compatibility. label Feb 28, 2024
@phymbert phymbert marked this pull request as ready for review February 28, 2024 17:37
@phymbert (Collaborator, Author) commented Mar 1, 2024

@ggerganov Hi, as I got one approval, should I wait for yours? Thanks.

@ggerganov ggerganov merged commit 3ab8b3a into master Mar 1, 2024
61 checks passed
@phymbert phymbert deleted the feature/server-mul-mat-q branch March 1, 2024 13:11
hazelnutcloud pushed a commit to hazelnutcloud/llama.cpp that referenced this pull request Mar 10, 2024
* cleanup unused --no-mul-mat-q,-nommq, -mmq, --mul-mat-q, mul_mat_q

* remove: mul_mat_q in compare llama bench and usage

* update llama-bench

---------

Co-authored-by: slaren <[email protected]>
jordankanter pushed a commit to jordankanter/llama.cpp that referenced this pull request Mar 13, 2024
hodlen pushed a commit to hodlen/llama.cpp that referenced this pull request Apr 1, 2024
Labels
breaking change Changes that break ABIs, APIs, file formats, or other forms of backwards compatibility.
3 participants