
llama : correctly report GGUFv3 format #3818

Merged 1 commit into ggerganov:master on Oct 27, 2023

Conversation

cebtenzzre (Collaborator) commented on Oct 27, 2023

Follow-up to #3552.

Before:

llm_load_print_meta: format           = unknown

After:

llm_load_print_meta: format           = GGUFv3 (latest)

Will GGUFv2 be deprecated like GGUFv1 was?

edit: I guess it doesn't matter since for little-endian it's just a version bump AFAIK.

@cebtenzzre cebtenzzre merged commit 6d459cb into ggerganov:master Oct 27, 2023
32 checks passed
mattgauf added a commit to mattgauf/llama.cpp that referenced this pull request Oct 27, 2023
* master: (350 commits)
  speculative : ensure draft and target model vocab matches (ggerganov#3812)
  llama : correctly report GGUFv3 format (ggerganov#3818)
  simple : fix batch handling (ggerganov#3803)
  cuda : improve text-generation and batched decoding performance (ggerganov#3776)
  server : do not release slot on image input (ggerganov#3798)
  batched-bench : print params at start
  log : disable pid in log filenames
  server : add parameter -tb N, --threads-batch N (ggerganov#3584) (ggerganov#3768)
  server : do not block system prompt update (ggerganov#3767)
  sync : ggml (conv ops + cuda MSVC fixes) (ggerganov#3765)
  cmake : add missed dependencies (ggerganov#3763)
  cuda : add batched cuBLAS GEMM for faster attention (ggerganov#3749)
  Add more tokenizer tests (ggerganov#3742)
  metal : handle ggml_scale for n%4 != 0 (close ggerganov#3754)
  Revert "make : add optional CUDA_NATIVE_ARCH (ggerganov#2482)"
  issues : separate bug and enhancement template + no default title (ggerganov#3748)
  Update special token handling in conversion scripts for gpt2 derived tokenizers (ggerganov#3746)
  llama : remove token functions with `context` args in favor of `model` (ggerganov#3720)
  Fix baichuan convert script not detecing model (ggerganov#3739)
  make : add optional CUDA_NATIVE_ARCH (ggerganov#2482)
  ...
brittlewis12 added a commit to brittlewis12/llmfarm_core.swift that referenced this pull request Nov 17, 2023
olexiyb pushed a commit to Sanctum-AI/llama.cpp that referenced this pull request Nov 23, 2023
brittlewis12 added a commit to brittlewis12/llmfarm_core.swift that referenced this pull request Nov 30, 2023