Skip to content

Commit

Permalink
phi3 : duplicate rope factors in each layer (ggerganov#7447)
Browse files Browse the repository at this point in the history
* phi3 : duplicate rope factors in each layer

phi3 : set phi-3 model type as 14B

model loader : simplify the process for duplicating model tensors

llama-bench : remove default pg test

* replace bool parameters in llama_model_loader with named flags
  • Loading branch information
slaren authored May 22, 2024
1 parent fcda112 commit b18532a
Show file tree
Hide file tree
Showing 2 changed files with 83 additions and 97 deletions.
2 changes: 1 addition & 1 deletion examples/llama-bench/llama-bench.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -195,7 +195,7 @@ static const cmd_params cmd_params_defaults = {
/* model */ {"models/7B/ggml-model-q4_0.gguf"},
/* n_prompt */ {512},
/* n_gen */ {128},
/* n_pg */ {{512, 128}},
/* n_pg */ {},
/* n_batch */ {2048},
/* n_ubatch */ {512},
/* type_k */ {GGML_TYPE_F16},
Expand Down
Loading

0 comments on commit b18532a

Please sign in to comment.