
modification on llamacpp readme after Ipex-llm latest update #11971

Merged · 5 commits into intel-analytics:main · Aug 30, 2024

Conversation

JinheTang (Contributor)

1. `main` -> `llama-cli`
2. `-c 1024`
3. 2 new troubleshooting entries

@rnwang04

```
llama_new_context_with_model: SYCL_Host compute buffer size = 288.02 MiB
llama_new_context_with_model: graph nodes = 1062
llama_new_context_with_model: graph splits = 2
Native API failed. Native API returns: -5 (PI_ERROR_OUT_OF_RESOURCES) -5 (PI_ERROR_OUT_OF_RESOURCES)Exception caught at file:C:/Users/Administrator/actions-runner/cpp-release/_work/llm.cpp/llm.cpp/llama-cpp-bigdl/ggml/src/ggml-sycl.cpp, line:2856
```
Contributor: just keep latest

@@ -127,16 +126,12 @@ To use GPU acceleration, several environment variables are required or recommended:

```cmd
set SYCL_CACHE_PERSISTENT=1
rem under most circumstances, the following environment variable may improve performance, but sometimes this may also cause performance degradation
```
Contributor: revert it
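
For reference, a minimal bash sketch of the same cache setting for Linux shells; this is an assumed Linux equivalent of the Windows `cmd` block above, not text from this diff:

```bash
# SYCL_CACHE_PERSISTENT=1 tells the SYCL runtime to cache JIT-compiled GPU
# kernels on disk, so runs after the first one skip most of the warm-up compilation.
export SYCL_CACHE_PERSISTENT=1
```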


#### 14. `Native API failed` error
On the latest version of `ipex-llm`, you might come across a `Native API failed` error with certain models when the `-c` parameter is not set. Simply adding `-c 1024` would resolve this problem.
Contributor: On the latest version of ipex-llm's llama.cpp, you might come across a native API failed error with certain models without the -c parameter. Simply adding -c xx would resolve this problem.
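
A minimal usage sketch of the workaround; the model path and `-ngl 33` follow the example later in this readme, and the prompt text is illustrative:

```bash
# Cap the context size at 1024 tokens to avoid the PI_ERROR_OUT_OF_RESOURCES /
# "Native API failed" crash described above.
./llama-cli -m <model_dir>/Meta-Llama-3-8B-Instruct-Q4_K_M.gguf -ngl 33 -c 1024 -p "Once upon a time"
```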


#### 13. Core dump when having both integrated and dedicated graphics
If you have both integrated and dedicated graphics on your computer and don't specify which device to use, it will cause a core dump. Therefore, you need to run `export ONEAPI_DEVICE_SELECTOR=level_zero:0` before running `llama-cli`.
Contributor: If you have both integrated and dedicated graphics displayed in your llama.cpp's device log and don't specify which device to use, it will cause a core dump. In such a case, you may need to run export ONEAPI_DEVICE_SELECTOR=level_zero:0 before running llama-cli.
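
A short sketch of that workaround as a bash session; `level_zero:0` is the index from the readme text, and the right index depends on the device list llama.cpp prints at startup:

```bash
# Restrict oneAPI to a single Level Zero device so llama.cpp only sees one GPU
# instead of both the integrated and the dedicated one.
export ONEAPI_DEVICE_SELECTOR=level_zero:0
./llama-cli -m <model_dir>/Meta-Llama-3-8B-Instruct-Q4_K_M.gguf -ngl 33 -c 1024
```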

@rnwang04 (Contributor) commented Aug 30, 2024

Please also add numbers for ollama's troubleshooting.


Under your current directory, you can also execute the command below to have an interactive chat with Llama3:

- For **Linux users**:

```bash
./main -ngl 33 --interactive-first --color -e --in-prefix '<|start_header_id|>user<|end_header_id|>\n\n' --in-suffix '<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\n' -r '<|eot_id|>' -m <model_dir>/Meta-Llama-3-8B-Instruct-Q4_K_M.gguf
./llama-cli -ngl 33 --interactive-first --color -e --in-prefix '<|start_header_id|>user<|end_header_id|>\n\n' --in-suffix '<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\n' -r '<|eot_id|>' -m <model_dir>/Meta-Llama-3-8B-Instruct-Q4_K_M.gguf -c 1024
```
Contributor: Please confirm if it still works for the new llama.cpp

@rnwang04 (Contributor) left a comment

LGTM

@rnwang04 merged commit e895e1b into intel-analytics:main on Aug 30, 2024
@JinheTang deleted the ipex-llm-latest-update branch on August 30, 2024 05:34
cranechu0131 pushed a commit to cranechu0131/ipex-llm that referenced this pull request Sep 9, 2024
…nalytics#11971)

* update on readme after ipex-llm update

* update on readme after ipex-llm update

* rebase & delete redundancy

* revise

* add numbers for troubleshooting