-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ipex-llm Llama.cpp port inside ipex-llm Docker containers getting SIGBUS #10955
Comments
This image is still under internal testing,we will update you with the latest image after development is completed. |
I understand that, but I am getting the same problem regardless of if I use this image or if I use my custom Docker container running the llama.cpp fork inside |
Maybe this issue is caused by a higher version of linux kenel. We have validated the kenel version of 5.19.0-41-generic and 6.2.0 but not 6.8.8. |
I found what the problem was. I checked a few other issues, and one of the troubleshooting steps was to run the utility scripts in ipex-llm. I ran that and then found out that in my |
This might be compute-runtime or kernel related but I am posting here first since I don't know. For getting the simplest reproduction, I pulled the Docker image from
intelanalytics/ipex-llm-xpu:cpp-test
which was recently published to the public but I had been using another Docker container for trying to run the Llama.cpp fork included inside thebigdl-core-cpp
pip package and had the same error show up. I used the same command as in the Quickstart guide and I got this as output.GDB stacktrace shows the following.
If I run with
SYCL_PI_TRACE=-1
, I see this as the last snippet before the SIGBUSI am using Linux kernel 6.8.8 and I was under the impression that any kernel or compute-runtime issues had been fixed with regards to seeing
sycl-ls
output the GPU correctly and not seeing the kernel hang for a workload. Hope this is enough information to track the issue but I can provide full logs upon request.The text was updated successfully, but these errors were encountered: