Assert with Xe KMD when using -DNEO_ENABLE_XE_DRM_DETECTION=TRUE #696
Comments
OpenCL programs also hit the same assert, which is here in the repo code: Strace shows this memory region check issue happening at driver init time:
On Arc, I've also seen a segfault instead of the assert, but it was not reproducible. Strace showed it happening with the same backtrace as the assert. With OpenCL, strace shows the line 311 assert being reached through a different route than in the above
The Mesa driver works fine with this (last night's) Xe KMD git version.
Also tried the older (Dec 21st) Xe KMD version recommended for media-driver in intel/media-driver#1761. But
The latest Mesa tag works with Xe KMD HEAD, and the linked So, what Xe KMD version
As latest "compute-runtime" tag ( Although latest Mesa (release) and media-driver (
Which Xe KMD version, patches, etc. is compute-runtime supposed to work with? And which compute-runtime version, patches, etc. should I use?
@JablonskiMateusz That commit seems to be only in
Similarly to Actually, both of the drivers work with both of the KMD versions from f.d.o:
However, while basic CL stuff seems to work, all Sysman metric queries return Is there something I need to use to get at least some Sysman metrics to work, or is Xe KMD still lacking all metric support? PS. I think this ticket should stay open until:
[1] corresponding
With (I.e. Sysman supports However, when querying engine metrics, there's a segfault:
Those 2 metric types are the only ones
@eero-t we are looking into this and will update you when the fix is ready.
The segfault on engine metrics query is specific to "zello_sysman" (built from the same 2024-02-09 master branch sources as the driver itself). There's no crash with my own program that uses zesInit() with Xe KMD; engine metrics just do not work: #707
Tried latest Xe KMD (6.8.0-rc3) tags:
Because the latest "24.05.28454.10" release is still missing the required 278ced3 commit, I again built the latest compute-runtime In quick testing, the driver build seemed to work OK with the "drm-xe-next-2024-02-25" one, except for the missing engine metrics regression, which also happens with As to the "drm-xe-fixes-2024-02-29" Xe KMD, the OpenCL read/write/copy tester hung on both TGL iGPU and Arc. When stracing the tester, it was either using 100% CPU by constantly sched_yield()ing (TGL), or nanosleeping (Arc). For now, I'm assuming the driver is not even supposed to work with that Xe KMD version...
With the new release it is fixed, please close.
@saik-intel I haven't yet had time to verify the latest release's functionality. I'll try to do it before the end of the week.
Closing. On quick testing (
Problem
Compute-runtime Xe KMD support does not actually work with Xe KMD; it asserts.
Details
When building the kernel from the Xe repo's default "drm-xe-next" branch (yesterday's HEAD commit): https://gitlab.freedesktop.org/drm/xe/kernel
With the Xe driver enabled:
Booting a TGL device with it enabled:
And using a compute stack built from the following versions:
Using the options that enable Xe KMD support:
Compute-runtime and its zello_sysman tool just abort with an assert:
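Side note for anyone reproducing this setup: before running zello_sysman or an OpenCL tester, it is worth confirming that the GPU is really bound to the Xe KMD rather than i915. Below is a minimal sketch, assuming the usual Linux sysfs layout where /sys/class/drm/<card>/device/driver is a symlink to the bound kernel driver. The function name, the "card0" default and the sysfs_root parameter are my own illustration and not part of compute-runtime.

```python
import os
from pathlib import Path
from typing import Optional


def bound_kernel_driver(card: str = "card0", sysfs_root: str = "/sys") -> Optional[str]:
    """Return the name of the kernel driver bound to a DRM card, or None.

    Assumption: <sysfs_root>/class/drm/<card>/device/driver is a symlink whose
    last path component is the driver name (for example "xe" or "i915").
    """
    link = Path(sysfs_root) / "class" / "drm" / card / "device" / "driver"
    if not link.is_symlink():
        # Card does not exist, or no driver is bound to the device.
        return None
    return os.path.basename(os.readlink(link))


if __name__ == "__main__":
    # The card index is an assumption; it may differ on multi-GPU systems.
    print(bound_kernel_driver("card0"))
```

If this reports "i915" instead of "xe", the Xe KMD is not the one driving the device, and the results would not be comparable with the ones in this ticket.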