
panic on attaching a NAVI10 card: Unregistered use of FPU in kernel #10

Closed

NorwegianRockCat opened this issue Jun 21, 2020 · 16 comments

@NorwegianRockCat
Contributor
I spent some time trying to get the current master branch (commit efc692b) working with a Radeon 5700 XT card (NAVI10). The code needs an adjustment for the firmware to finish loading. That is, psp_v11_0.c:302 uses an mdelay of 20 while waiting for the PSP SOS firmware to "finish", but on FreeBSD the wait fails (too quick?). I increased it to 200 and it worked.
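
For reference, a sketch of the change, assuming the surrounding code matches the Linux 5.3 psp_v11_0.c this tree tracks (the exact function and line may differ; presumably it is the delay before polling the SOS status register in psp_v11_0_bootloader_load_sos()):

```diff
-	mdelay(20);
+	mdelay(200);	/* FreeBSD: 20 ms is apparently not enough settle time */
```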

Once that finishes, the kernel panics with an unregistered use of the FPU in the kernel.

The backtrace is:

Unread portion of the kernel message buffer:
<6>[drm] reserve 0x900000 from 0x8002400000 for PSP TMR
<6>amdgpu: [powerplay] smu driver if version = 0x00000033, smu fw if version = 0x00000035, smu fw version = 0x002a3200 (42.50.0)
<4>amdgpu: [powerplay] SMU driver if version not matched
<6>amdgpu: [powerplay] SMU is initialized successfully!
panic: Unregistered use of FPU in kernel
cpuid = 9
time = 1592673856
KDB: stack backtrace:
db_trace_self_wrapper() at db_trace_self_wrapper+0x2b/frame 0xfffffe00ea3b8bd0
vpanic() at vpanic+0x182/frame 0xfffffe00ea3b8c20
panic() at panic+0x43/frame 0xfffffe00ea3b8c80
trap() at trap+0x828/frame 0xfffffe00ea3b8d90
calltrap() at calltrap+0x8/frame 0xfffffe00ea3b8d90
--- trap 0x16, rip = 0xffffffff8488afcb, rsp = 0xfffffe00ea3b8e60, rbp = 0xfffffe00ea3b91c0 ---
dcn20_create_resource_pool() at dcn20_create_resource_pool+0x13ab/frame 0xfffffe00ea3b91c0
dc_create_resource_pool() at dc_create_resource_pool+0xc0/frame 0xfffffe00ea3b9220
dc_create() at dc_create+0x2df/frame 0xfffffe00ea3b9280
dm_hw_init() at dm_hw_init+0x27a/frame 0xfffffe00ea3b93b0
amdgpu_device_init() at amdgpu_device_init+0x1d27/frame 0xfffffe00ea3b9480
amdgpu_driver_load_kms() at amdgpu_driver_load_kms+0xd0/frame 0xfffffe00ea3b94b0
drm_dev_register() at drm_dev_register+0xc6/frame 0xfffffe00ea3b94e0
amdgpu_pci_probe() at amdgpu_pci_probe+0x17d/frame 0xfffffe00ea3b9520
linux_pci_attach_device() at linux_pci_attach_device+0x569/frame 0xfffffe00ea3b9580
device_attach() at device_attach+0x3dd/frame 0xfffffe00ea3b95d0
bus_generic_driver_added() at bus_generic_driver_added+0xb6/frame 0xfffffe00ea3b9600
devclass_driver_added() at devclass_driver_added+0x39/frame 0xfffffe00ea3b9640
devclass_add_driver() at devclass_add_driver+0x13d/frame 0xfffffe00ea3b9680
_linux_pci_register_driver() at _linux_pci_register_driver+0xdf/frame 0xfffffe00ea3b96b0
amdgpu_evh() at amdgpu_evh+0x92/frame 0xfffffe00ea3b96c0
module_register_init() at module_register_init+0xa4/frame 0xfffffe00ea3b96f0
linker_load_module() at linker_load_module+0xba9/frame 0xfffffe00ea3b9a10
kern_kldload() at kern_kldload+0xb8/frame 0xfffffe00ea3b9a60
sys_kldload() at sys_kldload+0x5b/frame 0xfffffe00ea3b9a90
amd64_syscall() at amd64_syscall+0x119/frame 0xfffffe00ea3b9bb0
fast_syscall_common() at fast_syscall_common+0x101/frame 0xfffffe00ea3b9bb0
--- syscall (304, FreeBSD ELF64, sys_kldload), rip = 0x8002db06a, rsp = 0x7fffffffd538, rbp = 0x7fffffffdab0 ---
KDB: enter: panic

I looked at the dcn20_create_resource_pool() function, and the fault seems to happen somewhere in the pool's construction. The file has a kernel_fpu_begin/end pair in one function, but perhaps FreeBSD is stricter than Linux here, or something is touching the FPU along the way, or something else is wrong with the functions. There are some places where floating-point variables are initialized without being wrapped in these pairs.
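
From what I can tell, the panic itself comes from FreeBSD trapping unregistered FPU use: kernel_fpu_begin/end here are LinuxKPI shims over the native fpu_kern API. A rough sketch of what I think the pair amounts to (the real linuxkpi implementation surely differs in detail, e.g. context allocation and nesting):

```c
#include <sys/param.h>
#include <sys/proc.h>
#include <machine/fpu.h>

/* A context to save the interrupted FPU state into; allocated once
 * (e.g. with fpu_kern_alloc_ctx(0)) in the real code. */
static struct fpu_kern_ctx *fpu_ctx;

static void
kernel_fpu_begin(void)
{
	/* Registers this thread's FPU use with the trap handler; any FP
	 * instruction executed outside such a region raises the
	 * "Unregistered use of FPU in kernel" panic seen above. */
	fpu_kern_enter(curthread, fpu_ctx, FPU_KERN_NORMAL);
}

static void
kernel_fpu_end(void)
{
	fpu_kern_leave(curthread, fpu_ctx);
}
```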

I don't know enough about these kernel FPU semantics to fix this. I naively tried wrapping all of dcn20_create_resource_pool() in kernel_fpu_begin/end, but that didn't solve the issue (and is likely not reentrant).

I can certainly try out things if there is a need.

Dmesg for the amdgpu load:

[drm] amdgpu kernel modesetting enabled.
drmn0: <drmn> on vgapci0
vgapci0: child drmn0 requested pci_enable_io
vgapci0: child drmn0 requested pci_enable_io
sysctl_warn_reuse: can't re-use a leaf (hw.dri.debug)!
[drm] initializing kernel modesetting (NAVI10 0x1002:0x731F 0x1DA2:0xE411 0xC1).
[drm] register mmio base: 0xFCB00000
[drm] register mmio size: 524288
[drm] set register base offset for ATHUB
[drm] set register base offset for CLKA
[drm] set register base offset for CLKA
[drm] set register base offset for CLKA
[drm] set register base offset for CLKA
[drm] set register base offset for CLKA
[drm] set register base offset for DF
[drm] set register base offset for DMU
[drm] set register base offset for GC
[drm] set register base offset for HDP
[drm] set register base offset for MMHUB
[drm] set register base offset for MP0
[drm] set register base offset for MP1
[drm] set register base offset for NBIF
[drm] set register base offset for NBIF
[drm] set register base offset for OSSSYS
[drm] set register base offset for SDMA0
[drm] set register base offset for SDMA1
[drm] set register base offset for SMUIO
[drm] set register base offset for THM
[drm] set register base offset for UVD
[drm] add ip block number 0 <nv_common>
[drm] add ip block number 1 <gmc_v10_0>
[drm] add ip block number 2 <navi10_ih>
[drm] add ip block number 3 <psp>
[drm] add ip block number 4 <smu>
[drm] add ip block number 5 <dm>
[drm] add ip block number 6 <gfx_v10_0>
[drm] add ip block number 7 <sdma_v5_0>
[drm] add ip block number 8 <vcn_v2_0>
drmn0: successfully loaded firmware image with name: amdgpu/navi10_gpu_info.bin
ATOM BIOS: 113-1E4112U-O45
[drm] VCN decode is enabled in VM mode
[drm] VCN encode is enabled in VM mode
[drm] VCN jpeg decode is enabled in VM mode
pci_is_thunderbolt_attached not implemented -- see your local kernel hacker
[drm] vm size is 262144 GB, 4 levels, block size is 9-bit, fragment size is 9-bit
drmn0: VRAM: 8176M 0x0000008000000000 - 0x00000081FEFFFFFF (8176M used)
drmn0: GART: 512M 0x0000000000000000 - 0x000000001FFFFFFF
Successfully added WC MTRR for [0xe0000000-0xefffffff]: 0; 
[drm] Detected VRAM RAM=8176M, BAR=256M
[drm] RAM width 256bits GDDR6
[TTM] Zone  kernel: Available graphics memory: 16712344 KiB
[TTM] Zone   dma32: Available graphics memory: 2097152 KiB
[TTM] Initializing pool allocator
[drm] amdgpu: 8176M of VRAM memory ready
[drm] amdgpu: 8176M of GTT memory ready.
[drm] GART: num cpu pages 131072, num gpu pages 131072
[drm] PCIE GART of 512M enabled (table at 0x0000008001FA4000).
get_nr_swap_pages not implemented -- see your local kernel hacker
Jun 20 19:24:09 concordia kernel: pci_is_thunderbolt_attached not implemented -- see your local kernel hacker
Jun 20 19:24:09 concordia kernel: Successfully added WC MTRR for [0xe0000000-0xefffffff]: 0; 
Jun 20 19:24:09 concordia kernel: get_nr_swap_pages not implemented -- see your local kernel hacker
drmn0: successfully loaded firmware image with name: amdgpu/navi10_sos.bin
drmn0: successfully loaded firmware image with name: amdgpu/navi10_asd.bin
drmn0: successfully loaded firmware image with name: amdgpu/navi10_smc.bin
[drm] ppt_offset_bytes: 3
[drm] ppt_size_bytes: 262912
drmn0: successfully loaded firmware image with name: amdgpu/navi10_pfp.bin
drmn0: successfully loaded firmware image with name: amdgpu/navi10_me.bin
drmn0: successfully loaded firmware image with name: amdgpu/navi10_ce.bin
drmn0: successfully loaded firmware image with name: amdgpu/navi10_rlc.bin
drmn0: successfully loaded firmware image with name: amdgpu/navi10_mec.bin
drmn0: successfully loaded firmware image with name: amdgpu/navi10_mec2.bin
sched_setscheduler not implemented -- see your local kernel hacker
Jun 20 19:24:14 concordia kernel: sched_setscheduler not implemented -- see your local kernel hacker
drmn0: successfully loaded firmware image with name: amdgpu/navi10_sdma.bin
drmn0: successfully loaded firmware image with name: amdgpu/navi10_sdma1.bin
[drm] use_doorbell being set to: [true]
[drm] use_doorbell being set to: [true]
drmn0: successfully loaded firmware image with name: amdgpu/navi10_vcn.bin
[drm] Found VCN firmware Version ENC: 1.7 DEC: 4 VEP: 0 Revision: 17
[drm] PSP loading VCN firmware
[drm] reserve 0x900000 from 0x8002400000 for PSP TMR
amdgpu: [powerplay] smu driver if version = 0x00000033, smu fw if version = 0x00000035, smu fw version = 0x002a3200 (42.50.0)
amdgpu: [powerplay] SMU driver if version not matched
amdgpu: [powerplay] SMU is initialized successfully!
@evadot
Contributor

evadot commented Jun 25, 2020

I don't see where FPU instructions would be used, so it's a bit weird.
Also, since a lot of the functions are static, they get inlined by the compiler.
I suggest you test recompiling after adding __attribute__((noinline)) just before the function name of every static function in this file, i.e.:

```diff
-static bool construct(
+static bool __attribute__((noinline)) construct(
```

Then the panic will show the real function using FPU instructions, and you can add some kernel_fpu_begin/kernel_fpu_end in it.

@NorwegianRockCat
Contributor Author

I added the noinline attribute and recompiled. It seems that the FPU is being triggered inside construct(). I added kernel_fpu_begin at the beginning of the function and kernel_fpu_end at the end. Then the screen goes blank and the computer freezes (I can't ping or ssh into it) and I have to do a hard restart.

I thought it might be some sort of problem with holding the FPU context too long. So I added a bunch of DRM_INFOs to see if I could narrow down the section where it happens, and it seems that the failure occurs after the call to dml_init_instance() but before the call to dal_irq_service_dcn20_create().
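
The markers are nothing fancy, something like this (placement is illustrative, and the dml_init_instance() arguments are from memory of the Linux 5.3 call site):

```c
	DRM_INFO("construct: before dml_init_instance\n");
	dml_init_instance(&dc->dml, &dcn2_0_soc, &dcn2_0_ip, DML_PROJECT_NAVI10);
	DRM_INFO("construct: after dml_init_instance, before dal_irq_service_dcn20_create\n");
```

The last marker that reaches the console before the hang brackets the bad span.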

I then wrapped just this section of code in kernel_fpu_begin/kernel_fpu_end instead, but I got the same result as before (a blank screen and an unresponsive machine). It doesn't panic, so I don't know what is going wrong.

I must admit I'm stumped. Any suggestions? Would you like the panic backtrace from after disabling the inlining?

@NorwegianRockCat
Contributor Author

I had some more time to narrow this down further. The offending lines in construct() are around 3025 and 3026:

```c
ranges.reader_wm_sets[i].min_fill_clk_mhz = (i > 0) ? (dcn2_0_soc.clock_limits[i - 1].dram_speed_mts / 16) + 1 : 0;
ranges.reader_wm_sets[i].max_fill_clk_mhz = dcn2_0_soc.clock_limits[i].dram_speed_mts / 16;
```

The element dram_speed_mts is a double.

I don't know why this is OK on Linux, but as far as I can tell it is a floating-point operation.

Regardless, with the kernel_fpu_begin and kernel_fpu_end in place, the code makes it through the first iteration of the loop here, but then the screen goes completely blank and the computer locks up during a later iteration, requiring a hard restart.
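
Concretely, the wrap looks like this (only the two assignments are verbatim; the loop bound and everything else is my paraphrase of the surrounding code):

```c
	for (i = 0; i < num_states; i++) {	/* abridged watermark-range loop */
		kernel_fpu_begin();
		/* dram_speed_mts is a double, so both statements use the FPU */
		ranges.reader_wm_sets[i].min_fill_clk_mhz = (i > 0) ?
		    (dcn2_0_soc.clock_limits[i - 1].dram_speed_mts / 16) + 1 : 0;
		ranges.reader_wm_sets[i].max_fill_clk_mhz =
		    dcn2_0_soc.clock_limits[i].dram_speed_mts / 16;
		kernel_fpu_end();
		/* ... integer-only setup continues outside the FPU section ... */
	}
```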

Since the screen goes blank and it doesn't panic, I don't know how to get any more debug information at the moment.

@evadot
Contributor

evadot commented Jun 28, 2020

It could be that it does panic, but in the middle of a vt switch, so nothing gets displayed.
You could try blindly typing 'dump' and then, after a few seconds, 'reboot'.
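
That is, at the (now invisible) db> prompt, assuming a crash dump device is configured:

```
db> dump
db> reboot
```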

@NorwegianRockCat
Contributor Author

I thought I had actually tried that, but obviously I had typed incorrectly. doh

It turns out there are several FPU accesses as part of the DCN20 init. Through a process of adding kernel_fpu_begin/end pairs and rebooting, I finally got the thing to load. Hooray!

The code currently has a lot of debug information and attributes and whatnot. I hope to have some time later in the week or over the weekend to clean it up, and then I can send a pull request.

Now, to the next step, getting X to work.

@evadot
Contributor

evadot commented Jun 30, 2020

> It turns out there are several FPU accesses as part of the DCN20 init. Through a process of adding kernel_fpu_begin/end pairs and rebooting, I finally got the thing to load. Hooray!

Great news!

@zoujiaqing

Thanks :)

uqs pushed a commit to freebsd/freebsd-ports that referenced this issue Jun 30, 2020
Jehops pushed a commit to Jehops/freebsd-ports-legacy that referenced this issue Jun 30, 2020

This puts us in sync with the Linux 5.3 drm drivers, with one caveat: Navi10 AMDGPU support. An issue has been created upstream and we hope to have it working soon.
freebsd/drm-kmod#10

git-svn-id: svn+ssh://svn.freebsd.org/ports/head@540883 35697150-7ecd-e111-bb59-0022644237b5
@NorwegianRockCat
Contributor Author

I have now made the pull request. I have no idea if I followed the style correctly, but I'm certainly willing to make changes.

@zoujiaqing

After upgrading I now get this notice: kldload: an error occurred while loading module amdgpu. Please check dmesg(8) for more details.

Please upgrade it : https://www.freshports.org/graphics/drm-devel-kmod/

@evadot
Contributor

evadot commented Jul 6, 2020

> After upgrading I now get this notice: kldload: an error occurred while loading module amdgpu. Please check dmesg(8) for more details.
>
> Please upgrade it: https://www.freshports.org/graphics/drm-devel-kmod/

Could you give more details?
Are you using the port, or directly the master branch from here?

@zoujiaqing

pkg version:

pkg-devel-kmod-5.3.g20200612

FreeBSD version:

FreeBSD 13-CURRENT #2 9963b6e1cfe-c269658: Mon Jun 29 14:20:24 CST 2020 GENERIC amd64

@NorwegianRockCat
Contributor Author

That package is from 12 June (20200612 == 12 June 2020), and it seems to be out of sync with the kernel version you built.

Regardless, it won't have the changes from my pull request in there yet.

For the moment, if you want the change you'll need to clone and build this repo, or wait until the port gets updated.

@zoujiaqing

Thanks @NorwegianRockCat, I installed it from source code, but I noticed:

# kldload amdgpu
Jul   7 20:11:56 freebsd kernel: pm_runtime_mark_last_busy not implemented -- see your local kernel hacker

Do I need to upgrade the kernel?

@zoujiaqing

@NorwegianRockCat, I upgraded the kernel from the latest FreeBSD source code. The problem has not been resolved.

@NorwegianRockCat
Contributor Author

NorwegianRockCat commented Jul 8, 2020

@zoujiaqing That message just means that that particular function is not yet implemented in the compatibility layer. You are running a development version of the OS, so not everything is complete. If you have a local kernel hacker, you can perhaps persuade them to implement it.

As long as you still have the console after the module has loaded, then everything has loaded fine. You should be ready to go with the mesa-devel package.
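(Installing the userland side is then something like `pkg install mesa-devel`, assuming that package is available in your repository.)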

@zoujiaqing

Very good! Thank you @NorwegianRockCat ;)
