-
Notifications
You must be signed in to change notification settings - Fork 1.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
null pointer deref on 2.2.0 #15485
Comments
Lucky I created this, system locked up, pool died. This may be related to the encryption / block cloning (suggested on irc) I have a picture of trying to reboot (an older kernel but still, backtrace, also occurred on the above referenced versions) |
Just a hint: If this is related to block cloning you can use version 2.1.13 or earlier. On this versions is no block cloning support wired up to the linux kernel. Did you tried it on one of this versions? |
I've been on 2.2 based modules for a long time now (probably over a year, very long time). I've added checkpoints to the pool for my weekly backup progress (checkpoint a pool after a week of 'testing'). Also added an external backup I can boot from that's basically a clone of the internal drive. |
Oh, no easy way to just install an 2.1.13 to check if this is related to non block cloning stuff. Thanks for the information. |
Ya, I tried every kernel I had available back til they started to fail to import due to pool features being enabled that were not supported. |
@prometheanfire: |
Thanks, I'll be watching the 2.2.1 branch. |
I'm not sure if this traceback is directly related to the block cloning issue so I'm not sure it'd be right to close this in favor of that pull. |
It's still there on current git - tested with Kernel 6.1.61 LTS, Kernel 6.6.1, Kernel 6.5.9 (all on Arch Linux)
It's also there on zfs 2.2.0 - the pool just has a single encrypted dataset that is not used during the build. I can trigger it reliable building OpenWrt (Linux Distro for Wireless Routers)
Additionally the whole pool was corrupted one time also during OpenWrt compile using archzfs dkms git ( unfortunatly no textual represention - in that case only importing read-only and send/recv to a new pool helped. I tried to disable zil playback using I also tried using https://github.com/zabbly/zfs on Ubuntu 22.04 (Kernel 5.15) but importing also fails and machine hangs. I dd'ed the pool to a disk so if someone needs additional debugging information I can try. coretuils on arch is recent enough to use block cloning but the whole pool was encrypted. |
The same errors with branch 2.2.1 and master /lib/modules/6.5.11-300.fc39.x86_64/extra/zfs.ko.xz
and no errors at least before running zfs upgrade (and with zfs 2.1.*) |
The panics on reboot quoted here look very alike to ones in this issue: #15513 , caused by improper block cloning ZIL records encryption. The fix for that is currently in review. The original panic though may or may not be related to the block cloning. |
@amotin my related issue when building openwrt is fixed in current master @prometheanfire maybe you could retry with current master and check if that fixes the issue? |
System information
Describe the problem you're observing
Backtrace when emerging packages, so high IO. Normally I use tmpfs but not enough ram for webkit...
Describe how to reproduce the problem
happend when installing gentoo-kernel-bin
Include any warning/errors/backtraces from the system logs
The text was updated successfully, but these errors were encountered: