storage: Test behavior when disk fills up #19656
Labels
A-storage
Relating to our storage engine (Pebble) on-disk storage.
A-testing
Testing tools and infrastructure
C-cleanup
Tech debt, refactors, loose ends, etc. Solution not expected to significantly change behavior.
S-3-ux-surprise
Issue leaves users wondering whether CRDB is behaving properly. Likely to hurt reputation/adoption.
Prior to #19447, certain disk errors (the most likely being
ENOSPC
) were not being handled correctly, and we suspect that inconsistent reads could be served after this had happened. We need more testing of our behavior after disk writes have failed.One way to do this would be a process that alternately writes a file to fill up the disk (or maybe just
fallocate()
), waits a bit, then deletes the file (and restarts the cockroach process if it crashed). Maybe this would make sense as a new jepsen nemesis.The text was updated successfully, but these errors were encountered: