You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When a disk stall results in the termination of a process and GOTRACEBACK is set appropriately, the panic includes a dump of stack traces. These stack traces can be used to confirm the presence of a goroutine stuck in the described syscall. However, there's no way to verify that the two syscall invocations are actually the same... The previous syscall could have completed and a new one introduced. We could add the unix timestamp in nanoseconds of the operation's start as a parameter to the function performing the timing. This would allow us to inspect the stack trace and verify that the start time matches the alleged start of the disk stall.
The text was updated successfully, but these errors were encountered:
Pass the start time in the form of nanoseconds since the unix epoch as a
parameter to timeDiskOp and timeFilesystemOp. This aids post-mortem debugging
when the disk-health checker fatals the process and GOTRACEBACK is set to dump
the stacks, including arguments. The start time argument will be printed in hex
form, allowing us to decode the start time of the operation. This can be used
to confirm that the timed operation was still inflight at the time stacks were
collected.
Closecockroachdb#3009.
Pass the start time in the form of nanoseconds since the unix epoch as a
parameter to timeDiskOp and timeFilesystemOp. This aids post-mortem debugging
when the disk-health checker fatals the process and GOTRACEBACK is set to dump
the stacks, including arguments. The start time argument will be printed in hex
form, allowing us to decode the start time of the operation. This can be used
to confirm that the timed operation was still inflight at the time stacks were
collected.
Close#3009.
When a disk stall results in the termination of a process and
GOTRACEBACK
is set appropriately, the panic includes a dump of stack traces. These stack traces can be used to confirm the presence of a goroutine stuck in the described syscall. However, there's no way to verify that the two syscall invocations are actually the same... The previous syscall could have completed and a new one introduced. We could add the unix timestamp in nanoseconds of the operation's start as a parameter to the function performing the timing. This would allow us to inspect the stack trace and verify that the start time matches the alleged start of the disk stall.The text was updated successfully, but these errors were encountered: