Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix panic in ProgressMap when the underlying block may have been dumped and repaired #2671

Closed
wen-coding opened this issue Aug 20, 2024 · 0 comments · Fixed by #2676
Closed
Assignees

Comments

@wen-coding
Copy link

Problem

During today's testnet accident, we saw validators panicking in ProgressMap when underlying blocks have been purged:

[2024-08-20T07:07:42.166017158Z WARN solana_core::replay_stage] purging slot 288668257
[2024-08-20T07:07:43.020116943Z WARN solana_core::replay_stage] purging duplicate descendant: 288668259
thread 'solReplayStage' panicked at core/src/consensus/progress_map.rs:294:32:
slot=288668259 must exist in ProgressMap
0: rust_begin_unwind
1: core::panicking::panic_fmt
2: solana_core::consensus::progress_map::ProgressMap::get_propagated_stats_must_exist
3: solana_core::consensus::progress_map::ProgressMap::get_leader_propagation_slot_must_exist
4: solana_core::replay_stage::ReplayStage::retransmit_latest_unpropagated_leader_slot

Proposed Solution

Audit all _must_exist function and fix.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
1 participant