TiKV slow initialization scan and stuck region initialization cause TiCDC replication to get stuck #3110
Labels
area/ticdc
component/tikv
severity/critical
type/bug
type/enhancement
What did you do?
Ran systemctl restart to restart one of the TiKV nodes (172.16.6.139 in this case, at 2021/10/20 16:13:38.581 +08:00).
What did you expect to see?
TiCDC replication should return to normal in less than 10 minutes.
What did you see instead?
One of the TiKV nodes suffered a slow initialization scan.
What's more, two TiKV nodes seem to be stuck in region initialization.
Replication had not recovered after 1 hour.
cdc log:
cdc.log.tar.gz
TiKV logs:
issue-3110-tikv.log.tar.gz
TiCDC metrics:
Test-Cluster-TiCDC-master-20211020_2021-10-20T09_12_04.064Z.zip
Versions of the cluster
Upstream TiDB cluster version (execute SELECT tidb_version(); in a MySQL client): TiKV v5.2.1
TiCDC version (execute cdc version):