The "Snapshot Predecode Duration" metric doesn't take syncSchema time into account #3759

CalvinNeo · 2021-12-28T04:46:22Z

Bug Report

1. Minimal reproduce step (Required)

We have a scenario that involves a heavy syncSchema (up to 7 min) during PrehandleSnapshot.

However, there is no easy way for us to identify this situation from metrics. The "Snapshot Predecode Duration" metric does not include syncSchema time (such as AtomicGetStorageSchema).
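To illustrate the gap, here is a minimal sketch (not TiFlash code, which is C++). It assumes a hypothetical `prehandle_snapshot` helper in which schema sync runs before decoding, and it compares a timer that starts after the sync with one that covers the whole call. The function name, the callbacks, and the sleep durations are all made up for illustration.

```python
import time


def prehandle_snapshot(sync_schema, decode):
    """Hypothetical stand-in for a pre-handle step.

    Returns (decode_only_seconds, end_to_end_seconds).
    """
    start = time.perf_counter()          # end-to-end timer starts here
    sync_schema()                        # schema sync happens first
    decode_start = time.perf_counter()   # a decode-only timer starts here
    decode()
    end = time.perf_counter()
    return end - decode_start, end - start


# Simulated workload: schema sync dominates, decoding is cheap.
decode_only, end_to_end = prehandle_snapshot(
    sync_schema=lambda: time.sleep(0.05),
    decode=lambda: time.sleep(0.01),
)
print(f"decode only: {decode_only:.3f}s, end to end: {end_to_end:.3f}s")
```

In this sketch the decode-only figure stays small even when the schema sync is slow, which is why a heavy sync would not show up in a metric that only covers decoding.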

2. What did you expect to see? (Required)

There should be an easy way to identify a heavy syncSchema or PrehandleSnapshot job through metrics.

3. What did you see instead (Required)

There is no easy way.

4. What is your TiFlash version? (Required)

nightly


JaySon-Huang · 2021-12-28T05:11:53Z

What makes syncSchema take 7 min?

CalvinNeo · 2021-12-28T11:12:00Z

I deleted 3000 tables

JaySon-Huang · 2021-12-28T12:55:06Z

3000.0 / (7 * 60) ≈ 7.14, which is about 7 tables dropped per second, or roughly 0.14 seconds per table. That is still too slow.
Are you dropping 3000 empty tables? However, even if each table contains a large amount of data, that is still too slow. Do we have logs, or can we reproduce this scenario to explore why?
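As a quick check of the arithmetic, the division can be read either as a throughput or as a per-table cost. The sketch below uses only the two figures mentioned in this thread (3000 tables, 7 minutes); the variable names are illustrative.

```python
tables = 3000
elapsed_seconds = 7 * 60  # 7 minutes

tables_per_second = tables / elapsed_seconds   # throughput
seconds_per_table = elapsed_seconds / tables   # average cost per table

print(f"{tables_per_second:.2f} tables/s, {seconds_per_table:.2f} s/table")
# prints "7.14 tables/s, 0.14 s/table"
```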

CalvinNeo · 2021-12-28T13:15:11Z

"Drop 3000 tables cost 7min" is a rough description; I have a doc about this.

close #3759

CalvinNeo added the type/bug The issue is confirmed as a bug. label Dec 28, 2021

CalvinNeo self-assigned this Feb 8, 2022

CalvinNeo mentioned this issue Feb 8, 2022

Add metric to record end to end time to pre-handle snapshot #3978

Merged

12 tasks

CalvinNeo added the severity/moderate label Feb 28, 2022

ti-chi-bot closed this as completed in #3978 Mar 22, 2022

ti-chi-bot pushed a commit that referenced this issue Mar 22, 2022

Add metric to record end to end time to pre-handle snapshot (#3978)

2b71569

close #3759
