Make CI great again! #30822

Closed
6 of 8 tasks
tiancaiamao opened this issue Dec 17, 2021 · 6 comments · Fixed by #30828, #31371, #31876, #31892 or #31950
Labels
type/enhancement The issue or PR belongs to an enhancement.

Comments

@tiancaiamao
Contributor

tiancaiamao commented Dec 17, 2021

Enhancement

The unstable CI has been a hindrance to our daily development. Sometimes we have to run it again and again, and again ... to merge a PR.
It takes a very long time to run the CI, and if an unstable test fails, all that time is wasted.

We made many bad decisions in the past:

  • To make the CI run faster, we parallelized the tests in the wrong way
  • Auto re-run to compensate for the unstable test cases
  • No timeout limit on a single unit test
  • Tight coupling between the code and the running environment
  • Multiple teams maintain the CI, so responsibilities are unclear

Let's review them one by one.

We parallelize the tests in the wrong way. Some test cases are not free of side effects, for example:

  • enabling and disabling failpoints
  • modifying the configuration
  • changing global variables within a single process
  • etc.

If test cases with side effects run in parallel with others, they can cause some tests to fail unexpectedly.
When they're in a single OS process, making them parallel is not a big deal.
Running in parallel makes the CI a bit faster, but when a test fails, it takes us a lot of time to investigate the root cause.

We employ auto re-run using the CI bot to compensate for the unstable tests. IMHO, this is another failure.
The auto re-run tends to make us ignore the unstable cases. See #25899
As time goes on, there are more and more unstable test cases, and test cases fail almost randomly!
Later, if someone files a PR and all the tests pass without a retry, they might think: WOW, today I'm so lucky~

Back to parallelism: why do we want to run the tests in parallel? We want the CI to run faster because it's slow.
So why is it slow? Because we keep adding more and more test cases.
Well, I found that much of the code is poorly written. Some test cases call sleep() at will, and a single test case may run for more than 120s!
The problem is, there was no timeout limit on it, so I added one.
However, the timeout makes the CI more unstable, see #26717

Running in parallel requires more machine resources, and with retries, that can eat up all the machine resources!
The high load of the CI environment makes the test cases more likely to fail.
The situation goes from bad to worse.

So the tight coupling between the code and the running environment makes the running time of a single test case unpredictable.
A timeout limit on a single unit test doesn't work well.
I can see a test case always finish within 3s on my own computer, but that's not the case in the CI environment.

========================

What's the solution?

For step2, we can specify the test case by name manually:

go test -run TestXXX

And we can run different test cases in parallel, each in its own OS process, so the code is isolated and free of side effects.

@tiancaiamao
Contributor Author

tiancaiamao commented Dec 24, 2021

These are the test cases that fail when running make ut:

  • run test case error ddl TestDropAndTruncatePartition false 697.840205ms exit status 1 DONE
  • run test case error ddl TestTable false 987.525765ms exit status 1 DONE
  • run test case error ddl TestRenameTables false 698.35512ms exit status 1 DONE
  • run test case error session testPessimisticSuite.TestKillStopTTLManager true 2.925076624s exit status 1 DONE
  • run test case error statistics/handle testStatsSuite.TestUnsignedFeedbackRanges true 296.910124ms exit status 1 DONE
  • run test case error session testPessimisticSuite.TestDeadlock true 3.049245822s exit status 1 DONE
  • run test case error statistics/handle testStatsSuite.TestShowGlobalStats true 418.460595ms exit status 1
  • run test case error statistics/handle testStatsSuite.TestIndexQueryFeedback true 332.774841ms exit status 1 DONE
  • run test case error executor TestListPartitionWithGeneratedColumn false 711.682486ms signal: segmentation fault (core dumped)
  • run test case error statistics/handle testStatsSuite.TestQueryFeedbackForPartition true 297.275436ms exit status 1 DONE
  • run test case error statistics/handle testStatsSuite.TestLogDetailedInfo true 382.852014ms exit status 1
  • run test case error statistics/handle testSerialStatsSuite.TestFastAnalyzeColumnHistWithNullValue true 249.035558ms exit status 1 DONE DONE
  • run test case error store TestStoreSwitchPeer false 20.057916914s exit status 2
  • run test case error statistics/handle testStatsSuite.TestUpdatePartitionStatsByLocalFeedback true 358.007742ms exit status 1 DONE
  • run test case error statistics/handle testStatsSuite.TestSingleSessionInsert true 360.083119ms exit status 1
  • run test case error executor TestInsert false 1.171421508s signal: segmentation fault (core dumped)
  • run test case error statistics/handle testStatsSuite.TestUpdateErrorRate true 257.252356ms exit status 1 DONE
  • run test case error statistics/handle testStatsSuite.TestInitStats true 354.398747ms exit status 1 DONE
  • run test case error statistics/handle testSerialStatsSuite.TestAnalyzeGlobalStatsWithOpts true 2.569504521s exit status 1
  • run test case error statistics/handle testStatsSuite.TestUpdatePartition true 481.278584ms exit status 1
  • run test case error executor tiflashTestSuite.TestMppExecution true 20.061100711s exit status 2
  • run test case error statistics/handle testStatsSuite.TestQueryFeedback true 460.476413ms exit status 1 DONE
  • run test case error statistics/handle testStatsSuite.TestLoadStats true 246.205158ms exit status 1
  • run test case error ddl/failtest TestAddIndexWorkerNum false 20.047321335s exit status 2
  • run test case error statistics/handle testStatsSuite.TestIndexFMSketch true 772.683592ms exit status 1
  • run test case error statistics/handle testStatsSuite.TestUpdateStatsByLocalFeedback true 352.762524ms exit status 1 DONE
  • run test case error executor partitionTableSuite.TestIssue26251 true 861.502449ms exit status 1
  • run test case error session testPessimisticSuite.TestAsyncRollBackNoWait true 4.106900656s exit status 1 DONE
  • run test case error statistics/handle testSerialStatsSuite.TestAnalyzeGlobalStatsWithOpts2 true 561.472751ms exit status 1
  • run test case error statistics/handle testStatsSuite.TestIndexQueryFeedback4TopN true 311.97097ms exit status 1 DONE
  • run test case error executor partitionTableSuite.TestDirectReadingWithAgg true 20.056776646s exit status 2
  • run test case error store TestStoreErr false 20.073694749s exit status 2

fengou1 added a commit that referenced this issue Dec 27, 2021
* dumpling: fix default collation with upstream when dump database and table (#30292)

* ddl: fix the enum default value by triming trailing space (#30356)

* expression: migrate test-infra to testify for flag_simplify_test.go (#30407)

* server: refine code logic in handleDownloadFile (#30422)

* refine logic

Signed-off-by: yisaer <[email protected]>

* fix

Signed-off-by: yisaer <[email protected]>

* ddl: migrate test-infra to testify for ddl/table_test.go (#30267)

* ddl: handle the error from `addBatchDDLJobs()` correctly (#30401)

* br: fix the integration tests (#30423)

* util, cmd: remove unused filesort (#30438)

* *: update client-go for small backoff time (#30436)

* server: Fix unstable tests with FakeAuthSwitch (#30287)

* dumpling: fix dump failed when sequence exists (#30164)

* *: replace compareDatum by compare (#30421)

* lightning: fix gcs max key limit (#30393)

* expression, parser: add built-in func is_uuid (#30318)

* expression: migrate test-infra to testify for constant_fold_test.go (#30424)

* executor: fix pipelined window invalid memory address (#30418)

* makefile: add gotestsum for verify ci (#29848)

* server: close sql rows to fix unstable test (#30306)

* Makefile: add coverage record for BR and Dumpling (#30457)

* executor: track the mem usage of IndexMergeReader (#30210)

* infosync: close body when ReadAll encounters error (#30462)

* planner: show accessed partition when explain mpp query over partition table (#30367)

* *: Fix use of user identity in SHOW GRANTS + error messages (#30294)

* ddl: add not null flag for auto_increment column  (#30477)

* expression: make some unstable test serial (#30323)

* expression: migrate test-infra to testify for constant_propagation_test.go (#30430)

* executor: stable test TestSetDDLReorgBatchSize and TestSetDDLReorgWorkerCnt (#30480)

* statistics, util/ranger: add cardinality estimation trace for `GetRowCountBy...` (#30321)

* *: skip mysql client goroutine leak detection in integration ddl (#30467)

* executor,util: write slow query to slow log no matter what log level (#30461)

* executor: enable index_merge used in transaction. (#29875)

* logutil: add testcase for SlowQueryLogger.MaxDays/MaxSize/MaxBackups (#30316)

* expression: fix data race in builtin_other_vec_generated_test.go (#30503)

* expression: fix data race in the collationInfo (#30490)

* planner/core, session: fix error message of wrong variable scope (#30510)

* lightning: support Re/ReregisterMySQL by different tls name (#30463)

* executor: TestBatchGetandPointGetwithHashPartition test typo (#29669) (#29671)

* mockstore: improve log to avoid panic for nil pointer (#30513)

* *: replace compareDatum by compare, PR 10 (#30456)

* planner: Disable dynamic partition prune mode for all non-autocommit (#27532) (#30505)

* expression: change the log level of an confusing log from warn to debug (#30484)

* br: Check crypter.key valid before backup (#29991)

* *: replace compareDatum by compare, PR 11 (#30465)

* dumpling: fix default column collation with upstream when dump table (#30531)

* server: fix prepared cursor select (#30285)

* executor: HashJoinExec checks the buildError even if the probeSide is empty (#30471)

* parser, expression: follow mysql, increase interval precedence (#30528)

* makefile: set timeout 25m for make race (#30555)

* planner: fix the unstable test TestAnalyzeGlobalStatsWithOpts/2 (#30576)

* expression,types: Adjusts UNIX_TIMESTAMP() for non-existing DST values (#28739) (#30405)

* br: add res.Body.close to avoid leak (#30545)

* lightning: add back integration test lightning_error_summary (#30547)

* sessionctx/variable: small refactor (split large file) (#30511)

* ddl: let `admin cancel ddl jobs` run in a new transaction (#30549)

* *: Retry when placement PutBundles failed (#30590)

* dumpling: delete unit test in github actions (#30562)

* *: support trace plan target='estimation' statement (#30491)

* expression: migrate test-infra to testify for integration_test.go (#30548)

* planner: support trace for min/max eliminate (#30441)

* support min/max trace

Signed-off-by: yisaer <[email protected]>

* address the comment

Signed-off-by: yisaer <[email protected]>

Co-authored-by: Ti Chi Robot <[email protected]>

* br: remove cdclog in br (#30573)

* *: show cmd to check if all needed histograms are loaded (#29672)

* expression: clone repertoire when clone the scalar function (#30602)

* *: use the real StateRemote interface implementation for cached table (#30066)

* *: query failed after add index / timestamp out-of-range (#28424) (#29323)

* planner: implement collecting predicate columns from logical plan (#29878)

* *: show PK name when decoding the clustered index row key (#30623)

* ddl/callback_test.go: migrate test-infra to testify (#30317)

* *: Rename some names of placement ddl operation (#30622)

* executor: fix data race in the index_lookup_hash_join (#30619)

* ddl: remove unnecessary locking when adding an index (#29772)

* server: try to make `TidbTestSuite` more stable (#30643)

* *: Add some PD tests for placement and fix some bug found (#30621)

* *: migrate sync.WaitGroup to util.WaitGroupWrapper (#30644)

* planner: add trace for join eliminate rule (#30343)

* executor: migrate test-infra to testify for executor/shuffle_test.go (#30514)

* planner: make (*AccessPath).OnlyPointRange more succinct (#30520)

* planner: add trace for join reorder (#30394)

* executor: migrate test-infra to testify for executor/union_scan_test.go (#30525)

* expression: make cast return error if cast binary literal to another character set (#30537)

* *: update tikv client (#30670)

* *: update sysutil in go.mod to fix panic when search log (#30523)

* topsql: shouldn't evict the SQL meta, since the evicted SQL can be appear on Other components (TiKV) TopN records (#27050)

* testify: migrate test-infra to testify for analyze_test.go (#30640)

* util: replace compareDatum by compare,  point part (#30575)

* test: make all the tests run in serial (#30692)

* statistics: add mutex for Handle.globalMap and Handle.feedback (#30550)

* executor: fix regular expression in json so that it could match identifer start with '$' (#29750)

* util/testkit/testkit.go: fix typo (#30638)

* planner: Introduce a new global variable to control the historical statistics feature (#30646)

* topsql: introduce datasink interface (#30662)

* planner: unify the argument of stats functions to use SessionCtx instead of StatementContext (#30668)

* metrics: fix the Max SafeTS Gap metrics (#30689)

* lightning: Add source dir existence check for s3 (#30674)

* golangci-lint: support durationcheck (#30027)

* executor: fix data race on IndexHashJoin.cancelFunc (#30701)

* sessionctx/variable: change tidb_store_limit to global only (#30522)

* statistics: remove reassignment of Handle.pool in NewHandle (#30675)

* br: fix some unstable unit test cases. (#30716)

* bindinfo: fix the comment typo (#30616)

* server: support decoding prepared string args to character_set_client (#30723)

* expression: fix enum type join binary get wrong result (#30445)

* cmd/explaintest: fix wrong result comparison for explain test (#30717)

* parallel create tables in br

* metrics: fix copr-cache metrics (#30712)

* test: merge executor's serial tests to other tests (#30711)

* statistics: avoid deadlock when create/drop extended stats and analyze at the same time (#30566)

* ddl: add batch create table api

Signed-off-by: xhe <[email protected]>

* ddl: add unit tests

Signed-off-by: xhe <[email protected]>

* ddl: fix fmt

Signed-off-by: xhe <[email protected]>

* ddl: typo

Co-authored-by: Arenatlx <[email protected]>

* ddl: fix tests

Signed-off-by: xhe <[email protected]>

* ddl: rename to BatchCreateTableWithInfo

Signed-off-by: xhe <[email protected]>

* ddl: trace the error

Signed-off-by: xhe <[email protected]>

* ddl: comments

Signed-off-by: xhe <[email protected]>

* ddl: cancle the job right

Signed-off-by: xhe <[email protected]>

* ddl: cancel the job right 2

Signed-off-by: xhe <[email protected]>

* ddl: report error if entry too large

Signed-off-by: xhe <[email protected]>

* ddl: report error when table is duplicated

Signed-off-by: xhe <[email protected]>

* ddl: go fmt

Signed-off-by: xhe <[email protected]>

* infoschema: improve batch memory perf

Signed-off-by: xhe <[email protected]>

* ddl: retain ID

Signed-off-by: xhe <[email protected]>

* sessionctx: fix the value of analyze_version when upgrading 4.x to 5.… (#30743)

* ddl: reduce log frequency

Signed-off-by: xhe <[email protected]>

* ddl: fix tests

Signed-off-by: xhe <[email protected]>

* server: disable socket listener for `basicHTTPHandlerTestSuite` (#30680)

* planner: support the plan cache aware of bindings (#30169)

* planner: fix early set of plan's statisticsTable (#30754)

* *: implement renew write lock lease for cached table (#30206)

* *: Modify placement rule index to reserve some indexes for future work (#30737)

* executor: add an unit test case for unreasonable invoking Close (#30696)

* planner: fix wrong subquery's coercibility  (#30750)

* executor: add more testcases for index merge (#30497)

* server: add grpc server config for a suitable behavior (#30774)

* config, charset: make charset config not affected by collation config (#30572)

* lightning: emit tidb log by change FilterCore to only allow matched packages (#30700)

* topsql: a centralized place to generate tipb report data (#30781)

* planner: add trace for partition pruning (#30546)

* planner: refine collation handling for between (#30793)

* test: merge serial tests in bindinfo, expression, parser and statistics (#30749)

* br: update log description for split check (#30763)

* *: replace compareDatum by compare, range part (#30710)

* *: placement policy ref will be converted to direct options when recover or flashback table (#30705)

* ddl: handle the incorrect number of placement followers (#30715)

* ddl: revert "ddl: remove unnecessary locking when adding an index" (#30667)

* br/pkg/task: migrate test-infra to testify (#30605)

* *: fix the flen type datetime for union/case-when/control-funcs (#30588)

* types, util: clean up compareDatum (#30815)

* ddl: add helper function to set and query TiFlash's sync status (#30473)

* dumpling: fix more dumpling log level query template (#30176)

* parser: support `admin flush plan_cache` (#30747)

* topsql: support multiple datasinks (#30808)

* br: update permission, so tikv can write to folder when backup to local (#30396)

* session: fix bootstrap to only persist global variables (#30593)

close #28667

* docs/design: update collation compatibility issues in charsets doc (#30806)

* executor: improve SET sysvar=DEFAULT handling (#29680)

close #29670

* br: add error handling for group context cancel when restore file is corrupted (#30190)

close #30135

* executor: buildWindow cannot call typeInfer twice (#30773)

close #30402

* *: refactor encoding and uniform usages (#30288)

* lightning: optimize region split check logic (#30428)

close #30018

* br: ignore mock directory when gcov in br (#30586)

* *: forbid set tiflash replica count for a placement table (#30844)

close #30741

* execute: don't transform charset in internal sql (#30843)

close #30789

* planner: update PlanBuilder.windowSpecs when building subquery (#30878)

close #30804

* br: fix S3 backup endpoint suffix (#30530)

close #30104

* lightning: make pre-check output message clearer (#30439)

close #30395

* expression: wrap to_binary and from_binary for cast function's argument (#30706)

* executor: fix bug when using IndexMerge in transaction (#30719)

close #30685

* ddl: migrate test-infra to testify for ddl/foreign_key_test.go (#30853)

close #29103

* expression: fix wrong retType for reverse function (#30829)

close #30809

* planner: support trace topn push down (#30800)

ref #29661

* github: add issue requirement to pull request template (#30817)

close #30814

* fix merge issue

* topsql: introduce stmtstats and sql execution count (#30277)

* topsql: add pubsub datasink (#30860)

* executor: fix the incorrect untouch used in optimistic transactions (#30447)

close #30410

* expression, cmd: let crc32() support gbk (#30900)

close #30898

* server: Add uptime status var and statistics (#29790)

close #8842

* br: error log optimization (#29640)

close #27015

* planner: fix wrong collation when rewrite in condition (#30492)

close #30486

* planner: add extractor for tikv_region_peers (#30656)

* fix issue that loss table restore

* lightning: add back table empty check and add a switch config (#30887)

close #27919

* br: improve backoff unit test (#30892)

* *: add TxnManager to manage txn in session (#30574)

* *: add TxnManager to manage txn in session

* modify

* add tests

* move failpoint content to a single file

* Makefile: add `t.Parallel` check to ensure tests are run in serial (#30869)

* refactoring code

* refactoring code

* placement: remove isolationlevel (#30859)

close #30858

* planner: revise the optimize trace output (#30882)

* table: set the datum collation correctly in CastValue() (#30931)

close #30930

* *: Use TxnManager.GetTxnInfoSchema() to get the txn infoschema (#30934)

close #30933

* parser: add IsValid() to Encoding to speed up string validation for UTF-8 (#30937)

close #30936

* planner: rename pstmtPlanCacheXX to PlanCacheXX (#30909)

* table/tables: make CI TestCacheTableBasicReadAndWrite more stable (#30924)

close #30922

* restore: use new ScatterRegions API (#30899)

close #30425

* *: when placement conflicts with tiflash, cancel the job (#30945)

* Makefile,tools: make CI great again! (#30828)

close #30822

* br/pkg/membuf: remove global buffer pool (#29934)

* ddl: add format error for incorrect dict syntax in the placement rule (#30919)

close #30454

* planner: fix index merge plan when expr cannot be pushed to tikv (#30341)

close #30200

* executor: display 'show create table' and INFOSCHEMA for cached table correctly (#30951)

close #30950

* br: extend the timeout for scan region since 3 seconds is not enough (#30889)

close #30720

* planner: remove bindSQL from planCacheKey to planCacheValue (#30916)

* execution: refine precision of cast as decimal in agg func (#30805)

* *: fix data race in the tikv_client (#30964)

close #30658

* ddl: migrate test-infra to testify for ddl/db_partition_test.go (#30952)

close #28635

* planner: fix `AccessPath.TableFilters` got modified unexpectedly (#30966)

close #30965

* test: merge serial tests in ddl, infoschema, session, store, table, telemetry and types (#30874)

* executor: fix the returned field count of the prepare statement (#30981)

close #30971

* binlog: allow multiple ddl targets (#30904)

* planner: trace predicate push down  (#30902)

ref #29661

* placement: give default 2 followers for non-sugar syntax (#31000)

* flatten the json output (#30905)

Signed-off-by: yisaer <[email protected]>

Co-authored-by: Ti Chi Robot <[email protected]>

* test: control log level with environment variables (#30871)

* planner: add usage of paging copr in optimizer (#30536)

close #30578

* test: merge serial tests in cmd, planner, server, util (#31003)

* planner: change predicateColumnCollector to columnStatsUsageCollector and collect histogram-needed columns (#30671)

* executor: migrate test-infra to testify for distsql_test.go (#31023)

close #28574

* remote uncessary package errors

* reused the retry code from lightning

* refactoring retryable

* ddl: add batch create table api

Signed-off-by: xhe <[email protected]>

* ddl: add unit tests

Signed-off-by: xhe <[email protected]>

* ddl: fix fmt

Signed-off-by: xhe <[email protected]>

* br ddl code

* parallel create tables in br

* ddl: add batch create table api

Signed-off-by: xhe <[email protected]>

* ddl: add unit tests

Signed-off-by: xhe <[email protected]>

* ddl: fix fmt

Signed-off-by: xhe <[email protected]>

* ddl: typo

Co-authored-by: Arenatlx <[email protected]>

* ddl: fix tests

Signed-off-by: xhe <[email protected]>

* ddl: rename to BatchCreateTableWithInfo

Signed-off-by: xhe <[email protected]>

* ddl: trace the error

Signed-off-by: xhe <[email protected]>

* ddl: comments

Signed-off-by: xhe <[email protected]>

* ddl: cancle the job right

Signed-off-by: xhe <[email protected]>

* ddl: cancel the job right 2

Signed-off-by: xhe <[email protected]>

* ddl: report error if entry too large

Signed-off-by: xhe <[email protected]>

* ddl: report error when table is duplicated

Signed-off-by: xhe <[email protected]>

* ddl: go fmt

Signed-off-by: xhe <[email protected]>

* infoschema: improve batch memory perf

Signed-off-by: xhe <[email protected]>

* ddl: retain ID

Signed-off-by: xhe <[email protected]>

* ddl: reduce log frequency

Signed-off-by: xhe <[email protected]>

* ddl: fix tests

Signed-off-by: xhe <[email protected]>

* ddl: remove retainID from the interface

Signed-off-by: xhe <[email protected]>

* ddl: fix tests

Signed-off-by: xhe <[email protected]>

* executor: fix rebasing problem

Signed-off-by: xhe <[email protected]>

Co-authored-by: WizardXiao <[email protected]>
Co-authored-by: sylzd <[email protected]>
Co-authored-by: tison <[email protected]>
Co-authored-by: Song Gao <[email protected]>
Co-authored-by: Weizhen Wang <[email protected]>
Co-authored-by: tangenta <[email protected]>
Co-authored-by: 3pointer <[email protected]>
Co-authored-by: wjHuang <[email protected]>
Co-authored-by: Lei Zhao <[email protected]>
Co-authored-by: Daniël van Eeden <[email protected]>
Co-authored-by: Jianjun Liao <[email protected]>
Co-authored-by: unconsolable <[email protected]>
Co-authored-by: Shenghui Wu <[email protected]>
Co-authored-by: guo-shaoge <[email protected]>
Co-authored-by: Ryan Leung <[email protected]>
Co-authored-by: xufei <[email protected]>
Co-authored-by: Morgan Tocker <[email protected]>
Co-authored-by: Zhou Kunqin <[email protected]>
Co-authored-by: 王超 <[email protected]>
Co-authored-by: TonsnakeLin <[email protected]>
Co-authored-by: Ehco <[email protected]>
Co-authored-by: Mattias Jonsson <[email protected]>
Co-authored-by: HuaiyuXu <[email protected]>
Co-authored-by: Zak Zhao <[email protected]>
Co-authored-by: xhe <[email protected]>
Co-authored-by: Hangjie Mo <[email protected]>
Co-authored-by: Yuanjia Zhang <[email protected]>
Co-authored-by: glorv <[email protected]>
Co-authored-by: djshow832 <[email protected]>
Co-authored-by: Chunzhu Li <[email protected]>
Co-authored-by: Ti Chi Robot <[email protected]>
Co-authored-by: Xiaoju Wu <[email protected]>
Co-authored-by: xiongjiwei <[email protected]>
Co-authored-by: tiancaiamao <[email protected]>
Co-authored-by: Yifan Xu <[email protected]>
Co-authored-by: JmPotato <[email protected]>
Co-authored-by: Zach <[email protected]>
Co-authored-by: bb7133 <[email protected]>
Co-authored-by: lvtu <[email protected]>
Co-authored-by: crazycs <[email protected]>
Co-authored-by: znhh6018 <[email protected]>
Co-authored-by: eddie lin <[email protected]>
Co-authored-by: dongjunduo <[email protected]>
Co-authored-by: Zhenchi <[email protected]>
Co-authored-by: wangggong <[email protected]>
Co-authored-by: zhangjinpeng1987 <[email protected]>
Co-authored-by: Jack Yu <[email protected]>
Co-authored-by: Arenatlx <[email protected]>
Co-authored-by: Yiding Cui <[email protected]>
Co-authored-by: Chengpeng Yan <[email protected]>
Co-authored-by: bestwoody <[email protected]>
Co-authored-by: Calvin Neo <[email protected]>
Co-authored-by: Lynn <[email protected]>
Co-authored-by: Zhuhe Fang <[email protected]>
Co-authored-by: Mini256 <[email protected]>
Co-authored-by: Xiang Zhang <[email protected]>
Co-authored-by: Yexiang Zhang <[email protected]>
Co-authored-by: cfzjywxk <[email protected]>
Co-authored-by: db <[email protected]>
Co-authored-by: 山岚 <[email protected]>
Co-authored-by: Yujie Xia <[email protected]>
Co-authored-by: Yilong Li <[email protected]>
Co-authored-by: tuuuuuu <[email protected]>
Co-authored-by: qupeng <[email protected]>
Co-authored-by: you06 <[email protected]>
fengou1 added a commit that referenced this issue Dec 27, 2021
* expression: fix enum type join binary get wrong result (#30445)

* cmd/explaintest: fix wrong result comparison for explain test (#30717)

* parallel create tables in br

* metrics: fix copr-cache metrics (#30712)

* test: merge executor's serial tests to other tests (#30711)

* statistics: avoid deadlock when create/drop extended stats and analyze at the same time (#30566)

* ddl: add batch create table api

Signed-off-by: xhe <[email protected]>

* ddl: add unit tests

Signed-off-by: xhe <[email protected]>

* ddl: fix fmt

Signed-off-by: xhe <[email protected]>

* ddl: typo

Co-authored-by: Arenatlx <[email protected]>

* ddl: fix tests

Signed-off-by: xhe <[email protected]>

* ddl: rename to BatchCreateTableWithInfo

Signed-off-by: xhe <[email protected]>

* ddl: trace the error

Signed-off-by: xhe <[email protected]>

* ddl: comments

Signed-off-by: xhe <[email protected]>

* ddl: cancle the job right

Signed-off-by: xhe <[email protected]>

* ddl: cancel the job right 2

Signed-off-by: xhe <[email protected]>

* ddl: report error if entry too large

Signed-off-by: xhe <[email protected]>

* ddl: report error when table is duplicated

Signed-off-by: xhe <[email protected]>

* ddl: go fmt

Signed-off-by: xhe <[email protected]>

* infoschema: improve batch memory perf

Signed-off-by: xhe <[email protected]>

* ddl: retain ID

Signed-off-by: xhe <[email protected]>

* sessionctx: fix the value of analyze_version when upgrading 4.x to 5.… (#30743)

* ddl: reduce log frequency

Signed-off-by: xhe <[email protected]>

* ddl: fix tests

Signed-off-by: xhe <[email protected]>

* server: disable socket listener for `basicHTTPHandlerTestSuite` (#30680)

* planner: support the plan cache aware of bindings (#30169)

* planner: fix early set of plan's statisticsTable (#30754)

* *: implement renew write lock lease for cached table (#30206)

* *: Modify placement rule index to reserve some indexes for future work (#30737)

* executor: add an unit test case for unreasonable invoking Close (#30696)

* planner: fix wrong subquery's coercibility  (#30750)

* executor: add more testcases for index merge (#30497)

* server: add grpc server config for a suitable behavior (#30774)

* config, charset: make charset config not affected by collation config (#30572)

* lightning: emit tidb log by change FilterCore to only allow matched packages (#30700)

* topsql: a centralized place to generate tipb report data (#30781)

* planner: add trace for partition pruning (#30546)

* planner: refine collation handling for between (#30793)

* test: merge serial tests in bindinfo, expression, parser and statistics (#30749)

* br: update log description for split check (#30763)

* *: replace compareDatum by compare, range part (#30710)

* *: placement policy ref will be converted to direct options when recover or flashback table (#30705)

* ddl: handle the incorrect number of placement followers (#30715)

* ddl: revert "ddl: remove unnecessary locking when adding an index" (#30667)

* br/pkg/task: migrate test-infra to testify (#30605)

* *: fix the flen type datetime for union/case-when/control-funcs (#30588)

* types, util: clean up compareDatum (#30815)

* ddl: add helper function to set and query TiFlash's sync status (#30473)

* dumpling: fix more dumpling log level query template (#30176)

* parser: support `admin flush plan_cache` (#30747)

* topsql: support multiple datasinks (#30808)

* br: update permission, so tikv can write to folder when backup to local (#30396)

* session: fix bootstrap to only persist global variables (#30593)

close #28667

* docs/design: update collation compatibility issues in charsets doc (#30806)

* executor: improve SET sysvar=DEFAULT handling (#29680)

close #29670

* br: add error handling for group context cancel when restore file is corrupted (#30190)

close #30135

* executor: buildWindow cannot call typeInfer twice (#30773)

close #30402

* *: refactor encoding and uniform usages (#30288)

* lightning: optimize region split check logic (#30428)

close #30018

* br: ignore mock directory when gcov in br (#30586)

* *: forbid set tiflash replica count for a placement table (#30844)

close #30741

* execute: don't transform charset in internal sql (#30843)

close #30789

* planner: update PlanBuilder.windowSpecs when building subquery (#30878)

close #30804

* br: fix S3 backup endpoint suffix (#30530)

close #30104

* lightning: make pre-check output message clearer (#30439)

close #30395

* expression: wrap to_binary and from_binary for cast function's argument (#30706)

* executor: fix bug when using IndexMerge in transaction (#30719)

close #30685

* ddl: migrate test-infra to testify for ddl/foreign_key_test.go (#30853)

close #29103

* expression: fix wrong retType for reverse function (#30829)

close #30809

* planner: support trace topn push down (#30800)

ref #29661

* github: add issue requirement to pull request template (#30817)

close #30814

* fix merge issue

* topsql: introduce stmtstats and sql execution count (#30277)

* topsql: add pubsub datasink (#30860)

* executor: fix the incorrect untouch used in optimistic transactions (#30447)

close #30410

* expression, cmd: let crc32() support gbk (#30900)

close #30898

* server: Add uptime status var and statistics (#29790)

close #8842

* br: error log optimization (#29640)

close #27015

* planner: fix wrong collation when rewrite in condition (#30492)

close #30486

* planner: add extractor for tikv_region_peers (#30656)

* fix issue that loss table restore

* lightning: add back table empty check and add a switch config (#30887)

close #27919

* br: improve backoff unit test (#30892)

* *: add TxnManager to manage txn in session (#30574)

* *: add TxnManager to manage txn in session

* modify

* add tests

* move failpoint content to a single file

* Makefile: add `t.Parallel` check to ensure tests are run in serial (#30869)

* refactoring code

* refactoring code

* placement: remove isolationlevel (#30859)

close #30858

* planner: revise the optimize trace output (#30882)

* table: set the datum collation correctly in CastValue() (#30931)

close #30930

* *: Use TxnManager.GetTxnInfoSchema() to get the txn infoschema (#30934)

close #30933

* parser: add IsValid() to Encoding to speed up string validation for UTF-8 (#30937)

close #30936

* planner: rename pstmtPlanCacheXX to PlanCacheXX (#30909)

* table/tables: make CI TestCacheTableBasicReadAndWrite more stable (#30924)

close #30922

* restore: use new ScatterRegions API (#30899)

close #30425

* *: when placement conflicts with tiflash, cancel the job (#30945)

* Makefile,tools: make CI great again! (#30828)

close #30822

* br/pkg/membuf: remove global buffer pool (#29934)

* ddl: add format error for incorrect dict syntax in the placement rule (#30919)

close #30454

* planner: fix index merge plan when expr cannot be pushed to tikv (#30341)

close #30200

* executor: display 'show create table' and INFOSCHEMA for cached table correctly (#30951)

close #30950

* br: extend the timeout for scan region since 3 seconds is not enough (#30889)

close #30720

* planner: remove bindSQL from planCacheKey to planCacheValue (#30916)

* execution: refine precision of cast as decimal in agg func (#30805)

* *: fix data race in the tikv_client (#30964)

close #30658

* ddl: migrate test-infra to testify for ddl/db_partition_test.go (#30952)

close #28635

* planner: fix `AccessPath.TableFilters` got modified unexpectedly (#30966)

close #30965

* test: merge serial tests in ddl, infoschema, session, store, table, telemetry and types (#30874)

* executor: fix the returned field count of the prepare statement (#30981)

close #30971

* binlog: allow multiple ddl targets (#30904)

* planner: trace predicate push down  (#30902)

ref #29661

* placement: give default 2 followers for non-sugar syntax (#31000)

* flatten the json output (#30905)

Signed-off-by: yisaer <[email protected]>

Co-authored-by: Ti Chi Robot <[email protected]>

* test: control log level with environment variables (#30871)

* planner: add usage of paging copr in optimizer (#30536)

close #30578

* test: merge serial tests in cmd, planner, server, util (#31003)

* planner: change predicateColumnCollector to columnStatsUsageCollector and collect histogram-needed columns (#30671)

* executor: migrate test-infra to testify for distsql_test.go (#31023)

close #28574

* remote uncessary package errors

* reused the retry code from lightning

* refactoring retryable

* ddl: add batch create table api

Signed-off-by: xhe <[email protected]>

* ddl: add unit tests

Signed-off-by: xhe <[email protected]>

* ddl: fix fmt

Signed-off-by: xhe <[email protected]>

* br ddl code

* parallel create tables in br

* ddl: add batch create table api

Signed-off-by: xhe <[email protected]>

* ddl: add unit tests

Signed-off-by: xhe <[email protected]>

* ddl: fix fmt

Signed-off-by: xhe <[email protected]>

* ddl: typo

Co-authored-by: Arenatlx <[email protected]>

* ddl: fix tests

Signed-off-by: xhe <[email protected]>

* ddl: rename to BatchCreateTableWithInfo

Signed-off-by: xhe <[email protected]>

* ddl: trace the error

Signed-off-by: xhe <[email protected]>

* ddl: comments

Signed-off-by: xhe <[email protected]>

* ddl: cancle the job right

Signed-off-by: xhe <[email protected]>

* ddl: cancel the job right 2

Signed-off-by: xhe <[email protected]>

* ddl: report error if entry too large

Signed-off-by: xhe <[email protected]>

* ddl: report error when table is duplicated

Signed-off-by: xhe <[email protected]>

* ddl: go fmt

Signed-off-by: xhe <[email protected]>

* infoschema: improve batch memory perf

Signed-off-by: xhe <[email protected]>

* ddl: retain ID

Signed-off-by: xhe <[email protected]>

* ddl: reduce log frequency

Signed-off-by: xhe <[email protected]>

* ddl: fix tests

Signed-off-by: xhe <[email protected]>

* ddl: remove retainID from the interface

Signed-off-by: xhe <[email protected]>

* ddl: fix tests

Signed-off-by: xhe <[email protected]>

* executor: fix rebasing problem

Signed-off-by: xhe <[email protected]>

* sessionctx: enable IndexMerge by default (#30650)

close #29597

* br: Enable lint `gosec` in br (#30895)

close #30699

* planner: support 'admin flush plan cache' (#30370)

* merge from tidb batch_1

Co-authored-by: 3pointer <[email protected]>
Co-authored-by: wjHuang <[email protected]>
Co-authored-by: Lei Zhao <[email protected]>
Co-authored-by: Daniël van Eeden <[email protected]>
Co-authored-by: sylzd <[email protected]>
Co-authored-by: Jianjun Liao <[email protected]>
Co-authored-by: unconsolable <[email protected]>
Co-authored-by: tison <[email protected]>
Co-authored-by: Shenghui Wu <[email protected]>
Co-authored-by: tangenta <[email protected]>
Co-authored-by: Weizhen Wang <[email protected]>
Co-authored-by: guo-shaoge <[email protected]>
Co-authored-by: Ryan Leung <[email protected]>
Co-authored-by: xufei <[email protected]>
Co-authored-by: Morgan Tocker <[email protected]>
Co-authored-by: Zhou Kunqin <[email protected]>
Co-authored-by: 王超 <[email protected]>
Co-authored-by: TonsnakeLin <[email protected]>
Co-authored-by: Ehco <[email protected]>
Co-authored-by: Mattias Jonsson <[email protected]>
Co-authored-by: HuaiyuXu <[email protected]>
Co-authored-by: Zak Zhao <[email protected]>
Co-authored-by: WizardXiao <[email protected]>
Co-authored-by: xhe <[email protected]>
Co-authored-by: Hangjie Mo <[email protected]>
Co-authored-by: Yuanjia Zhang <[email protected]>
Co-authored-by: glorv <[email protected]>
Co-authored-by: djshow832 <[email protected]>
Co-authored-by: Chunzhu Li <[email protected]>
Co-authored-by: Song Gao <[email protected]>
Co-authored-by: Ti Chi Robot <[email protected]>
Co-authored-by: Xiaoju Wu <[email protected]>
Co-authored-by: xiongjiwei <[email protected]>
Co-authored-by: tiancaiamao <[email protected]>
Co-authored-by: Yifan Xu <[email protected]>
Co-authored-by: JmPotato <[email protected]>
Co-authored-by: Zach <[email protected]>
Co-authored-by: bb7133 <[email protected]>
Co-authored-by: lvtu <[email protected]>
Co-authored-by: crazycs <[email protected]>
Co-authored-by: znhh6018 <[email protected]>
Co-authored-by: eddie lin <[email protected]>
Co-authored-by: dongjunduo <[email protected]>
Co-authored-by: Zhenchi <[email protected]>
Co-authored-by: wangggong <[email protected]>
Co-authored-by: zhangjinpeng1987 <[email protected]>
Co-authored-by: Jack Yu <[email protected]>
Co-authored-by: Arenatlx <[email protected]>
Co-authored-by: Yiding Cui <[email protected]>
Co-authored-by: Chengpeng Yan <[email protected]>
Co-authored-by: bestwoody <[email protected]>
Co-authored-by: Calvin Neo <[email protected]>
Co-authored-by: Lynn <[email protected]>
Co-authored-by: Zhuhe Fang <[email protected]>
Co-authored-by: Mini256 <[email protected]>
Co-authored-by: Xiang Zhang <[email protected]>
Co-authored-by: Yexiang Zhang <[email protected]>
Co-authored-by: cfzjywxk <[email protected]>
Co-authored-by: db <[email protected]>
Co-authored-by: 山岚 <[email protected]>
Co-authored-by: Yujie Xia <[email protected]>
Co-authored-by: Yilong Li <[email protected]>
Co-authored-by: tuuuuuu <[email protected]>
Co-authored-by: qupeng <[email protected]>
Co-authored-by: you06 <[email protected]>
@Xuanwo Xuanwo moved this to 📋 Backlog in Xuanwo's Work Dec 31, 2021
@Xuanwo
Contributor

Xuanwo commented Dec 31, 2021

Let's make CI great again! I'm interested in fixing some unit tests. Any advice on which one to pick up?

@tiancaiamao
Contributor Author

tiancaiamao commented Dec 31, 2021

> Let's make CI great again! I'm interested in fixing some unit tests. Any advice on which one to pick up?

Of course! For example, you can run `make ut X='run statistics/handle'` or `make ut X='run executor'` and fix the test cases that fail.

Feel free to file an issue and a PR, and just link them to this issue. @Xuanwo

Once all the test cases are fixed and a few other problems are solved, we'll use it to replace the default pipeline script. At that point the CI should be more stable.
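
A minimal sketch of that workflow, going through packages one at a time. The only commands taken from this thread are the two `make ut X='run ...'` invocations; the `ut_cmd` helper, the `PKGS` list, and the `RUN` switch are hypothetical names introduced here for illustration. By default the script only prints the commands, so it is safe to run outside the repository.

```shell
#!/bin/sh
# Packages to check; these two are the ones named in the comment above.
PKGS="statistics/handle executor"

# Hypothetical helper: build the make command line for one package.
ut_cmd() {
  printf "make ut X='run %s'" "$1"
}

# Print each command by default; set RUN=1 to execute it from the repository root.
for pkg in $PKGS; do
  cmd=$(ut_cmd "$pkg")
  echo "$cmd"
  if [ "${RUN:-0}" = "1" ]; then
    eval "$cmd" || echo "failed test cases in $pkg"
  fi
done
```

Running it with `RUN=1` from a checkout executes the packages in order and reports which ones still have failing test cases.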

@tiancaiamao
Contributor Author

tiancaiamao commented Feb 14, 2022

Unstable test cases found after switching to the new CI script are tracked in a separate issue:
#32303

@tisonkun
Contributor

tisonkun commented May 18, 2022

@tiancaiamao we can now remove the "old" test suite support, since #26022 has been resolved.

Here is a patch you may make use of:

patch.diff.txt
