cdc/bank roachtest pulls 260MB from a 3rd party vendor on every CI run, and fails if upstream is unavailable #51543
Comments
I have marked the 3 roachtests that use this facility as skipped.

cockroachdb#51543

Release note: None
@mwang1026 @dt the KV team meeting concluded that since Bulk I/O owns the CDC product area, the Bulk I/O team is responsible for enhancing the testing infrastructure for CDC tests. So we're pushing this to your plate. Note that the test is currently skipped, which means we have disabled test coverage for CDC. That makes addressing this critical path for the next release.
Thanks @knz. @mwang1026 we should potentially re-enable this for now -- while it'd be nice to have it cached, 262MB once a night is a pretty minimal cost (compared to, say, the VMs), and while I hate flakes due to non-reproducible builds depending on external infra, not testing at all is worse.
It turns out that https://github.com/cockroachdb/cockroach/blob/master/build/teamcity-local-roachtest.sh#L37
Yes, in fact on every CI run there are three (not one) tests that do this, so the archive gets downloaded and extracted 3 times. It's not just our network ingress $$ that this impacts; the upstream server probably blocked us because we were incurring outrageous egress $$ on their side.
cc @cockroachdb/cdc |
Reassigning this to the CDC team, as this has to do with the implementation of the roachtest.
Describe the problem
The cdc/bank roachtest runs the following command every time it runs:

I went and checked and that is a 262MB archive to download (compressed). The archive is not cached, unlike the builder image, so that's a mandatory ingress cost on every CI run.

Moreover, today the upstream HTTP server is saying "no" and is causing all the CI runs to fail.
Expected behavior
The archive should be embedded in the builder image, and/or the fetch should use a cached copy if it was already downloaded earlier on the TC agent.
(At the very least we should be fetching from a proxy cache inside the CRL infra so that the CI downloads are internal to GCP).
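To illustrate the cache-on-agent idea, here is a minimal sketch of a cached fetch helper. This is not the actual roachtest or TeamCity code; the function name, cache directory, and URL are hypothetical, and the demo pre-seeds the cache so it runs without network access:

```shell
#!/usr/bin/env bash
set -euo pipefail

# fetch_cached URL CACHE_DIR: download URL into CACHE_DIR unless a copy
# from an earlier run is already present on the agent.
fetch_cached() {
  local url="$1" cache_dir="$2"
  local name path
  name="$(basename "$url")"
  path="$cache_dir/$name"
  mkdir -p "$cache_dir"
  if [[ -f "$path" ]]; then
    echo "cache hit: $path"
  else
    echo "cache miss: downloading $url"
    # -f fails on HTTP errors so a bad upstream doesn't poison the cache;
    # download to a temp file and move it into place atomically.
    curl -fSL -o "$path.tmp" "$url"
    mv "$path.tmp" "$path"
  fi
}

# Demo with a pre-seeded cache (no network needed): the fetch is a hit.
demo_cache="$(mktemp -d)"
: > "$demo_cache/import.tgz"   # simulate an archive from an earlier run
fetch_cached "https://example.com/import.tgz" "$demo_cache"
```

A real implementation would also want integrity checking (e.g. comparing a known checksum before trusting the cached copy), which this sketch omits.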
cc @jlinder @tbg for triage.
Epic DEVINF-109
Jira issue: CRDB-4033