Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CI Failure (failed to hydrate chunk starting at 0) in CloudStorageTimingStressTest.test_cloud_storage_with_partition_moves #10868

Closed
travisdowns opened this issue May 18, 2023 · 11 comments · Fixed by #11093
Assignees
Labels
area/cloud-storage Shadow indexing subsystem ci-failure

Comments

@travisdowns
Copy link
Member

https://buildkite.com/redpanda/redpanda/builds/29339#01882bb2-7bdb-49d9-b79b-7bcc668a9a40

Module: rptest.tests.cloud_storage_timing_stress_test
Class:  CloudStorageTimingStressTest
Method: test_cloud_storage_with_partition_moves

Failure:

    <BadLogLines nodes=docker-rp-18(1) example="ERROR 2023-05-17 22:06:46,417 [shard 1] cloud_storage - [fiber278 bee5085b/kafka/test-topic/0_20/7905-8036-4213422-2-v1.log.2] - segment_chunk_data_source.cc:84 - failed to hydrate chunk starting at 0, error: seastar::abort_requested_exception (abort requested)">
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/dist-packages/ducktape/tests/runner_client.py", line 135, in run
    data = self.run_test()
  File "/usr/local/lib/python3.10/dist-packages/ducktape/tests/runner_client.py", line 227, in run_test
    return self.test_context.function(self.test)
  File "/root/tests/rptest/services/cluster.py", line 87, in wrapped
    redpanda.raise_on_bad_logs(
  File "/root/tests/rptest/services/redpanda.py", line 2274, in raise_on_bad_logs
    raise BadLogLines(bad_lines)
rptest.services.utils.BadLogLines: <BadLogLines nodes=docker-rp-18(1) example="ERROR 2023-05-17 22:06:46,417 [shard 1] cloud_storage - [fiber278 bee5085b/kafka/test-topic/0_20/7905-8036-4213422-2-v1.log.2] - segment_chunk_data_source.cc:84 - failed to hydrate chunk starting at 0, error: seastar::abort_requested_exception (abort requested)">
@travisdowns travisdowns added kind/bug Something isn't working ci-failure labels May 18, 2023
@mmaslankaprv mmaslankaprv added the area/cloud-storage Shadow indexing subsystem label May 20, 2023
@dlex
Copy link
Contributor

dlex commented May 22, 2023

@abhijat
Copy link
Contributor

abhijat commented May 22, 2023

This should have been fixed with the extended allow list in #10852. Will keep open for observation and close in a couple of days if no new instances are seen.

Removed the bug label as the behavior is expected.

@michael-redpanda
Copy link
Contributor

@NyaliaLui
Copy link
Contributor

FAIL test: CloudStorageTimingStressTest.test_cloud_storage_with_partition_moves.cleanup_policy=compact.delete (7/31 runs)
  failure at 2023-05-26T07:26:58.598Z: <BadLogLines nodes=docker-rp-30(1) example="ERROR 2023-05-26 06:53:35,183 [shard 1] cloud_storage - [fiber658 889ded78/kafka/test-topic/0_20/6804-6933-4210586-1-v1.log.1] - segment_chunk_data_source.cc:108 - failed to hydrate chunk starting at 0, error: seastar::abort_requested_exception (abort requested)">
      on (arm64, container) in job https://buildkite.com/redpanda/redpanda/builds/29955#018856bd-c63f-4485-97e7-488126129857
FAIL test: CloudStorageTimingStressTest.test_cloud_storage_with_partition_moves.cleanup_policy=delete (4/30 runs)
  failure at 2023-05-25T19:47:10.649Z: <BadLogLines nodes=docker-rp-21(1) example="ERROR 2023-05-25 19:09:24,798 [shard 0] cloud_storage - [fiber175 e7fb37e0/kafka/test-topic/0_20/6726-7118-12640220-1-v1.log.1] - segment_chunk_data_source.cc:108 - failed to hydrate chunk starting at 0, error: seastar::abort_requested_exception (abort requested)">
      on (amd64, container) in job https://buildkite.com/redpanda/redpanda/builds/29913#0188543e-447d-4d57-bc2d-7ec634915b39

@abhijat
Copy link
Contributor

abhijat commented May 30, 2023

It looks like the chain of exception handlers passes the exception down:

DEBUG 2023-05-29 22:22:42,203 [shard 3] cloud_storage - [fiber254 a95c2145/kafka/test-topic/0_21/7565-7693-4213193-1-v1.log.1] - segment_chunk_data_source.cc:85 - abort requested during hydration of chunk starting at 0: seastar::abort_requested_exception (abort requested)
ERROR 2023-05-29 22:22:42,203 [shard 3] cloud_storage - [fiber254 a95c2145/kafka/test-topic/0_21/7565-7693-4213193-1-v1.log.1] - segment_chunk_data_source.cc:108 - failed to hydrate chunk starting at 0, error: seastar::abort_requested_exception (abort requested)

The first debug log is intended, but the exception returned from that handler is passed onto the last handler which results in the error log.

@abhijat
Copy link
Contributor

abhijat commented May 30, 2023

https://buildkite.com/redpanda/redpanda/builds/30090#01886640-337a-4ad4-b2d8-1bc3bb6ea98a

[INFO  - 2023-05-29 07:21:11,123 - runner_client - log - lineno:278]: RunnerClient: rptest.tests.cloud_storage_timing_stress_test.CloudStorageTimingStressTest.test_cloud_storage_with_partition_moves.cleanup_policy=delete: Summary: <BadLogLines nodes=docker-rp-17(1) example="ERROR 2023-05-29 07:20:22,496 [shard 0] cloud_storage - [fiber455 84bcf2f1/kafka/test-topic/0_21/7630-8024-12640498-2-v1.log.2] - segment_chunk_data_source.cc:108 - failed to hydrate chunk starting at 0, error: seastar::abort_requested_exception (abort requested)">
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/dist-packages/ducktape/tests/runner_client.py", line 135, in run
    data = self.run_test()
  File "/usr/local/lib/python3.10/dist-packages/ducktape/tests/runner_client.py", line 227, in run_test
    return self.test_context.function(self.test)
  File "/usr/local/lib/python3.10/dist-packages/ducktape/mark/_mark.py", line 481, in wrapper
    return functools.partial(f, *args, **kwargs)(*w_args, **w_kwargs)
  File "/root/tests/rptest/services/cluster.py", line 87, in wrapped
    redpanda.raise_on_bad_logs(
  File "/root/tests/rptest/services/redpanda.py", line 2663, in raise_on_bad_logs
    raise BadLogLines(bad_lines)
rptest.services.utils.BadLogLines: <BadLogLines nodes=docker-rp-17(1) example="ERROR 2023-05-29 07:20:22,496 [shard 0] cloud_storage - [fiber455 84bcf2f1/kafka/test-topic/0_21/7630-8024-12640498-2-v1.log.2] - segment_chunk_data_source.cc:108 - failed to hydrate chunk starting at 0, error: seastar::abort_requested_exception (abort requested)">

https://buildkite.com/redpanda/vtools/builds/7827#0188691e-0212-4567-a5bb-e59ae97dd98d

21/7565-7693-4213193-1-v1.log.1] - segment_chunk_data_source.cc:108 - failed to hydrate chunk starting at 0, error: seastar::abort_requested_exception (abort requested)">
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/dist-packages/ducktape/tests/runner_client.py", line 135, in run
    data = self.run_test()
  File "/usr/local/lib/python3.10/dist-packages/ducktape/tests/runner_client.py", line 227, in run_test
    return self.test_context.function(self.test)
  File "/usr/local/lib/python3.10/dist-packages/ducktape/mark/_mark.py", line 481, in wrapper
    return functools.partial(f, *args, **kwargs)(*w_args, **w_kwargs)
  File "/home/ubuntu/redpanda/tests/rptest/services/cluster.py", line 87, in wrapped
    redpanda.raise_on_bad_logs(
  File "/home/ubuntu/redpanda/tests/rptest/services/redpanda.py", line 2663, in raise_on_bad_logs
    raise BadLogLines(bad_lines)
rptest.services.utils.BadLogLines: <BadLogLines nodes=ip-172-31-0-250(1) example="ERROR 2023-05-29 22:22:42,203 [shard 3] cloud_storage - [fiber254 a95c2145/kafka/test-topic/0_21/7565-7693-4213193-1-v1.log.1] - segment_chunk_data_source.cc:108 - failed to hydrate chunk starting at 0, error: seastar::abort_requested_exception (abort requested)">

@vshtokman
Copy link
Contributor

@twmb
Copy link
Contributor

twmb commented Jun 2, 2023

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/cloud-storage Shadow indexing subsystem ci-failure
Projects
None yet
Development

Successfully merging a pull request may close this issue.

8 participants