CI Failure (restored partition is too small) in TopicRecoveryTest.test_fast2 #6356

Closed
rystsov opened this issue Sep 9, 2022 · 5 comments
Labels: area/cloud-storage (Shadow indexing subsystem), area/tests, ci-failure, kind/bug (Something isn't working), pr-blocker (CI failures blocking a PR from being merged)

Comments

Contributor

rystsov commented Sep 9, 2022

https://buildkite.com/redpanda/redpanda/builds/15179#01832351-c1ef-4ee2-920f-ac41ebbb79d7

Module: rptest.tests.topic_recovery_test
Class:  TopicRecoveryTest
Method: test_fast2
test_id:    rptest.tests.topic_recovery_test.TopicRecoveryTest.test_fast2
status:     FAIL
run time:   1 minute 23.901 seconds

    AssertionError("NTP NTP(ns='kafka', topic='panda-topic-2', partition=2) the restored partition is too small 3533238. The original is 3550365 bytes which 17127 bytes larger.")
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/dist-packages/ducktape/tests/runner_client.py", line 135, in run
    data = self.run_test()
  File "/usr/local/lib/python3.10/dist-packages/ducktape/tests/runner_client.py", line 227, in run_test
    return self.test_context.function(self.test)
  File "/root/tests/rptest/services/cluster.py", line 35, in wrapped
    r = f(self, *args, **kwargs)
  File "/root/tests/rptest/tests/topic_recovery_test.py", line 1306, in test_fast2
    self.do_run(test_case)
  File "/root/tests/rptest/tests/topic_recovery_test.py", line 1212, in do_run
    test_case.validate_cluster(baseline, restored)
  File "/root/tests/rptest/tests/topic_recovery_test.py", line 693, in validate_cluster
    verify_file_layout(baseline,
  File "/root/tests/rptest/utils/si_utils.py", line 204, in verify_file_layout
    assert delta <= BLOCK_SIZE, \
AssertionError: NTP NTP(ns='kafka', topic='panda-topic-2', partition=2) the restored partition is too small 3533238. The original is 3550365 bytes which 17127 bytes larger.
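
For readers unfamiliar with the check, here is a minimal sketch of the size comparison that the traceback points at (`assert delta <= BLOCK_SIZE` in `verify_file_layout`). It is not the code from `si_utils.py`: the function names and the `BLOCK_SIZE` value are assumptions for illustration, since the issue does not show them. Only the assertion condition and the message format are taken from the log above.

```python
# Hypothetical tolerance. The real BLOCK_SIZE in si_utils.py is not shown in this issue.
BLOCK_SIZE = 4096


def size_delta(original_bytes: int, restored_bytes: int) -> int:
    """Return how many bytes the restored partition is short of the original."""
    return original_bytes - restored_bytes


def check_restored_size(ntp: str,
                        original_bytes: int,
                        restored_bytes: int,
                        block_size: int = BLOCK_SIZE) -> int:
    """Fail if the restored partition is smaller than the original by more than block_size."""
    delta = size_delta(original_bytes, restored_bytes)
    if delta > block_size:
        # Message format copied from the failure reported in this issue.
        raise AssertionError(
            f"NTP {ntp} the restored partition is too small {restored_bytes}. "
            f"The original is {original_bytes} bytes which {delta} bytes larger.")
    return delta
```

With the sizes from this run, `size_delta(3550365, 3533238)` gives 17127, which matches the number in the assertion message, so the reported delta is simply original minus restored.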
rystsov added the kind/bug (Something isn't working), ci-failure, and pr-blocker (CI failures blocking a PR from being merged) labels on Sep 9, 2022
rystsov changed the title from "CI Failure in TopicRecoveryTest.test_fast2" to "CI Failure (restored partition is too small) in TopicRecoveryTest.test_fast2" on Sep 9, 2022
Contributor Author

rystsov commented Sep 9, 2022

The issue was previously discussed in #4960. Also, it seems that it has the same root cause as #4887.

Contributor

mmedenjak commented Oct 7, 2022

Also it seems that it has the same root cause as #4887

From Evgeny:

Looks like two different issues to me (the timeout issue could probably be solved by simply downsizing the data set, since it generates a lot of segments).

Contributor

jcsp commented Oct 18, 2022

Last failure of this kind was:
failure at 2022-10-02T09:35:22.985Z: AssertionError("NTP NTP(ns='kafka', topic='panda-topic-2', partition=1) the restored partition is too small 3469188. The original is 3496805 bytes which 27617 bytes larger.")
in job https://buildkite.com/redpanda/vtools/builds/3627#018396a6-afdf-4dae-8a44-ddac93c4a005

Contributor

NyaliaLui commented Nov 9, 2022

Possibly related failure in https://buildkite.com/redpanda/redpanda/builds/18184#01845ac8-1a7f-4763-b63e-e53ec491599c/6-678

    AssertionError('old_hw 3319 unexpectedly same as new_hw 3319 for topic spec: panda-topic-1')
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/dist-packages/ducktape/tests/runner_client.py", line 135, in run
    data = self.run_test()
  File "/usr/local/lib/python3.10/dist-packages/ducktape/tests/runner_client.py", line 227, in run_test
    return self.test_context.function(self.test)
  File "/root/tests/rptest/services/cluster.py", line 35, in wrapped
    r = f(self, *args, **kwargs)
  File "/root/tests/rptest/tests/topic_recovery_test.py", line 1352, in test_fast2
    self.do_run(test_case)
  File "/root/tests/rptest/tests/topic_recovery_test.py", line 1258, in do_run
    test_case.validate_cluster(baseline, restored)
  File "/root/tests/rptest/tests/topic_recovery_test.py", line 706, in validate_cluster
    self._produce_and_verify(topic)
  File "/root/tests/rptest/tests/topic_recovery_test.py", line 219, in _produce_and_verify
    assert old_state.hwm != new_state.hwm, \
AssertionError: old_hw 3319 unexpectedly same as new_hw 3319 for topic spec: panda-topic-1
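
This second failure is a different check: the traceback shows `_produce_and_verify` asserting that the high watermark moved (`old_state.hwm != new_state.hwm`). A minimal sketch of that condition follows. The `PartitionState` class and the `verify_hwm_advanced` name are hypothetical stand-ins, not the test's real types. Only the `hwm` attribute, the comparison, and the message format come from the log.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PartitionState:
    """Hypothetical snapshot of a partition; only `hwm` appears in the traceback."""
    hwm: int  # high watermark


def verify_hwm_advanced(topic: str,
                        old_state: PartitionState,
                        new_state: PartitionState) -> None:
    """Fail if the high watermark is unchanged between the two snapshots."""
    if old_state.hwm == new_state.hwm:
        # Message format copied from the failure reported above.
        raise AssertionError(
            f"old_hw {old_state.hwm} unexpectedly same as new_hw "
            f"{new_state.hwm} for topic spec: {topic}")
```

An unchanged watermark fails this check regardless of partition size, which is consistent with treating it as a separate failure from the size mismatch in the original report.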

Contributor

jcsp commented Nov 10, 2022

The failure in this ticket's OP has not reoccurred in the last 30 days. Opened #7214 for the failure that was seen yesterday.
