
Fix data loss consistency violation #6019

Merged: 6 commits into redpanda-data:dev on Aug 20, 2022
Conversation

@rystsov (Contributor) commented on Aug 13, 2022

Cover letter

The Kafka API doesn't have an explicit begin-txn call: when a transaction coordinator receives the first add_partition or add_group request, it starts a transaction. Redpanda also defers disk flushes until the commit moment. The combination of these two things causes a problem:

  1. a client issues add_partition to the txn coordinator
  2. the txn coordinator starts a transaction (in-memory state only)
  3. the client writes a message to a data partition
  4. Redpanda treats the write as acks=1 and acks the request before replication finishes (this is safe because on commit Redpanda checks that all pending replication is done)
  5. the txn coordinator and the data partition experience re-election, and the in-memory state is lost
  6. the client issues add_group to the txn coordinator
  7. the txn coordinator, unaware of the ongoing txn, starts a new transaction
  8. the client commits the txn, and only the consumer-group change is written

Since the data record was written with acks=1 just before the re-election, it is lost, and the client never learns that anything went wrong with the transaction.
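The failure sequence above can be modeled in a few lines of Python. This is an illustrative sketch only: the class and method names are hypothetical, not Redpanda code, and the durable log is reduced to a plain list.

```python
# Illustrative model of the split-brain scenario: the txn coordinator
# tracks membership only in memory, so re-election wipes it.

class TxnCoordinator:
    def __init__(self):
        self.ongoing = set()  # in-memory only: members added to the txn
        self.log = []         # durable log, written only at commit

    def add_partition(self, txn_id, partition):
        # Implicit begin: the first add_partition starts the transaction,
        # but nothing is persisted yet.
        self.ongoing.add((txn_id, partition))

    def add_group(self, txn_id, group):
        # After re-election the coordinator forgot the partition above,
        # so this silently starts a "new" txn containing only the group.
        self.ongoing.add((txn_id, group))

    def re_election(self):
        self.ongoing = set()  # the in-memory state is lost

    def commit(self, txn_id):
        # Only what the coordinator currently knows gets committed.
        committed = [m for m in self.ongoing if m[0] == txn_id]
        self.log.extend(committed)
        return committed

coord = TxnCoordinator()
coord.add_partition("tx1", "data-partition-0")  # steps 1-2
# steps 3-4: the acks=1 data write happens here, still unreplicated
coord.re_election()                             # step 5
coord.add_group("tx1", "consumer-group-A")      # steps 6-7
result = coord.commit("tx1")                    # step 8
print(result)  # [('tx1', 'consumer-group-A')] — the data write is gone
```

The commit succeeds from the client's point of view, yet only the consumer-group change reaches the log, which is exactly the violation described above.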

Fixes #6018

Backport Required

  • not a bug fix
  • papercut/not impactful enough to backport
  • v22.2.x
  • v22.1.x
  • v21.11.x

UX changes

Release notes

Bug Fixes

  • Fix consistency violation caused by split-brain of the txn coordinator

@rystsov rystsov requested a review from a team as a code owner August 13, 2022 19:38
@rystsov rystsov requested review from andrewhsu and removed request for a team August 13, 2022 19:38
@rystsov rystsov requested review from mmaslankaprv and removed request for andrewhsu August 13, 2022 19:38
Review threads (resolved):
  • tools/offline_log_viewer/kafka.py
  • src/v/cluster/tx_gateway_frontend.cc
  • src/v/cluster/tm_stm.cc
@rystsov (Contributor, Author) commented on Aug 19, 2022

PartitionBalancerTest.test_full_nodes - #5884

@dotnwat previously approved these changes on Aug 19, 2022
Review threads (resolved):
  • src/v/cluster/tx_gateway_frontend.cc
  • tools/offline_log_viewer/storage.py
parser.add_argument(
    '--path',
    type=str,
    help='Path to the log desired to be analyzed')
A reviewer (Member) commented:
If you'd like to integrate more tightly with argument parsing, you can change type=str to type=validate_path and define validate_path as:

import os
from os.path import join
from argparse import ArgumentTypeError

def validate_path(path):
    if not os.path.exists(path):
        raise ArgumentTypeError(f"Path doesn't exist: {path}")
    controller = join(path, "redpanda", "controller")
    if not os.path.exists(controller):
        raise ArgumentTypeError(
            f"Each redpanda data dir should have a controller piece, "
            f"but {controller} wasn't found")
    return path
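To see the reviewer's suggestion end to end, here is a self-contained sketch that wires the suggested validate_path helper into argparse and exercises it against a throwaway directory. validate_path is the reviewer's proposed helper, not existing repo code, and the directory layout is built just for the demonstration.

```python
import argparse
import os
import tempfile
from argparse import ArgumentTypeError
from os.path import join

def validate_path(path):
    # Reject paths that don't exist or lack the expected controller dir.
    if not os.path.exists(path):
        raise ArgumentTypeError(f"Path doesn't exist: {path}")
    controller = join(path, "redpanda", "controller")
    if not os.path.exists(controller):
        raise ArgumentTypeError(f"controller piece not found: {controller}")
    return path

parser = argparse.ArgumentParser()
# argparse runs the `type` callable on the raw string and reports
# ArgumentTypeError as a normal usage error.
parser.add_argument('--path',
                    type=validate_path,
                    help='Path to the log desired to be analyzed')

with tempfile.TemporaryDirectory() as d:
    os.makedirs(join(d, "redpanda", "controller"))
    args = parser.parse_args(['--path', d])
    print(args.path == d)  # True: validation passed
```

An invalid path would make parse_args print the ArgumentTypeError message and exit with a usage error, which is the tighter integration the comment describes.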

Review threads (resolved):
  • tools/offline_log_viewer/storage.py
  • tools/offline_log_viewer/tx_coordinator.py
The fix is to write to the txn coordinator's log when a transaction starts; then, when a crash-induced re-election happens, the new txn coordinator has an opportunity to detect the ongoing txn and fail it.
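The idea of the fix can be sketched as follows. This is a heavily simplified model, not Redpanda's actual implementation: the durable log is a plain list, recovery is a full replay, and a recovering leader simply aborts every in-flight txn from the previous term.

```python
# Sketch of the fix: persist a begin record at txn start so a freshly
# elected coordinator can detect in-flight txns instead of forgetting them.

def recover_ongoing(log):
    """Replay the durable log to find txns begun but never finished."""
    ongoing = set()
    for op, txn in log:
        if op == "begin":
            ongoing.add(txn)
        elif op in ("commit", "abort"):
            ongoing.discard(txn)
    return ongoing

class Coordinator:
    def __init__(self, log):
        self.log = log  # durable, replicated log shared across terms
        # With begin records on disk, the new leader sees txns from the
        # previous term and fails them rather than silently starting fresh.
        self.aborted = recover_ongoing(log)
        for txn in self.aborted:
            self.log.append(("abort", txn))

    def begin(self, txn):
        self.log.append(("begin", txn))  # the fix: write at txn start

log = []
old_leader = Coordinator(log)
old_leader.begin("tx1")        # the client's first add_partition begins tx1
new_leader = Coordinator(log)  # re-election: state is rebuilt from the log
print(sorted(new_leader.aborted))  # ['tx1'] — the stale txn is failed
```

Because the stale txn is now explicitly aborted, the client's eventual commit fails and it learns the transaction was lost, instead of the silent data loss in the scenario above.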
The checks didn't handle empty dirs well.
@bharathv (Contributor) left a comment: new changes lgtm.

@dotnwat (Member) left a comment: 🎾 🎾 🎾 🎾 🎾 🎾 🎾 🎾 🎾

@rystsov (Contributor, Author) commented on Aug 20, 2022

SIPartitionMovementTest.test_shadow_indexing - #4702

@rystsov rystsov merged commit b9834fb into redpanda-data:dev Aug 20, 2022
This pull request was closed.

Successfully merging this pull request may close these issues.

Consistency issue: a confirmed written record is lost
3 participants