Raft flush fixes #2937

mmaslankaprv · 2021-11-11T10:33:14Z

Cover letter

Fixed tracking of the flushed offset in the event of log truncation (both prefix and suffix). Fixed the check for whether a configuration update is taking place when transferring leadership.

Fixes: #2932

Release notes

When truncating the log, we need to update the raft flushed offset to reflect the truncated log state.

Signed-off-by: Michal Maslanka <michal@vectorized.io>

jcsp · 2021-11-11T12:08:26Z

These changes look good to me, although I think we're still going to have a very far-behind _flush_offset on followers when doing acks=1 writes, right? That is harmless as long as _commit_offset on the leader is only essential to these types of situation, but if we are e.g. exposing _commit_offset as a statistic, it's going to look pretty weird to human eyes.

I think it would make sense to also auto-flush followers when they are more than N bytes behind the head of the log, even if it's not strictly necessary for the raft code. But that could be a separate PR, as I'd like to merge this one promptly to resolve the failing test.
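To make the "more than N bytes behind" idea concrete, here is a minimal sketch of a size-based flush trigger for a follower. This is not code from this PR or from Redpanda; `follower_flush_policy`, `on_append`, `on_flush` and the threshold are hypothetical names chosen for illustration only.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical helper, not part of Redpanda: counts bytes appended since
// the last flush and reports when a follower should flush on its own.
struct follower_flush_policy {
    std::size_t max_unflushed_bytes; // the "N" from the comment above
    std::size_t unflushed_bytes = 0;

    explicit follower_flush_policy(std::size_t limit)
      : max_unflushed_bytes(limit) {
        assert(limit > 0);
    }

    // Record an append; returns true once more than N bytes are unflushed.
    bool on_append(std::size_t bytes) {
        unflushed_bytes += bytes;
        return unflushed_bytes > max_unflushed_bytes;
    }

    // Call after the log has actually been flushed to disk.
    void on_flush() { unflushed_bytes = 0; }
};
```

A follower would call `on_append` for every batch it writes and issue a flush followed by `on_flush` whenever it returns true. A time-based (periodic) trigger could be combined with this, but that is outside this sketch.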

jcsp

Let's wait for one more +1 on this before merging, since it's such a fiddly area.

mmaslankaprv · 2021-11-11T14:02:55Z

> These changes look good to me, although I think we're still going to have a very far-behind _flush_offset on followers when doing acks=1 writes, right? That is harmless as long as _commit_offset on the leader is only essential to these types of situation, but if we are e.g. exposing _commit_offset as a statistic, it's going to look pretty weird to human eyes.
>
> I think it would make sense to also auto-flush followers when they are more than N bytes behind the head of the log, even if it's not strictly necessary for the raft code. But that could be a separate PR, as I'd like to merge this one promptly to resolve the failing test.

I agree that we should periodically flush on followers. The committed_index is never exposed to the Kafka API, so this should not be misleading to users. I know @rystsov is going to work on periodic flushes in raft.

src/v/raft/consensus.cc

Signed-off-by: Michal Maslanka <michal@vectorized.io>

The Redpanda raft implementation supports relaxed consistency. With relaxed consistency we do not always update `_commit_index`, even though entries were replicated to all the nodes. When performing a leadership transfer we need to take that into account. When the configuration is successfully replicated to a majority of nodes and it is not in the joint consensus state, we are safe to proceed with the transfer.

Signed-off-by: Michal Maslanka <michal@vectorized.io>
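The condition in this commit message can be sketched as below. This is an illustration under stated assumptions, not the code in `consensus.cc`: `config_state`, `majority_replicated_offset` and `can_transfer_leadership` are hypothetical names, and the per-node replicated offsets are passed in as a plain vector rather than read from follower state.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <functional>
#include <vector>

using offset_t = std::int64_t; // hypothetical stand-in for a log offset

enum class config_state { simple, joint };

// Highest offset that a majority of nodes (leader included) has replicated.
// After sorting in descending order, the element at index n/2 is matched
// or exceeded by n/2 + 1 nodes, which is a majority.
offset_t majority_replicated_offset(std::vector<offset_t> match_offsets) {
    assert(!match_offsets.empty());
    std::sort(
      match_offsets.begin(), match_offsets.end(), std::greater<offset_t>());
    return match_offsets[match_offsets.size() / 2];
}

// Transfer is considered safe when the latest configuration is not in the
// joint consensus state and a majority has replicated it. The commit index
// is deliberately not consulted, because with relaxed consistency it may
// lag behind what was actually replicated.
bool can_transfer_leadership(
  config_state state,
  offset_t config_offset,
  const std::vector<offset_t>& match_offsets) {
    if (state == config_state::joint) {
        return false;
    }
    return majority_replicated_offset(match_offsets) >= config_offset;
}
```

For example, with the configuration at offset 10 and replicated offsets 12, 10 and 3, two of three nodes hold the configuration, so the check passes.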

rystsov

LGTM

Backport of #2856, #1576, #2917, #2901, #2937

r/consensus: update flushed offset when truncating log

4d87512

When truncating the log, we need to update the raft flushed offset to reflect the truncated log state.

Signed-off-by: Michal Maslanka <michal@vectorized.io>
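A rough sketch of what "reflect the truncated log state" can mean for the two truncation directions. The names `flush_tracker`, `on_suffix_truncate` and `on_prefix_truncate` are hypothetical, and the exact semantics (suffix truncation removes everything at or after the given offset, prefix truncation removes everything up to and including the snapshot index) are assumptions made for this example rather than a copy of the change in this PR.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>

using offset_t = std::int64_t; // hypothetical stand-in for a log offset

struct flush_tracker {
    offset_t flushed_offset = -1; // -1: nothing flushed yet

    // Suffix truncation (assumed to drop every entry at or after
    // truncate_at): nothing at or beyond that point can still count as
    // flushed, so the flushed offset is clamped down.
    void on_suffix_truncate(offset_t truncate_at) {
        assert(truncate_at >= 0);
        flushed_offset = std::min(flushed_offset, truncate_at - 1);
    }

    // Prefix truncation (assumed to drop entries up to and including
    // last_snapshot_index): the snapshot covers that range, so the flushed
    // offset should be at least the snapshot index.
    void on_prefix_truncate(offset_t last_snapshot_index) {
        flushed_offset = std::max(flushed_offset, last_snapshot_index);
    }
};
```

Both updates are idempotent with respect to an already consistent offset: truncating above the current flushed offset, or prefix-truncating below it, leaves it unchanged.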

github-actions bot added the area/redpanda label Nov 11, 2021

mmaslankaprv force-pushed the raft-flush-fixes branch 2 times, most recently from 5c37af5 to b1f2142 Compare November 11, 2021 11:04

mmaslankaprv marked this pull request as ready for review November 11, 2021 11:06

mmaslankaprv requested review from jcsp, rystsov, VadimPlh and ztlpn as code owners November 11, 2021 11:06

jcsp previously approved these changes Nov 11, 2021

ztlpn reviewed Nov 11, 2021

mmaslankaprv dismissed jcsp’s stale review via b4a6b1b November 11, 2021 15:13

mmaslankaprv force-pushed the raft-flush-fixes branch from b1f2142 to b4a6b1b Compare November 11, 2021 15:13

mmaslankaprv requested review from ztlpn and jcsp November 11, 2021 15:28

ztlpn reviewed Nov 11, 2021

ztlpn previously approved these changes Nov 11, 2021

mmaslankaprv added 2 commits November 11, 2021 17:11

r/consensus: updated flushed offset when hydrating a snapshot

535b995

Signed-off-by: Michal Maslanka <michal@vectorized.io>

mmaslankaprv dismissed ztlpn’s stale review via 42d3d54 November 11, 2021 16:11

mmaslankaprv force-pushed the raft-flush-fixes branch from b4a6b1b to 42d3d54 Compare November 11, 2021 16:11

mmaslankaprv requested a review from ztlpn November 11, 2021 16:11

ztlpn approved these changes Nov 11, 2021

This was referenced Nov 11, 2021

Backport of #2856, #1576, #2917, #2901, #2937 #2929

Merged

serde: introduce checksum_envelope #2722

Merged

rystsov approved these changes Nov 11, 2021

mmaslankaprv merged commit ab330cb into redpanda-data:dev Nov 11, 2021

dotnwat added a commit that referenced this pull request Nov 11, 2021

Merge pull request #2929 from mmaslankaprv/v21.10.x

e7b6714

Backport of #2856, #1576, #2917, #2901, #2937

dotnwat mentioned this pull request Nov 11, 2021

Failure in prefix_truncate_recovery_test #2460

Closed

This pull request was closed.
