
Cache vertical shards in query frontend #5648

Merged
3 commits merged on Sep 4, 2022

Conversation

fpetkovski
Contributor

The vertical sharding middleware is currently executed after the
caching middleware. Because of this, individual vertical shards are
not cached when they succeed. Caching is only done when the entire
request, including all shards, completes successfully.

This commit moves the vertical sharding middleware before the caching
middleware. It also modifies caching keys to contain the total shards
and the shard number so that each vertical shard gets an independent
caching key.
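The cache-key change described above can be sketched as follows. This is a minimal illustration, not the actual Thanos query-frontend code; the function name `generateCacheKey` and its parameters are assumptions made for the example:

```go
package main

import "fmt"

// generateCacheKey sketches how a per-shard cache key could be built.
// Without shard information, all vertical shards of a query would map
// to the same key; appending the shard index and total shard count
// gives each vertical shard an independent cache entry.
func generateCacheKey(userID string, start, step int64, shardIndex, totalShards int) string {
	// Base key: tenant, aligned start interval, and step.
	key := fmt.Sprintf("%s:%d:%d", userID, start/step, step)
	if totalShards > 0 {
		// Vertical sharding is enabled: make the key shard-specific.
		key = fmt.Sprintf("%s:%d:%d", key, shardIndex, totalShards)
	}
	return key
}

func main() {
	fmt.Println(generateCacheKey("tenant-a", 1000, 10, 2, 4))
	fmt.Println(generateCacheKey("tenant-a", 1000, 10, 0, 0))
}
```

With this shape, a cached result for shard 2 of 4 can be reused even if another shard of the same query fails.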

  • I added a CHANGELOG entry for this change.
  • Change is not relevant to the end user.

Changes

Enable caching in query frontend for individual vertical shards.

Verification

Added unit tests and tested locally by purposefully causing failures in queriers.


// Allow other requests to execute
lock.Unlock()
<-time.After(200 * time.Millisecond)
Contributor Author

The query frontend fanout seems to cancel all requests as soon as one request fails. Because of that, it is hard to provoke partial failures, since the first failure will cause all parallel requests to fail.

This line adds a delay to failures so that successful requests can complete, but it can still lead to flakes. Not sure if there is a better option to test this scenario.

Member


Mhm, maybe we could check if numFailures == 0 { waitForAnotherRequestToCome(); } somehow? Perhaps a shared sync.Cond could be used?
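A minimal sketch of the suggested sync.Cond coordination, under the assumption that the failing request should block until another request has arrived. The names here (coordinator, arrive, waitForAnother) are illustrative, not the identifiers used in the actual test:

```go
package main

import (
	"fmt"
	"sync"
)

// coordinator uses a shared sync.Cond so a goroutine that is about to
// fail can wait until at least one other request has been observed,
// instead of sleeping for a fixed 200ms and hoping for the best.
type coordinator struct {
	mu           sync.Mutex
	cond         *sync.Cond
	requestsSeen int
}

func newCoordinator() *coordinator {
	c := &coordinator{}
	c.cond = sync.NewCond(&c.mu)
	return c
}

// arrive records one incoming request and wakes any waiting goroutines.
func (c *coordinator) arrive() {
	c.mu.Lock()
	c.requestsSeen++
	c.mu.Unlock()
	c.cond.Broadcast()
}

// waitForAnother blocks until more than n requests have been seen.
// The condition is re-checked in a loop, as sync.Cond.Wait requires.
func (c *coordinator) waitForAnother(n int) {
	c.mu.Lock()
	defer c.mu.Unlock()
	for c.requestsSeen <= n {
		c.cond.Wait()
	}
}

func main() {
	c := newCoordinator()
	var wg sync.WaitGroup
	wg.Add(2)
	go func() { // the "failing" request waits for company before returning
		defer wg.Done()
		c.arrive()
		c.waitForAnother(1)
		fmt.Println("failure released after another request arrived")
	}()
	go func() { // a "successful" request simply arrives
		defer wg.Done()
		c.arrive()
	}()
	wg.Wait()
}
```

Unlike a fixed delay, this ordering guarantee does not depend on scheduler timing, which is what removes the flakiness.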

Contributor Author


This is a neat trick, seems to be a much better solution 👍

Contributor Author


Hm, looks like it's not possible to add this coordination here because we need to release other threads when the sharding middleware has seen a successful response. If we release them here, failed requests can still complete before successful ones.

yeya24 previously approved these changes Aug 26, 2022
Contributor

@yeya24 left a comment


Love this change. Thanks!


testutil.Equals(t, tc.expected, *res)

//if *res > tc.expected {
Member


Can we remove this? :P

Contributor Author


Thanks, removed



Contributor

@yeya24 left a comment


Looks good!

@GiedriusS GiedriusS merged commit a947f33 into thanos-io:main Sep 4, 2022
prajain12 pushed a commit to prajain12/thanos that referenced this pull request Sep 6, 2022

* Cache vertical shards in query frontend
* Adjust cache key tests
* Remove source of flakiness by using sync.Cond

Signed-off-by: Filip Petkovski <filip.petkovsky@gmail.com>
Signed-off-by: Prakul Jain <prakul.jain@udaan.com>