Distributor: Use pooled buffers for reading/decompressing request body #6836

aknuds1 · 2023-12-06T10:43:51Z

What this PR does

The distributor currently uses pooled buffers for decompressing snappy compressed remote write requests. However, it doesn't use pooled buffers when reading/gzip decompressing (OTLP requests can be gzip compressed) request bodies. In this PR I propose to use buffer pooling also for reading uncompressed/compressed write request bodies, which should benefit OTLP requests as they're not snappy compressed.

Benchmarks

Benchmarking shows some speedup and reduction in memory usage for OTLP. I'm currently testing these changes in a dev cell, with no problems detected.

BenchmarkPushHandler:

✗ benchstat push.txt push-buffer.txt
goos: linux
goarch: amd64
pkg: github.com/grafana/mimir/pkg/distributor
cpu: AMD Ryzen 9 3950X 16-Core Processor            
               │  push.txt   │          push-buffer.txt           │
               │   sec/op    │   sec/op     vs base               │
PushHandler-32   2.926µ ± 2%   2.976µ ± 2%  +1.73% (p=0.007 n=10)

               │  push.txt  │       push-buffer.txt        │
               │    B/op    │    B/op     vs base          │
PushHandler-32   638.5 ± 1%   636.0 ± 1%  ~ (p=0.323 n=10)

               │  push.txt  │        push-buffer.txt         │
               │ allocs/op  │ allocs/op   vs base            │
PushHandler-32   16.00 ± 0%   16.00 ± 0%  ~ (p=1.000 n=10) ¹
¹ all samples are equal

BenchmarkOTLPHandler:

✗ benchstat otlp.txt otlp-buffer.txt
goos: linux
goarch: amd64
pkg: github.com/grafana/mimir/pkg/distributor
cpu: AMD Ryzen 9 3950X 16-Core Processor            
                        │   otlp.txt   │          otlp-buffer.txt          │
                        │    sec/op    │   sec/op     vs base              │
OTLPHandler/protobuf-32   1017.5µ ± 1%   991.8µ ± 9%  -2.52% (p=0.002 n=6)
OTLPHandler/JSON-32        1.409m ± 2%   1.386m ± 5%  -1.65% (p=0.015 n=6)
geomean                    1.198m        1.173m       -2.09%

                        │   otlp.txt   │           otlp-buffer.txt           │
                        │     B/op     │     B/op      vs base               │
OTLPHandler/protobuf-32   471.5Ki ± 0%   450.1Ki ± 0%   -4.53% (p=0.002 n=6)
OTLPHandler/JSON-32       553.9Ki ± 0%   497.6Ki ± 0%  -10.15% (p=0.002 n=6)
geomean                   511.0Ki        473.3Ki        -7.38%

                        │  otlp.txt   │          otlp-buffer.txt          │
                        │  allocs/op  │  allocs/op   vs base              │
OTLPHandler/protobuf-32   7.136k ± 0%   7.135k ± 0%  -0.01% (p=0.002 n=6)
OTLPHandler/JSON-32       10.17k ± 0%   10.17k ± 0%  -0.01% (p=0.002 n=6)
geomean                   8.519k        8.518k       -0.01%

Which issue(s) this PR fixes or relates to

Checklist

Tests updated.
Documentation added.
CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX].
about-versioning.md updated with experimental features.

Signed-off-by: Arve Knudsen <arve.knudsen@gmail.com>

pstibrany

Nice. I'm not sure if tying the two buffers (body / decoded) into single struct is a good idea, and I'm wondering if we should use a struct that would track all buffers used by request. Something like:

type requestBuffers struct {
	p       *sync.Pool
	buffersFromThePool []*bytes.Buffer
}

func (rb *requestBuffers) getBuffer() *bytes.Buffer {
	b := rb.p.Get().(*bytes.Buffer)
	rb.buffersFromThePool = append(rb.buffersFromThePool, b)
	return b
}

func (rb *requestBuffers) cleanup() {
	for _, b := range rb.buffersFromThePool {
		rb.p.Put(b)
	}
	rb.buffersFromThePool = rb.buffersFromThePool[:0]
}

Then we could call cleanup to return all buffers.

pkg/util/http.go

…-pool

Signed-off-by: Arve Knudsen <arve.knudsen@gmail.com>

aknuds1 · 2023-12-11T13:56:32Z

Thanks for the advice @pstibrany - I think I agree it could be beneficial to avoid coupling the two buffers together in the pool, so I implemented your suggested design instead (type util.RequestBuffers). PTAL.

The revised design fares slightly worse in the distributor push benchmark, with one extra allocation, but I guess it makes little practical difference?

✗ benchstat push.txt push-request.txt
goos: linux
goarch: amd64
pkg: github.com/grafana/mimir/pkg/distributor
cpu: AMD Ryzen 9 3950X 16-Core Processor            
               │  push.txt   │       push-request.txt        │
               │   sec/op    │   sec/op     vs base          │
PushHandler-32   27.09µ ± 2%   27.49µ ± 3%  ~ (p=0.247 n=10)

               │  push.txt  │          push-request.txt          │
               │    B/op    │    B/op     vs base                │
PushHandler-32   633.5 ± 1%   744.5 ± 1%  +17.52% (p=0.000 n=10)

               │  push.txt  │         push-request.txt          │
               │ allocs/op  │ allocs/op   vs base               │
PushHandler-32   16.00 ± 0%   17.00 ± 0%  +6.25% (p=0.000 n=10)

Signed-off-by: Arve Knudsen <arve.knudsen@gmail.com>

pstibrany

Thank you, changes make sense to me. I left some non-blocking suggestions.

pkg/util/http.go

Co-authored-by: Peter Štibraný <pstibrany@gmail.com>

Signed-off-by: Arve Knudsen <arve.knudsen@gmail.com>

pkg/util/http.go

aknuds1 requested a review from pstibrany December 6, 2023 10:43

aknuds1 requested a review from a team as a code owner December 6, 2023 10:43

aknuds1 added enhancement New feature or request component/distributor labels Dec 6, 2023

aknuds1 force-pushed the arve/parseprotoreader-pool branch 2 times, most recently from 82e8ec6 to fa84566 Compare December 6, 2023 15:38

Distributor: Optimize by reading/gzip decompressing using pooled buffers

9aaab58

Signed-off-by: Arve Knudsen <arve.knudsen@gmail.com>

aknuds1 force-pushed the arve/parseprotoreader-pool branch from fa84566 to 9aaab58 Compare December 6, 2023 15:38

pstibrany reviewed Dec 7, 2023

View reviewed changes

pkg/util/http.go Outdated Show resolved Hide resolved

pkg/util/http.go Outdated Show resolved Hide resolved

pkg/util/http.go Outdated Show resolved Hide resolved

pkg/util/http.go Outdated Show resolved Hide resolved

aknuds1 added 4 commits December 11, 2023 09:57

Merge remote-tracking branch 'origin/main' into arve/parseprotoreader…

f997eaf

…-pool

Merge remote-tracking branch 'origin/main' into arve/parseprotoreader…

fbca63a

…-pool

Implement review feedback

161d63b

Signed-off-by: Arve Knudsen <arve.knudsen@gmail.com>

Implement RequestBuffers

1aee3a6

Signed-off-by: Arve Knudsen <arve.knudsen@gmail.com>

aknuds1 requested a review from pstibrany December 11, 2023 13:56

Fix trafficdump

1b1e05a

Signed-off-by: Arve Knudsen <arve.knudsen@gmail.com>

pstibrany approved these changes Dec 11, 2023

View reviewed changes

pkg/util/http.go Show resolved Hide resolved

pkg/util/http.go Outdated Show resolved Hide resolved

pkg/util/http.go Outdated Show resolved Hide resolved

pkg/util/http.go Outdated Show resolved Hide resolved

aknuds1 and others added 4 commits December 11, 2023 16:07

Update pkg/util/http.go

9dafb4e

Co-authored-by: Peter Štibraný <pstibrany@gmail.com>

Make sure size is non-negative

42eb972

Signed-off-by: Arve Knudsen <arve.knudsen@gmail.com>

Rename variables for clarity

a0d8b4a

Signed-off-by: Arve Knudsen <arve.knudsen@gmail.com>

Rename Buffer => Get

7116e17

Signed-off-by: Arve Knudsen <arve.knudsen@gmail.com>

aknuds1 merged commit 50385f3 into main Dec 11, 2023
28 checks passed

aknuds1 deleted the arve/parseprotoreader-pool branch December 11, 2023 15:27

colega reviewed Dec 11, 2023

View reviewed changes

pkg/util/http.go Show resolved Hide resolved

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Distributor: Use pooled buffers for reading/decompressing request body #6836

Distributor: Use pooled buffers for reading/decompressing request body #6836

aknuds1 commented Dec 6, 2023 •

edited

Loading

pstibrany left a comment

aknuds1 commented Dec 11, 2023 •

edited

Loading

pstibrany left a comment

Distributor: Use pooled buffers for reading/decompressing request body #6836

Distributor: Use pooled buffers for reading/decompressing request body #6836

Conversation

aknuds1 commented Dec 6, 2023 • edited Loading

What this PR does

Benchmarks

Which issue(s) this PR fixes or relates to

Checklist

pstibrany left a comment

Choose a reason for hiding this comment

aknuds1 commented Dec 11, 2023 • edited Loading

pstibrany left a comment

Choose a reason for hiding this comment

aknuds1 commented Dec 6, 2023 •

edited

Loading

aknuds1 commented Dec 11, 2023 •

edited

Loading