
receive.handler pooling improvements #299

Open
gavins-db wants to merge 21 commits into databricks:db_main from gavins-db:gs/pooling-improvements

Conversation

gavins-db commented Feb 19, 2026

Changes

This PR makes three primary changes to the existing pooling in the receive handler that should improve memory usage and performance. Those changes are detailed below. In addition, I added a syncutil.Pool utility that is a generic wrapper for pooling any type. This is a pattern I've found extremely useful in past lives to reduce boilerplate, improve readability, and make sure we're actually using pools correctly. I also added flags to configure the pooling behavior (enable, max capacity, etc.).

  1. The first change removes the copyBufPool pool, which was used as an intermediate buffer for compressed request bodies. This intermediate buffer was unnecessary: the destination buffer (a bytes.Buffer) has a ReadFrom method that can copy the contents of the source reader more efficiently using its internal byte slice.

    Note: This means that when no content-length header is provided, we no longer validate rate limiting at the intermediate buffer level, but rather at the final buffer level. This is OKAY. We have reduced the overall heap size by removing this unnecessary intermediate buffer. And this pattern ensures we fully drain the request body before returning a response, which can help avoid the TCP connection resets that occur if more than 256KB of data remains in the request body (see net/http/server.go#1089). There may actually be other places where we should discard remaining r.Body data before returning a response as well, but that was not my focus in this PR.

  2. The second change fixes an issue that was likely rendering the uncompressed slice pool ineffective due to how the s2.Decode function works internally. Specifically, s2.Decode will allocate a new slice if the provided dst byte slice does not have enough capacity to hold the decoded data. This means that any uncompressed body larger than 128KB (the initial capacity of the pooled byte slice) would trigger a fresh allocation that is then thrown to GC. Now, we ensure the buffer has at least the capacity of the decoded data before passing it to s2.Decode, so the pooled buffer is guaranteed to be large enough in all cases.

  3. The third change removes the writeRequestPool, which relied on calling .Reset() on a proto object. This is not a good idea, as it replaces the reference with a new struct, so we would effectively be pooling only a pointer. I hope we'll be able to add pooling back here in a future PR.

Testing

  • Tests were added for the syncutil.Pool implementation.
  • go test -race -v -count=10 -timeout 15m ./pkg/receive/.... I'm relying on existing tests for the receive handler, but I verified that the tests break if the pools are improperly configured by temporarily skipping reset steps.

Comment on lines 97 to 99
defaultCompressedBufCap = 32 * 1024
maxPooledCompressedCap = 1 << 20 // 1MB
maxPooledDecompressedCap = 4 << 20 // 4MB
I'm curious about these numbers: where did they come from originally? i.e., do we have empirical data / histograms on incoming request sizes? Do we know what contributes to such large request sizes?


// Default / max capacities for pooled buffers. These caps prevent "pool ballooning"
// where a single large request permanently inflates process RSS.
defaultCompressedBufCap = 32 * 1024

note: I removed this defaultCompressedBufCap variable for 3 reasons:

  1. We don't want to over-allocate from the beginning. As buffers in the pool are reused, they will grow to reach the steady state of a typical request.
  2. I wasn't able to find any empirical evidence that 32KB actually is an expected average size.
  3. When pooling is deactivated, I want it to operate as close to the previous implementation as possible (before there was any pooling). The previous implementation started with a default size of a) content-length (we still do this a la buffer.Grow(content-length)), or b) 512B if no content-length header is set (which is actually what the buffer will do internally).

@yuchen-db yuchen-db self-requested a review February 23, 2026 20:19