
Add a limit on max remote write request samples #6935

Open
aknuds1 opened this issue Dec 14, 2023 · 5 comments
Comments

aknuds1 (Contributor) commented Dec 14, 2023

Is your feature request related to a problem? Please describe.

We need a limit on the maximum number of samples per remote write request (including OTLP). The motivation is that OTLP write requests are typically batched, and could contain so many samples that Mimir takes too long to process them.

Describe the solution you'd like

A limit on the number of samples a remote write request is permitted to contain. When the limit is hit, the request should be rejected with HTTP 413. A suggested default is 10k.
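As a minimal sketch of the proposed behaviour (names and structure here are hypothetical, not Mimir's actual implementation): count the samples in the decoded request and reject with HTTP 413 when a configured limit is exceeded.

```go
package main

import (
	"fmt"
	"net/http"
	"net/http/httptest"
)

// maxRequestSamples is a hypothetical per-request limit; the issue
// suggests 10k as a default.
const maxRequestSamples = 10000

// exceedsSampleLimit reports whether a write request carrying n samples
// exceeds the limit. A limit of 0 disables the check.
func exceedsSampleLimit(n, limit int) bool {
	return limit > 0 && n > limit
}

// rejectIfTooLarge writes the HTTP 413 response the issue proposes and
// reports whether the request was rejected.
func rejectIfTooLarge(w http.ResponseWriter, n, limit int) bool {
	if !exceedsSampleLimit(n, limit) {
		return false
	}
	http.Error(w,
		fmt.Sprintf("write request contains %d samples, exceeding the limit of %d", n, limit),
		http.StatusRequestEntityTooLarge) // 413
	return true
}

func main() {
	rec := httptest.NewRecorder()
	rejected := rejectIfTooLarge(rec, 15000, maxRequestSamples)
	fmt.Println(rejected, rec.Code) // true 413
}
```

A limit of 0 meaning "disabled" follows the convention of Mimir's other per-tenant limits; the check would run after the request body is decoded, so it complements (rather than replaces) a byte-size limit.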

Describe alternatives you've considered

Additional context

See https://github.com/grafana/mimir-squad/issues/2180, which mentions the following:

Another idea was to validate normal distributor limits prior to translating OTLP to Mimir requests, but this is much more difficult since the validation middlewares currently expect a normal Mimir write request.

aknuds1 changed the title from "Add a limit on max remote writte request samples" to "Add a limit on max remote write request samples" Dec 14, 2023
ying-jeanne (Contributor) commented Jan 10, 2024

Before enabling the 10k default value, we intend to introduce a new metric, potentially in the form of a histogram, to monitor the number of samples per request, so as to give users time to adjust their collector configuration. This approach ensures that no requests are rejected due to the updated default value, preventing potential data loss.
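A stdlib-only sketch of the kind of distribution such a metric would capture (all names are hypothetical; in practice this would be a Prometheus histogram registered through Mimir's existing instrumentation):

```go
package main

import "fmt"

// exponentialBuckets mirrors the shape of Prometheus' exponential bucket
// helper: count buckets starting at start, each factor times the previous.
func exponentialBuckets(start, factor float64, count int) []float64 {
	buckets := make([]float64, count)
	for i := range buckets {
		buckets[i] = start
		start *= factor
	}
	return buckets
}

// observe increments the cumulative bucket counters for one request's
// sample count, the way a histogram metric would.
func observe(counts []int, buckets []float64, samples float64) {
	for i, ub := range buckets {
		if samples <= ub {
			counts[i]++
		}
	}
}

func main() {
	// Buckets from 100 up to 51200 samples per request; the proposed
	// 10k default would fall between the 6400 and 12800 bounds.
	buckets := exponentialBuckets(100, 2, 10)
	counts := make([]int, len(buckets))
	for _, n := range []float64{80, 950, 12000} {
		observe(counts, buckets, n)
	}
	fmt.Println(buckets[len(buckets)-1]) // 51200
	fmt.Println(counts)
}
```

Watching the upper buckets of such a histogram shows how many requests would have been rejected under a candidate limit, before the limit is actually enforced.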

ying-jeanne (Contributor) commented

Client-side histograms do not yield an accurate total sample count: a classic histogram actually produces one series per bucket plus two more (the _sum and _count series), but the client side counts it as a single sample. This makes it challenging to configure batch sizes appropriately on the client side.
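The arithmetic behind this mismatch can be made concrete (function name is hypothetical): a classic histogram with n buckets contributes n + 2 samples per scrape, so a batch the client believes holds 1000 samples may actually carry far more.

```go
package main

import "fmt"

// classicHistogramSamples returns how many samples one classic histogram
// contributes per scrape: one per exposed bucket series, plus the
// _sum and _count series.
func classicHistogramSamples(numBuckets int) int {
	return numBuckets + 2
}

func main() {
	// A batch of 1000 histograms with 10 buckets each is 12000 samples,
	// already over a 10k per-request limit, even though the client may
	// count each histogram as a single sample.
	fmt.Println(1000 * classicHistogramSamples(10)) // 12000
}
```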

aknuds1 (Contributor, Author) commented Jan 23, 2024

According to Ying, these are classic histograms. It is uncertain whether the same behaviour holds for native histograms.

ying-jeanne (Contributor) commented

Related ticket: #8260

ying-jeanne (Contributor) commented

> Before enabling the 10k default value, we intend to introduce a new metric, potentially in the form of a histogram, to monitor the number of samples per request, so as to give users time to adjust their collector configuration.

This is now implemented.
