storage: Compaction index uses hardcoded 512kB memory limit #4645

jcsp · 2022-05-10T12:07:52Z

For topics with compaction enabled, the compaction writer's inner spill_key_index may use up to 512kB of memory for its unflushed data. This is initialized as

                make_file_backed_compacted_index(
                  path.string(),
                  writer,
                  iopc,
                  segment_appender::write_behind_memory / 2));

(segment_appender::write_behind_memory is 1 MiB.)

512kB is too big when we have huge partition counts.

It is ameliorated somewhat by the relatively smaller number of events per partition when the partition count is huge, but for tiny messages we can still end up using the whole 512kB. Consider a writer producing tiny events with a 64 byte key and 64 byte value, across 100k partitions and 2TB of storage, where the keys do not repeat often. That's 20MB of storage per partition, or 156250 events per partition before they run out of disk space. Multiply by the 64 byte key size and you get about 10MB of keys per partition, far more than the 512kB limit, so every index can fill its buffer.
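The arithmetic in this example can be checked with a short compile-time sketch. It assumes decimal units (2TB = 2 x 10^12 bytes), ignores any per-record overhead, and takes the per-index limit to be half of a 1 MiB buffer. The constant names are illustrative and do not come from the Redpanda source.

```cpp
#include <cstdint>

// Figures from the example above (decimal units, no per-record overhead).
constexpr std::uint64_t total_storage_bytes = 2000000000000ULL; // 2TB
constexpr std::uint64_t partition_count = 100000;               // 100k
constexpr std::uint64_t key_bytes = 64;
constexpr std::uint64_t value_bytes = 64;

// Assumed per-index limit: half of a 1 MiB write-behind buffer.
constexpr std::uint64_t index_limit_bytes = (1024 * 1024) / 2;

// Storage available to each partition if it is spread evenly.
constexpr std::uint64_t storage_per_partition
  = total_storage_bytes / partition_count;

// Events that fit in one partition before the disk is full.
constexpr std::uint64_t events_per_partition
  = storage_per_partition / (key_bytes + value_bytes);

// Key bytes per partition if (almost) no key repeats.
constexpr std::uint64_t key_bytes_per_partition
  = events_per_partition * key_bytes;

// Memory used if every partition's index fills its buffer.
constexpr std::uint64_t worst_case_total_bytes
  = partition_count * index_limit_bytes;

// The keys alone exceed the per-index limit, so the limit is reachable.
static_assert(
  key_bytes_per_partition > index_limit_bytes,
  "each index can fill its whole buffer");
```

Under these assumptions each partition holds about 10MB of keys, so the 512kB buffer is the binding limit, and 100k full buffers add up to roughly 52GB.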

So: to avoid using 50GB of memory for compaction indices on a node with 100k partitions, we need to do something smarter. This is a bit similar to #4600 in that the solution can be either an explicit config, or ideally something more dynamic. We could consider having a total per-shard memory limit for compaction index space, rather than a per-partition space limit.
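A per-shard limit of the kind proposed here could look roughly like the sketch below. It is only an illustration of the idea, not the change that was eventually merged: `shard_index_budget` and `toy_key_index` are hypothetical names, the "spill" is just a counter instead of a write to disk, and the budget has no locking on the assumption that everything on a shard runs on one thread.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <unordered_set>

// Hypothetical shard-wide budget shared by every compaction index on a shard.
// No locking: assumes all users run on the same thread.
class shard_index_budget {
public:
    explicit shard_index_budget(std::size_t total)
      : _total(total), _available(total) {}

    // Take n bytes from the budget, or return false if they are not free.
    bool try_reserve(std::size_t n) {
        if (n > _available) {
            return false;
        }
        _available -= n;
        return true;
    }

    void release(std::size_t n) {
        assert(n <= _total - _available); // never return more than was taken
        _available += n;
    }

    std::size_t available() const { return _available; }

private:
    std::size_t _total;
    std::size_t _available;
};

// Hypothetical stand-in for one partition's key index.
class toy_key_index {
public:
    explicit toy_key_index(shard_index_budget& budget) : _budget(budget) {}
    toy_key_index(const toy_key_index&) = delete;
    toy_key_index& operator=(const toy_key_index&) = delete;
    ~toy_key_index() { _budget.release(_reserved); }

    void add(const std::string& key) {
        if (_keys.count(key) != 0) {
            return; // already buffered
        }
        if (!_budget.try_reserve(key.size())) {
            spill(); // give back our own share first, then retry
            if (!_budget.try_reserve(key.size())) {
                ++_spilled; // other indices hold the budget: write through
                return;
            }
        }
        _reserved += key.size();
        _keys.insert(key);
    }

    // Pretend to flush buffered keys to disk and return their memory.
    void spill() {
        _spilled += _keys.size();
        _keys.clear();
        _budget.release(_reserved);
        _reserved = 0;
    }

    std::size_t buffered() const { return _keys.size(); }
    std::size_t spilled() const { return _spilled; }

private:
    shard_index_budget& _budget;
    std::unordered_set<std::string> _keys;
    std::size_t _reserved = 0;
    std::size_t _spilled = 0;
};
```

With this shape, total index memory on a shard is bounded by the budget no matter how many partitions exist; a busy index spills earlier when its neighbours are also busy, rather than each one holding a fixed 512kB.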


emaxerrno · 2022-05-10T14:45:49Z

@jcsp the per-shard shared memory makes a lot of sense. It is similar to what Noah did for the chunk appender. What if we remove the memory argument altogether, so that the memory just comes from a pool like you mentioned? Good ideas!

jcsp · 2022-07-26T12:45:00Z

This is broadly the same as #5389

Fixes redpanda-data#4645

jcsp added the kind/enhance and area/storage labels May 10, 2022

jcsp self-assigned this May 26, 2022

jcsp added a commit to jcsp/redpanda that referenced this issue Jul 29, 2022

storage: respect shard-wide memory limit in spill_key_index

6b9ca64

Fixes redpanda-data#4645

jcsp mentioned this issue Jul 29, 2022

storage: per-shard limit on memory for spill_key_index #5722

Merged

5 tasks

jcsp added a commit to jcsp/redpanda that referenced this issue Aug 2, 2022

storage: respect shard-wide memory limit in spill_key_index

abf7ed5

Fixes redpanda-data#4645

jcsp added a commit to jcsp/redpanda that referenced this issue Aug 3, 2022

storage: respect shard-wide memory limit in spill_key_index

78bd9b1

Fixes redpanda-data#4645

jcsp added a commit to jcsp/redpanda that referenced this issue Aug 3, 2022

storage: respect shard-wide memory limit in spill_key_index

99d1395

Fixes redpanda-data#4645

jcsp added a commit to jcsp/redpanda that referenced this issue Aug 4, 2022

storage: respect shard-wide memory limit in spill_key_index

e885773

Fixes redpanda-data#4645

jcsp added a commit to jcsp/redpanda that referenced this issue Aug 5, 2022

storage: respect shard-wide memory limit in spill_key_index

a60f1f2

Fixes redpanda-data#4645

dotnwat closed this as completed in #5722 Aug 5, 2022
