runtime: GC can thrash on very small heaps #22743

aclements · 2017-11-15T17:55:30Z

In applications with very small heaps and high allocation rates, the garbage collector can consume a large amount of CPU by running very frequently. This particularly affects microbenchmarks, which often have virtually no live heap, causing a full GC cycle every 4MB of allocation. We've seen this several times, and Cloudflare just blogged about this problem: https://blog.cloudflare.com/go-dont-collect-my-garbage/

We should consider rate-limiting the garbage collector, allowing the heap to grow beyond the goal if the GC is running too frequently or using too much CPU. We could potentially replace the (rather arbitrary) 4MB lower-bound on heap size with a more principled rate-limiting system.

One possible approach is to enforce an upper bound on GC utilization. This could be measured GC CPU overhead (e.g., MemStats.GCCPUFraction) or simply the ratio of GC active wall-clock time to total wall-clock time. The GC trigger would be delayed until this metric drops below the upper bound. This should prevent thrashing.

With this approach, we could also enforce a lower bound, which could be a more principled replacement for the current 2 minute forced GC. This should prevent starvation.

@RLH and I have talked about this several times, but apparently neither of us filed an issue to track it. I'm correcting that. :)

The text was updated successfully, but these errors were encountered:

aclements · 2018-12-18T19:00:43Z

The actual problem here is that we fail to amortize the cost of GC on small heaps. I've detailed the problem in #23044 (comment).

prattmic · 2022-07-29T21:08:57Z

@mknyszek do we want to do more here?

mknyszek · 2022-08-18T16:06:39Z

I don't think so. While the pacer doesn't really model some GC fixed costs and things can still get weird when these fixed costs dominate, the Go 1.18 pacer also pushed this problem back enough that I'm comfortable not doing anything else here for now and calling it good.

aclements added this to the Go1.11 milestone Nov 15, 2017

aclements changed the title ~~runtime: consider rate-limiting GC~~ runtime: GC can thrash on very small heaps Nov 15, 2017

bobrik mentioned this issue Dec 7, 2017

Performance regression between 0.5.0 and 0.11.0 prometheus/blackbox_exporter#270

Closed

bradfitz modified the milestones: Go1.11, Go1.12 Jun 20, 2018

aclements modified the milestones: Go1.12, Go1.13 Dec 18, 2018

aclements added the Performance label May 28, 2019

aclements modified the milestones: Go1.13, Go1.14 Jun 25, 2019

rsc modified the milestones: Go1.14, Backlog Oct 9, 2019

cespare mentioned this issue Mar 29, 2020

proposal: runtime: add a mechanism for specifying a minimum target heap size #23044

Closed

gopherbot added the compiler/runtime Issues related to the Go compiler and/or runtime. label Jul 7, 2022

mknyszek closed this as completed Aug 18, 2022

golang locked and limited conversation to collaborators Aug 18, 2023

gopherbot added the FrozenDueToAge label Aug 18, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

runtime: GC can thrash on very small heaps #22743

runtime: GC can thrash on very small heaps #22743

aclements commented Nov 15, 2017

aclements commented Dec 18, 2018

prattmic commented Jul 29, 2022

mknyszek commented Aug 18, 2022

runtime: GC can thrash on very small heaps #22743

runtime: GC can thrash on very small heaps #22743

Comments

aclements commented Nov 15, 2017

aclements commented Dec 18, 2018

prattmic commented Jul 29, 2022

mknyszek commented Aug 18, 2022