JIT LSRA Throughput: Short-circuit register selection #6705

CarolEidt · 2016-09-21T21:22:11Z

The loop over all the candidate registers in LinearScan::tryAllocateFreeReg() and in LinearScan::allocateBusyReg() could be short-circuited when a register is found that has the best possible score. Additionally, in the case of MinOpts, it could potentially short-circuit as soon as a suitable candidate is found, though one would want to weigh the throughput benefit against the code quality impact.
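As a rough illustration of the idea above, here is a minimal sketch of a selection loop that stops once a candidate reaches a known best possible score, with an optional "first suitable candidate wins" mode for the MinOpts case. The types `Candidate` and `Selection`, the function `selectRegister`, and the `anyWillDo` flag are hypothetical stand-ins, not the actual LinearScan code.

```cpp
#include <vector>

// Hypothetical stand-ins for illustration; these are not the JIT's types.
struct Candidate
{
    int      regNum; // physical register number
    unsigned score;  // higher is better
};

struct Selection
{
    int regNum;   // chosen register, or -1 if there were no candidates
    int examined; // how many candidates the loop looked at
};

// Pick the highest-scoring candidate, stopping early once a candidate
// reaches bestPossibleScore. With anyWillDo set (the MinOpts idea), the
// first candidate is accepted without looking further.
Selection selectRegister(const std::vector<Candidate>& candidates,
                         unsigned                      bestPossibleScore,
                         bool                          anyWillDo)
{
    Selection result{-1, 0};
    unsigned  bestScore = 0;
    for (const Candidate& c : candidates)
    {
        result.examined++;
        if (result.regNum == -1 || c.score > bestScore)
        {
            result.regNum = c.regNum;
            bestScore     = c.score;
        }
        if (anyWillDo || bestScore >= bestPossibleScore)
        {
            // Either no later candidate can score higher, or we have
            // decided that the first suitable register is good enough.
            break;
        }
    }
    return result;
}
```

The `examined` counter is only there to make the throughput effect visible; the code quality cost of `anyWillDo` is that a better register later in the list is never considered.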

category:throughput
theme:register-allocator
skill-level:expert
cost:medium

RussKeldorph · 2016-09-22T14:57:25Z

/cc @sivarv

pgavlin · 2017-06-20T16:59:00Z

The loop over all the candidate registers in LinearScan::tryAllocateFreeReg() and in LinearScan::allocateBusyReg() could be short-circuited when a register is found that has the best possible score

Outside of minopts, is it possible to know the best possible score for a particular attempt a priori?

CarolEidt · 2017-06-20T17:24:29Z

Outside of minopts, is it possible to know the best possible score for a particular attempt a priori?

So, a bit more description would have been warranted here, and mea culpa for that (this issue had been around for a very long time before migrating from the old work items, and I just copied most of the text from there). There are a couple of ways a register is evaluated as a candidate. The first is the score, and it is possible to know the best possible score based on the characteristics of the RefPosition itself, e.g. whether it has a relatedInterval. The second consideration, however, is comparing the location of the next use of the physical register (currently, since we don't do any pre-allocation, that's always a fixed reg reference). Determining the best value for that a priori doesn't seem very practical. The way I currently think about this "short-circuiting" is that for both tryAllocateFreeReg and allocateBusyReg, we should ensure that 1) the cheaper and higher value criteria are evaluated first, and 2) as soon as a candidate fails a criterion that would cause it to be scored lower than an existing candidate, the loop short-circuits. I think that, to accomplish this cleanly, those methods could use some cleaning up.
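One way to picture the two points above is the following sketch, which is an assumption-laden illustration rather than the LSRA code: criteria are listed from highest weight to lowest, and a candidate is abandoned as soon as even passing every remaining criterion could not beat the current best. The names `Criterion`, `Result`, and `pickBest` are hypothetical.

```cpp
#include <cstddef>
#include <functional>
#include <vector>

// Hypothetical criterion: a predicate over a register number plus the score
// weight it contributes. Criteria are listed from highest weight to lowest.
struct Criterion
{
    unsigned                 weight;
    std::function<bool(int)> test;
};

struct Result
{
    int regNum;      // best register, or -1 if there were none
    int evaluations; // number of criterion tests actually run
};

Result pickBest(const std::vector<int>& regs, const std::vector<Criterion>& criteria)
{
    // remaining[i] = total weight still obtainable from criteria i..end
    std::vector<unsigned> remaining(criteria.size() + 1, 0);
    for (std::size_t i = criteria.size(); i-- > 0;)
    {
        remaining[i] = remaining[i + 1] + criteria[i].weight;
    }

    Result   result{-1, 0};
    unsigned bestScore = 0;
    for (int reg : regs)
    {
        unsigned score     = 0;
        bool     abandoned = false;
        for (std::size_t i = 0; i < criteria.size(); i++)
        {
            // Even passing every remaining test cannot beat the incumbent.
            if (result.regNum != -1 && score + remaining[i] <= bestScore)
            {
                abandoned = true;
                break;
            }
            result.evaluations++;
            if (criteria[i].test(reg))
            {
                score += criteria[i].weight;
            }
        }
        if (!abandoned && (result.regNum == -1 || score > bestScore))
        {
            result.regNum = reg;
            bestScore     = score;
        }
    }
    return result;
}
```

In this sketch ties go to the earlier candidate, and the saving depends on candidate order: the sooner a strong candidate is seen, the more tests are skipped for the rest.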

It may be that we could maintain a list of available registers, sorted by nearest next reference. Then it would indeed be possible to short-circuit. That said, one would want to evaluate the throughput impact of the various options.
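A sketch of what such a sorted list might look like, under assumptions not stated in the thread: free registers are kept in an ordered set keyed by the location of their next reference, and the selection policy is a "best fit" (the register whose next reference is the nearest one strictly after the end of the interval, falling back to the farthest next reference if none qualifies). `Location`, `FreeRegSet`, and `takeBestFit` are hypothetical names.

```cpp
#include <iterator>
#include <limits>
#include <set>
#include <utility>

using Location = unsigned;
// (next reference location, register number), ordered by location first.
using FreeRegSet = std::set<std::pair<Location, int>>;

// Removes and returns the chosen register, or returns -1 if none is free.
int takeBestFit(FreeRegSet& freeRegs, Location intervalEnd)
{
    if (freeRegs.empty())
    {
        return -1;
    }
    // Find the first entry whose next reference is strictly after
    // intervalEnd; the ordered set makes this a logarithmic lookup
    // instead of a scan over every candidate.
    FreeRegSet::value_type key{intervalEnd, std::numeric_limits<int>::max()};
    auto                   it = freeRegs.upper_bound(key);
    if (it == freeRegs.end())
    {
        // No register is free for the whole interval: fall back to the
        // one whose next reference is farthest away.
        it = std::prev(it);
    }
    int reg = it->second;
    freeRegs.erase(it);
    return reg;
}
```

The lookup is cheap, but the set has to be updated whenever a register is freed, allocated, or its next reference changes, which is the kind of throughput trade-off that would need measuring.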

kunalspathak · 2021-04-09T00:07:39Z

We did a fair amount of refactoring in #45135 that short-circuits the other heuristics once we have found a single register candidate out of the set of registers.
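The general shape of that kind of short-circuit can be sketched as follows. This is not the code from #45135; it is a minimal illustration that assumes each heuristic can be expressed as a mask of the registers satisfying it, and the names `RegMask`, `Narrowing`, `isSingleRegister`, and `narrow` are hypothetical.

```cpp
#include <cstdint>
#include <vector>

using RegMask = std::uint32_t; // one bit per hypothetical register

struct Narrowing
{
    RegMask candidates;        // surviving candidate set
    int     heuristicsApplied; // how many heuristics ran before stopping
};

// True when exactly one bit is set in the mask.
static bool isSingleRegister(RegMask mask)
{
    return mask != 0 && (mask & (mask - 1)) == 0;
}

// Apply heuristics in priority order, each given as the mask of registers
// that satisfy it, and stop as soon as a single candidate remains.
Narrowing narrow(RegMask candidates, const std::vector<RegMask>& heuristics)
{
    Narrowing result{candidates, 0};
    for (RegMask preferred : heuristics)
    {
        if (isSingleRegister(result.candidates))
        {
            break; // one register left: later heuristics cannot change it
        }
        result.heuristicsApplied++;
        RegMask narrowed = result.candidates & preferred;
        if (narrowed != 0)
        {
            result.candidates = narrowed;
        }
        // A heuristic that would eliminate every candidate is ignored.
    }
    return result;
}
```

For example, starting from candidates `0xF` with heuristics `{0x6, 0x4, 0x8}`, the second heuristic leaves a single register (`0x4`), so the third is never applied.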

msftgits transferred this issue from dotnet/coreclr Jan 31, 2020

msftgits added this to the Future milestone Jan 31, 2020

CarolEidt mentioned this issue Oct 12, 2020

[LSRA][RyuJIT] Tune register selection heuristics #43318

Closed

CarolEidt modified the milestones: Future, 6.0.0 Oct 27, 2020

JulieLeeMSFT assigned kunalspathak Mar 23, 2021

JulieLeeMSFT added the needs-further-triage label Mar 23, 2021

kunalspathak closed this as completed Apr 9, 2021

ghost locked as resolved and limited conversation to collaborators May 9, 2021
