
AutoregressiveBisectionInverter bounds picked up as trainable parameters #173

Closed
mdmould opened this issue Sep 11, 2024 · 1 comment · Fixed by #174

Comments

@mdmould
Contributor

mdmould commented Sep 11, 2024

Because the lower and upper bounds for the inverter are arrays, they get picked up as trainable parameters when filtering with, e.g., is_inexact_array. I don't think this is desired behaviour in any case, because it would probably interact unexpectedly with the adaptive bounding in the bisection search. Usually this won't affect anything, because a flow that contains the inverter, e.g., BNAF, is typically trained without ever using the numerical inverse. The only reason I noticed was that I was counting the number of parameters I expected in the model.

I'm not sure if this is really an issue in practice. In any case, one can just wrap the inverter in a non_trainable or manually ignore the lower and upper "parameters" as necessary. But maybe there's a neater solution that keeps the bisection functions compatible with the scans and so on?

@danielward27
Owner

Good spot, thanks. You're absolutely right that they shouldn't be included; I'll get that fixed. Like you said, generally it shouldn't matter (I believe JAX will error if you try to differentiate through the numerical method), but one case where I believe it matters is if regularisation of parameters is used.
