Bug in Adam for non-real parameters #1051

Open
dvicini opened this issue Jan 25, 2024 · 3 comments
Labels
bug Something isn't working

Comments

@dvicini
Member

dvicini commented Jan 25, 2024

Hi,

There appears to be an edge case in the current Adam implementation. If the optimizer contains non-real parameters, e.g. quaternions or complex numbers, the second-moment (variance) estimate is not computed as one would expect.

The Adam implementation computes

v_t = self.beta_2 * v_tp + (1 - self.beta_2) * dr.sqr(g_p)

where g_p is the parameter gradient. If a parameter is a quaternion or complex number, its gradient is of the same type, so dr.sqr computes a quaternion or complex product rather than the element-wise square that the Adam update expects.
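For illustration, the following minimal snippet (assuming a Dr.Jit version that still provides dr.sqr and the scalar Complex2f/Array2f types) shows the discrepancy:

import drjit as dr
from drjit.scalar import Complex2f, Array2f

z = Complex2f(1.0, 2.0)

# Complex product: (1 + 2i)^2 = -3 + 4i
print(dr.sqr(z))

# Element-wise square of the two components, which is what Adam's
# second-moment update actually needs: [1, 4]
print(dr.sqr(Array2f(dr.real(z), dr.imag(z))))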

While there is some literature on this subject (a PyTorch GitHub issue, the PyTorch docs, https://arxiv.org/pdf/0906.4835.pdf), the easiest practical solution is to simply optimize a Vector2f or Vector4f instead.
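As a rough sketch of that workaround (the parameter name and surrounding setup are illustrative, assuming an AD-enabled variant), a quaternion could be exposed to the optimizer through its raw components:

import mitsuba as mi
mi.set_variant('llvm_ad_rgb')

opt = mi.ad.Adam(lr=0.05)

# Store the rotation as a plain 4D vector instead of a quaternion
q = mi.Quaternion4f(0, 0, 0, 1)  # identity rotation
opt['rotation'] = mi.Vector4f(q[0], q[1], q[2], q[3])

# Later, inside the optimization loop, rebuild the quaternion from the
# optimized components whenever it is needed (re-normalizing if the
# application requires a unit quaternion)
r = opt['rotation']
q = mi.Quaternion4f(r[0], r[1], r[2], r[3])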

Maybe Mitsuba should raise an error if a quaternion or complex value is assigned to the optimizer?

dvicini added the bug label Jan 25, 2024
@njroussel
Member

Hi @dvicini

I agree with you. My suggestion would be to check whether the type is special (dr.is_special_v()) and, if so, raise an error as you suggested.
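A minimal sketch of such a check (subclassing here purely for illustration; the error message and placement are not meant as the final implementation):

import drjit as dr
import mitsuba as mi
mi.set_variant('llvm_ad_rgb')

class Adam(mi.ad.Adam):
    def __setitem__(self, key, value):
        # Reject quaternion, complex, and matrix-valued parameters up front
        if dr.is_special_v(type(value)):
            raise TypeError(
                f'Optimizer parameter "{key}": special types (quaternion, '
                'complex, matrix) are not supported -- optimize a plain '
                'array such as Vector4f instead.')
        super().__setitem__(key, value)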

I'll leave this open for a few more days before pushing a fix, in case anyone else has some ideas to contribute :)

@wjakob
Member

wjakob commented Jan 29, 2024

In the next version, the same issue will also appear with camera matrices and, e.g., complex IOR values.

The optimizer will need an extra check like

tp = type(g_p)
if dr.is_special_v(tp):
    # Reinterpret the gradient as a plain array so that dr.sqr and the
    # arithmetic below act element-wise
    tp_a = dr.array_t(tp)
    g_p = tp_a(g_p)
    ...
v_t = self.beta_2 * v_tp + (1 - self.beta_2) * dr.sqr(g_p)

@wjakob
Member

wjakob commented Jan 29, 2024

We could either wait until the next version or backport the dr.array_t type trait.
