
[amp] dygraph amp support param_group #34899

Merged: 3 commits into PaddlePaddle:develop on Aug 16, 2021

Conversation

@zhiqiu (Contributor) commented Aug 14, 2021

PR types

New features

PR changes

Others

Describe

  • dygraph amp supports param_group:
import paddle

linear_1 = paddle.nn.Linear(10, 10)
linear_2 = paddle.nn.Linear(10, 10)
inp = paddle.uniform(shape=[10, 10], min=-0.1, max=0.1)
with paddle.amp.auto_cast():
    out = linear_1(inp)
    out = linear_2(out)
loss = paddle.mean(out)
sgd = paddle.optimizer.SGD(
    learning_rate=0.1,
    parameters=[{
        'params': linear_1.parameters()
    }, {
        'params': linear_2.parameters(),
        'weight_decay': 0.001,
        'learning_rate': 0.1
    }],
    weight_decay=0.01)
scaler = paddle.amp.GradScaler()
scaler.scale(loss).backward()  # scale the loss, then backpropagate
scaler.step(sgd)               # unscale gradients, then update parameters
sgd.clear_grad()
  • add a step method to class GradScaler, to be consistent with class Optimizer, since Optimizer.minimize() does not support param_group (see the comparison sketch below)
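For context, a hedged side-by-side of the two call patterns, reusing the names from the example above; scaler.minimize is the pre-existing GradScaler API (its signature is recalled from memory, not quoted from this PR):

# Either: the pre-existing pattern. GradScaler.minimize unscales and
# updates in one call, but it routes through Optimizer.minimize, which
# (per this PR's description) does not support param_group.
scaled = scaler.scale(loss)
scaled.backward()
scaler.minimize(sgd, scaled)

# Or: the pattern added by this PR. An explicit step that mirrors
# Optimizer.step and therefore works with param_group-style
# parameter lists.
scaler.scale(loss).backward()
scaler.step(sgd)
sgd.clear_grad()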

@paddle-bot-old

Thanks for your contribution!
Please wait for the result of CI first. See the Paddle CI Manual for details.

@jerrywgz (Contributor) left a comment

LGTM

optimizer(Optimizer): The optimizer used to update parameters.
Examples:
.. code-block:: python
import paddle
Contributor:

Add a blank line before the import.

Contributor Author:

done, thx


If the scaled gradients of parameters contain NaN or Inf, the parameter update is skipped.
Otherwise, it first unscales the scaled gradients of parameters, then updates the parameters.
Args:
Contributor:

Add a blank line before Args.

Contributor Author:

done, thx
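Put together, the docstring quoted above implies a control flow roughly like the following hedged sketch; _unscale and _found_inf are assumed internal names introduced here for illustration, not confirmed Paddle internals:

# Hedged sketch of what GradScaler.step presumably does internally,
# inferred from the docstring quoted above.
def step(self, optimizer):
    # unscale the previously scaled gradients in place, recording
    # whether any of them contain NaN/Inf
    self._unscale(optimizer)
    if self._found_inf:
        # overflow detected: skip this parameter update entirely
        return
    # gradients are finite: perform the normal parameter update
    optimizer.step()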

Otherwise, it first unscales the scaled gradients of parameters, then updates the parameters.
Args:
optimizer(Optimizer): The optimizer used to update parameters.
Examples:
Contributor:

Add a blank line before Examples.

Contributor Author:

done, thx

phlrain previously approved these changes Aug 16, 2021
@TCChenlong (Contributor) left a comment

LGTM for API docs

@phlrain self-requested a review August 16, 2021 04:06
@zhiqiu merged commit e29c2d1 into PaddlePaddle:develop Aug 16, 2021