Add momentum operator #4571
Conversation
paddle/operators/momentum_op.h (outdated)
auto p = EigenVector<T>::Flatten(*ctx.Input<Tensor>("Param"));
auto g = EigenVector<T>::Flatten(*ctx.Input<Tensor>("Grad"));
auto v = EigenVector<T>::Flatten(*ctx.Input<Tensor>("Velocity"));
float lr = ctx.Input<Tensor>("LearningRate")->data<float>()[0];
That might not be good for GPU. If the LearningRate is in GPU memory, we cannot read the float value directly on the host.
Okay, thanks.
Fixed as per #4598
LGTM.
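For reference, here is a minimal sketch of the direction this review points in: keep LearningRate as a 1-element tensor and broadcast it inside the Eigen expression, so the scalar is never dereferenced on the host. This sketch uses plain Eigen tensors with illustrative sizes and values; it is not the framework's EigenVector wrapper and not the actual change from #4598.

#include <unsupported/Eigen/CXX11/Tensor>
#include <iostream>

int main() {
  // Flat stand-ins for Param, Grad, Velocity, plus a 1-element LearningRate.
  Eigen::Tensor<float, 1> p(4), g(4), v(4), lr(1);
  p.setConstant(1.0f);
  g.setConstant(0.5f);
  v.setConstant(0.0f);
  lr.setConstant(0.1f);
  const float mu = 0.9f;  // momentum coefficient

  // velocity_out = mu * velocity + grad
  Eigen::Tensor<float, 1> v_out = v * mu + g;

  // Broadcast the 1-element learning-rate tensor to the gradient's size so
  // the multiply stays inside the Eigen expression (and can therefore be
  // evaluated on a device), instead of reading data<float>()[0] on the host.
  Eigen::DSizes<Eigen::DenseIndex, 1> bcast(4);
  Eigen::Tensor<float, 1> p_out = p - lr.broadcast(bcast) * v_out;

  std::cout << p_out(0) << "\n";  // 1 - 0.1 * (0.9*0 + 0.5) = 0.95
  return 0;
}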
class MomentumOpKernel : public framework::OpKernel<T> {
 public:
  void Compute(const framework::ExecutionContext& ctx) const override {
    auto param_out = ctx.Output<framework::Tensor>("ParamOut");
These variables would be clearer if declared with auto* to indicate the real (pointer) type, for example:
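The declaration above would then read (a style-only change, behavior unchanged):

auto* param_out = ctx.Output<framework::Tensor>("ParamOut");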
Thanks for this PR, @sidgoyal78! Since our book chapters heavily depend on these optimizer operators, let's merge this PR ASAP. We can leave unifying the naming style for future work.
This PR adds the implementation of the momentum operator.
In summary, we want to perform the update with a new velocity vector, such that

velocity_out = mu * velocity + grad
param_out = param - learning_rate * velocity_out

(where mu is the momentum coefficient).
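For concreteness, here is a minimal host-side reference of that update rule applied element-wise (a hedged sketch with made-up sizes and values, not the operator kernel itself):

#include <cstdio>
#include <vector>

int main() {
  // Illustrative values; the real operator works on framework tensors.
  std::vector<float> param{1.0f, 2.0f}, grad{0.5f, 0.25f}, velocity{0.0f, 0.0f};
  const float mu = 0.9f;  // momentum coefficient
  const float lr = 0.1f;  // learning rate
  for (size_t i = 0; i < param.size(); ++i) {
    velocity[i] = mu * velocity[i] + grad[i];  // velocity_out = mu * velocity + grad
    param[i] -= lr * velocity[i];              // param_out = param - lr * velocity_out
  }
  std::printf("%f %f\n", param[0], param[1]);  // 0.950000 1.975000
  return 0;
}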