[Dist Pass] Amp Pass #38764

JZ-LIANG · 2022-01-06T09:02:54Z

PR types

New features

PR changes

Others

Describe

amp pass supports for auto parallel (support dist op and dist context)
different from fleet.meta_optimizer.amp which uses nested callback for modification of forward/backward/update. this amp pass modify the full program(forward/backward/update) in once, which increases the independency and maintainability.
the performance is the same as fleet.meta_optimizer.amp

…rding-stage-1-2-3

paddle-bot-old · 2022-01-06T09:04:01Z

Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

…-pass

aoyulong · 2022-01-10T11:22:50Z

python/paddle/distributed/auto_parallel/parallelizer.py

@@ -185,10 +188,9 @@ def _get_dist_program(self, rank, dist_context=None, relaunch_phase=False):
            self._parameter_list, self._no_grad_set, self._callbacks)

        # serial forward pass
-        self._apply_serial_pass(completed_main_program, serial_startup_program)
-
+        self._apply_serial_pass(completed_main_program, serial_startup_program,


Please change _apply_serial_pass() to _apply_pre_passes() and change _apply_post_optimization_passed to _apply_post_passes.

aoyulong · 2022-01-10T11:28:00Z

python/paddle/distributed/auto_parallel/operators/dist_check_finite_and_unscale.py

+from ..dist_attribute import OperatorDistributedAttribute
+from paddle.distributed.auto_parallel.process_group import get_world_process_groups
+
+global_process_mesh = get_world_process_groups().ranks


Please change get_world_process_groups() to get_world_process_group().

fixed, though it is not my fault~ lol

aoyulong · 2022-01-10T11:29:48Z

python/paddle/distributed/passes/auto_parallel_amp.py

+                                                     g_dist_attr.dims_mapping)
+        self.dist_context.set_op_dist_attr_for_program(new_op, new_op_dist_attr)
+
+        main_block._sync_with_cpp()


Please add the necessary comments, especially for the methods without underscores.

…-pass

sneaxiy

LGTM

JZ-LIANG added 16 commits December 27, 2021 21:16

auto parallel sharding base

210b790

chmod

6f031e8

Merge remote-tracking branch 'upstream/develop' into AutoParallel/Sha…

aad24bc

…rding-stage-1-2-3

add unitest

d693a48

set unitest cmake dist label

7becc2c

revise code according to rewiew

d8d7c91

chmod

63323bb

bugfix for grad_clip and param broadcast

dbf094e

Merge remote-tracking branch 'upstream/develop' into AutoParallel/Sha…

004aa3e

…rding-stage-1-2-3

chmod

15f32f8

update unitest

1b524c3

chmod

ee9febc

add clip

30e6e20

chmod

162f229

add amp pass

5c3887e

chmod

791feab

JZ-LIANG added 5 commits January 7, 2022 11:17

Merge remote-tracking branch 'upstream/develop' into AutoParallel/amp…

72f5fb8

…-pass

add unitest

7c0548f

remove grad update

ee848f7

fixed bug

0692e3b

fixed bug

c092967

JZ-LIANG changed the title ~~[Auto Parallel] Amp Pass~~ [Dist Pass] Amp Pass Jan 10, 2022

aoyulong reviewed Jan 10, 2022

View reviewed changes

aoyulong previously approved these changes Jan 10, 2022

View reviewed changes

fixed typose

3868454

JZ-LIANG dismissed aoyulong’s stale review via 3868454 January 10, 2022 11:42

fixed typoes

7341921

Merge remote-tracking branch 'upstream/develop' into AutoParallel/amp…

6a43864

…-pass

sneaxiy approved these changes Jan 11, 2022

View reviewed changes

JZ-LIANG merged commit cc24427 into PaddlePaddle:develop Jan 12, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Dist Pass] Amp Pass #38764

[Dist Pass] Amp Pass #38764

JZ-LIANG commented Jan 6, 2022 •

edited

Loading

paddle-bot-old bot commented Jan 6, 2022

aoyulong Jan 10, 2022 •

edited

Loading

JZ-LIANG Jan 10, 2022

aoyulong Jan 10, 2022

JZ-LIANG Jan 10, 2022

aoyulong Jan 10, 2022

JZ-LIANG Jan 10, 2022

sneaxiy left a comment

[Dist Pass] Amp Pass #38764

[Dist Pass] Amp Pass #38764

Conversation

JZ-LIANG commented Jan 6, 2022 • edited Loading

PR types

PR changes

Describe

paddle-bot-old bot commented Jan 6, 2022

aoyulong Jan 10, 2022 • edited Loading

Choose a reason for hiding this comment

JZ-LIANG Jan 10, 2022

Choose a reason for hiding this comment

aoyulong Jan 10, 2022

Choose a reason for hiding this comment

JZ-LIANG Jan 10, 2022

Choose a reason for hiding this comment

aoyulong Jan 10, 2022

Choose a reason for hiding this comment

JZ-LIANG Jan 10, 2022

Choose a reason for hiding this comment

sneaxiy left a comment

Choose a reason for hiding this comment

JZ-LIANG commented Jan 6, 2022 •

edited

Loading

aoyulong Jan 10, 2022 •

edited

Loading