
[Auto Parallel] Improve the fine-grained APIs #46552

Merged
merged 18 commits into PaddlePaddle:develop on Oct 12, 2022

Conversation

Contributor

@aoyulong aoyulong commented Sep 27, 2022

PR types

Others

PR changes

APIs

Describe

In an earlier PR, we proposed fine-grained APIs for users who want to control the execution logic themselves. This PR further improves those fine-grained APIs as follows:

  • Improved distributed dataloader
    • Add the distributed dataloader (recommended) and dataloader_from_generator (for legacy support) methods.
    • Align the interface with the serial dataloader as closely as possible.
  • Improved fine-grained APIs: dataloader + prepare + run (training example below; an eval-mode sketch follows this list)
    import paddle
    import paddle.vision.transforms as T
    import paddle.distributed.auto_parallel as auto
    from paddle.vision.datasets import MNIST

    transform = T.Compose([
        T.Transpose(),
        T.Normalize([127.5], [127.5])
    ])
    train_dataset = MNIST(mode='train', transform=transform)
    valid_dataset = MNIST(mode='test', transform=transform)

    model = paddle.vision.models.LeNet()
    loss = paddle.nn.CrossEntropyLoss()
    optimizer = paddle.optimizer.Adam(
        learning_rate=0.001, parameters=model.parameters())
    metrics = paddle.metric.Accuracy(topk=(1, 2))

    engine = auto.Engine(model, loss, optimizer, metrics)

    # Step 1: build the distributed dataloader
    dataloader = engine.dataloader(train_dataset,
                                   epochs=2,
                                   batch_size=64,
                                   mode="train")

    # Step 2: build the distributed program
    engine.prepare(mode="train")

    feed_dict = ...
    fetch_list = ...
    # Step 3: run the distributed program with the distributed dataloader
    for data in dataloader:
        outs = engine.run(data, feed=feed_dict, fetch_list=fetch_list, mode="train")
    • Add the prepare API to explicitly control when the distributed program is partitioned and built.
    • Replace __call__ with run to conform to the serial executor interface.
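
For reference, the same dataloader + prepare + run flow can be reused for evaluation. The sketch below continues from the engine and valid_dataset defined in the example above; mode="eval" and the dropped epochs argument are assumptions extrapolated from the training example rather than details confirmed in this PR, and the legacy dataloader_from_generator path is not shown because its exact arguments are not spelled out here.

    # Evaluation with the same fine-grained flow (sketch, not the confirmed API).
    # Step 1: build a distributed dataloader over the validation dataset
    #         (mode="eval" is assumed by analogy with mode="train" above).
    valid_dataloader = engine.dataloader(valid_dataset,
                                         batch_size=64,
                                         mode="eval")

    # Step 2: build the distributed program for evaluation
    engine.prepare(mode="eval")

    # Step 3: run the distributed program batch by batch
    for data in valid_dataloader:
        outs = engine.run(data, mode="eval")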


paddle-bot bot commented Sep 27, 2022

Your PR has been submitted. Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

@aoyulong aoyulong changed the title [Auto Parallel] Improve the distributed loader [Auto Parallel] Improve the fine-grained APIs Oct 12, 2022
Contributor

@JZ-LIANG JZ-LIANG left a comment


LGTM

@JZ-LIANG JZ-LIANG merged commit 686fa07 into PaddlePaddle:develop Oct 12, 2022
zhaoyinglia pushed a commit to zhaoyinglia/Paddle that referenced this pull request Oct 19, 2022
* [Auto Parallel] Support different dataloaders

* [Auto Parallel] Add num_shards config for dataset

* [Auto Parallel] Unify the logger and outputs of Engine API

* [Auto Parallel] Fix the bugs of to_static

* [Auto Parallel] Adjust the test_to_static.py

* [Auto Parallel] Add the prepare API and replace __call__ with run

* [Auto Parallel] Improve the private implementations of Engine

* [Auto Parallel] Set capacity of dataloader for opt tuning

* [Auto Parallel] [WIP] Change the fine-grained API

* [Auto Parallel] Improve APIs to support different user cases

* [Auto Parallel] Add removed config

* [Auto Parallel] Add imports

* [Auto Parallel] Fix bugs for to_static

* [Auto Parallel] Remove unnecessary imports
XiaoguangHu01 pushed a commit that referenced this pull request Oct 19, 2022
…47145)

* [Auto Parallel] Make Engine class callable (#46416)

* [Auto Parallel] Improve the user-defined fetches and logging

* [Auto Parallel] Make Engine class callable

* [Auto Parallel] Update the data loading of tuner

* Print IPS in auto parallel Engine (#46554)

* [AutoParallel] fix dist_split (#46505)

* [AutoParallel] fix dist_split

* add unittest

* update cmakelist

* [AutoParallel] fix sharding (#46572)

* [AutoParallel] fix process_mesh (#46583)

* [AutoParallel] fix reshard when train with eval (#46605)

* [AutoParallel] fix reshard when train with eval

* fix mppp

* [AutoParallel] fix amp when predict (#46637)

* [Auto Parallel]Update comp cost and completion for gpt auto search (#46387)

* update comp cost and completion for gpt auto search

* add unittest

* [Auto Parallel] Fix bugs caused by the inconsistent outputs of Engine API (#46633)

* [Auto Parallel] Unify the logger and outputs of Engine API

* [Auto Parallel] Fix the bugs of to_static

* [Auto Parallel] Adjust the test_to_static.py

* [Auto Parallel] Improve the fine-grained APIs (#46552)

* [Auto Parallel] Support different dataloaders

* [Auto Parallel] Add num_shards config for dataset

* [Auto Parallel] Unify the logger and outputs of Engine API

* [Auto Parallel] Fix the bugs of to_static

* [Auto Parallel] Adjust the test_to_static.py

* [Auto Parallel] Add the prepare API and replace __call__ with run

* [Auto Parallel] Improve the private implementations of Engine

* [Auto Parallel] Set capacity of dataloader for opt tuning

* [Auto Parallel] [WIP] Change the fine-grained API

* [Auto Parallel] Improve APIs to support different user cases

* [Auto Parallel] Add removed config

* [Auto Parallel] Add imports

* [Auto Parallel] Fix bugs for to_static

* [Auto Parallel] Remove unnecessary imports

* bugfix (#46921)

* [Auto Parallel] Fix the bug for None labels (#46987)

* [AutoParallel] adapt for gpt-gen (#46771)

* for gpt-gen

* fix reshard

* adapt assign and shape op

* add dist_assign & unittest

* add conditional block unittest

* rename unittest

* [Auto Parallel] Fix the bug of completion (#47056)

* [Auto Parallel] Fix the bug for None labels

* [Auto Parallel] Fix the completion bug

* [AutoParallel] add callbacks (#47014)

* [AutoParallel] add callbacks

* fix unittest

* fix dist_context

* fix engine

* fix cmakelist

* fix unittest's returns

* fix cmakelist

* [Auto Parallel] Add cost interface (#47043)

* add cost interface

* update interface and add unittest

* update unittest

* update interface

* [Auto Parallel]Add parallel tuner (#46189)

* add parallel tuner

* add unittest

* fix unittest

* set timeout of unittest

* set unittest timeout

* fix auto_mode setting

* update unittest

* sync from develop and update unittest

* remove unused import

* update unittest

* update cmakelist

* add unittests

Co-authored-by: Yulong Ao <aoyulong@baidu.com>
Co-authored-by: Ruibiao Chen <chenruibiao@baidu.com>
Co-authored-by: caozhou <48191911+Caozhou1995@users.noreply.github.com>
Co-authored-by: JZ-LIANG <jianzhongliang10@gmail.com>