Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

image_classification cifar-10 train (use sparse trainning) #105

Closed
shenhuinuist opened this issue Sep 23, 2016 · 4 comments
Closed

image_classification cifar-10 train (use sparse trainning) #105

shenhuinuist opened this issue Sep 23, 2016 · 4 comments
Labels

Comments

@shenhuinuist
Copy link

When execute sparse training, we need to set sparse_update=True in network config. Where should I set the command when I train image_classification cifar-10?

@qingqing01
Copy link
Contributor

qingqing01 commented Sep 23, 2016

@shenhuinuist the sparse_update is an attribute of ParamAttr. For example

 fc = fc_layer(input=data, size=128, param_attr=ParamAttr(sparse_update=True))

The sparse training is usually used to accelerate calculation when input is sparse data with highly dimension.

But the data for image classification of cifar-10 is dense type, sparse_update is not applicable to this case.

@shenhuinuist
Copy link
Author

@qingqing01 Thank you so much . Now , I set sparse_update =True and sparse_remote_update = False in paddle.trainer_config_helpers.attr.py , the input data was still image classification of cifar-10, the error was as follows: TypeError: init( ) got an un expected keyword argument ‘sparse_update’. I have two questions. Where does paddle execute the command ? What's more, How does paddle judge the input data is sparse or not ?

@gangliao gangliao reopened this Sep 23, 2016
@qingqing01
Copy link
Contributor

qingqing01 commented Sep 23, 2016

@shenhuinuist I have mentioned above, the cifar-10 data is not sparse input. So the sparse_update is not applicable to this case. We will provide a demo to show how to use sparse training later.

The data type is defined in this line in the data provider file. The data provider defines the types of the input data and provides training or testing data to PaddlePaddle.

Thanks. If there is any question, we can continue to discuss :)

@shenhuinuist
Copy link
Author

@qingqing01 Thank you!

@reyoung reyoung closed this as completed Oct 25, 2016
zhhsplendid pushed a commit to zhhsplendid/Paddle that referenced this issue Sep 25, 2019
thisjiang pushed a commit to thisjiang/Paddle that referenced this issue Oct 28, 2021
* fix unroll bug

* fix broadcast X86 code gen bug
gglin001 added a commit to graphcore/Paddle-fork that referenced this issue Dec 8, 2021
AnnaTrainingG pushed a commit to AnnaTrainingG/Paddle that referenced this issue Sep 19, 2022
AnnaTrainingG pushed a commit to AnnaTrainingG/Paddle that referenced this issue Dec 6, 2023
Change default dropout value in documentation
laipaang pushed a commit to laipaang/Paddle that referenced this issue Jan 16, 2024
* add fuse mt weight only quant
lizexu123 pushed a commit to lizexu123/Paddle that referenced this issue Feb 23, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

4 participants