Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

num_parallel affecting learning results #229

Open
spicytomatoes opened this issue Dec 14, 2021 · 1 comment
Open

num_parallel affecting learning results #229

spicytomatoes opened this issue Dec 14, 2021 · 1 comment

Comments

@spicytomatoes
Copy link

hi, I've tried training on a 32 core machine, naturally i set num_parallel to 32. However the model does not seem to learn at all. Weirdly, when i set num_parallel to 6, the model learns.
The rest of the config is exactly the same as the PubHRL config for hungry geese.

@YuriCat
Copy link
Contributor

YuriCat commented Jan 8, 2022

Thanks for your report!
We ran several experiments with 64 workers, and all the training was successful.
However, it is not easy to learn non-legal moves in this task, and I am sure that training is not stable.

If there is one thing I can say, it is that the PubHRL experiment setup was decided on the first try, so I cannot recommend it with confidence.
As I mentioned in the discussion, I think forward_steps=1 is generally better in this kind of task. Also, a larger entropy regularization coefficient would be better.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants