Remove mpiexec affinity binding in TensorFlow TL1 and TL3 RN50 test #3681

Merged
merged 1 commit into from
Feb 16, 2022


JanuszL
Contributor

@JanuszL JanuszL commented Feb 15, 2022

  • The mpiexec affinity setting collides with the one DALI applies, so
    disable it for the TL1 and TL3 RN50 tests
  • Adds multiprocess index generation to the TL1 and TL3 RN50 tests

Signed-off-by: Janusz Lisiecki <jlisiecki@nvidia.com>

Category:

Other (e.g. Documentation, Tests, Configuration)

Description:

  • The mpiexec affinity setting collides with the one DALI applies, so
    disable it for the TL1 and TL3 RN50 tests
  • Adds multiprocess index generation to the TL1 and TL3 RN50 tests
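
For reviewers who want to see the collision for themselves, a minimal sketch follows. It is not part of this PR and not DALI code. It only uses the standard library call `os.sched_getaffinity` (Linux only) to print the CPU mask a rank inherits from its launcher. The helper names `allowed_cpus` and `is_restricted` are hypothetical, and the fallback for platforms without `sched_getaffinity` is an assumption.

```python
import os


def allowed_cpus(pid: int = 0) -> set:
    """Return the CPU ids the given process may run on (0 = current process)."""
    if hasattr(os, "sched_getaffinity"):
        # Linux: reflects any mask set by a launcher, taskset, or a container.
        return set(os.sched_getaffinity(pid))
    # Assumption: on platforms without sched_getaffinity, treat all CPUs as allowed.
    return set(range(os.cpu_count() or 1))


def is_restricted(cpus: set, total: int) -> bool:
    """True when the allowed set is smaller than the machine's CPU count."""
    return len(cpus) < total


if __name__ == "__main__":
    cpus = allowed_cpus()
    total = os.cpu_count() or 1
    print(f"allowed CPUs: {sorted(cpus)} of {total}")
    if is_restricted(cpus, total):
        print("the CPU mask was already narrowed before this process started")
```

Running this under mpiexec with and without the launcher's binding option should show whether each rank starts with a narrowed mask, which is the state any later affinity setting has to work within.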

Additional information:

Affected modules and functionalities:

  • TL3_RN50_convergence

Key points relevant for the review:

  • NA

Checklist

Tests

  • Existing tests apply
  • New tests added
    • Python tests
    • GTests
    • Benchmark
    • Other
  • N/A

Documentation

  • Existing documentation applies
  • Documentation updated
    • Docstring
    • Doxygen
    • RST
    • Jupyter
    • Other
  • N/A

DALI team only

Requirements

  • Implements new requirements
  • Affects existing requirements
  • N/A

REQ IDs: N/A

JIRA TASK: N/A

@dali-automaton
Collaborator

CI MESSAGE: [3969908]: BUILD STARTED

@dali-automaton
Collaborator

CI MESSAGE: [3969908]: BUILD FAILED

@dali-automaton
Collaborator

CI MESSAGE: [3969908]: BUILD PASSED

@jantonguirao jantonguirao self-assigned this Feb 16, 2022
- The mpiexec affinity setting collides with the one DALI applies, so
  disable it for the TL1 and TL3 RN50 tests
- Adds multiprocess index generation to the TL1 and TL3 RN50 tests

Signed-off-by: Janusz Lisiecki <jlisiecki@nvidia.com>
@JanuszL JanuszL changed the title from "Remove mpiexec affinity binding in TensorFlow TL3 RN50 test" to "Remove mpiexec affinity binding in TensorFlow TL1 and TL3 RN50 test" Feb 16, 2022
@JanuszL
Contributor Author

JanuszL commented Feb 16, 2022

!build

@dali-automaton
Collaborator

CI MESSAGE: [3976316]: BUILD STARTED

@dali-automaton
Collaborator

CI MESSAGE: [3976316]: BUILD FAILED

@dali-automaton
Collaborator

CI MESSAGE: [3976316]: BUILD PASSED

@JanuszL JanuszL merged commit 787942b into NVIDIA:main Feb 16, 2022
@JanuszL JanuszL deleted the tf_affinity branch February 16, 2022 17:56
cyyever pushed a commit to cyyever/DALI that referenced this pull request Feb 21, 2022
…VIDIA#3681)

- The mpiexec affinity setting collides with the one DALI applies, so
  disable it for the TL1 and TL3 RN50 tests
- Adds multiprocess index generation to the TL1 and TL3 RN50 tests

Signed-off-by: Janusz Lisiecki <jlisiecki@nvidia.com>
cyyever pushed a commit to cyyever/DALI that referenced this pull request May 13, 2022
…VIDIA#3681)

cyyever pushed a commit to cyyever/DALI that referenced this pull request Jun 7, 2022
…VIDIA#3681)
