Adjust test scripts and section header for webdataset notebook #3162

Merged
merged 1 commit into from
Jul 19, 2021

Conversation

klecki
Contributor

@klecki klecki commented Jul 19, 2021

Signed-off-by: Krzysztof Lecki klecki@nvidia.com

Description

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • Refactoring (Redesign of existing code that doesn't affect functionality)
  • Other (e.g. Documentation, Tests, Configuration)

What happened in this PR

Fix CI scripts.
Move the `### Sharding` section header in the notebook
to a separate cell, as it didn't render correctly
in the docs.
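
For illustration only, the kind of cell split described above could be sketched as follows. This is not the script used in this PR; `split_header_into_own_cell` and `as_source` are hypothetical helpers, and the sketch assumes the notebook JSON stores each cell's `source` as a list of lines where every line except the last ends with a newline.

```python
import copy


def as_source(lines):
    # Assumed notebook convention: every line but the last keeps a trailing newline.
    return [line + "\n" for line in lines[:-1]] + lines[-1:]


def split_header_into_own_cell(notebook, header):
    """Return a copy of `notebook` where `header` (and anything after it
    in the same markdown cell) starts a new markdown cell."""
    result = copy.deepcopy(notebook)
    cells = []
    for cell in result["cells"]:
        lines = [line.rstrip("\n") for line in cell["source"]]
        # Only split when the header is not already the first line of the cell.
        if cell["cell_type"] == "markdown" and header in lines[1:]:
            idx = lines.index(header, 1)
            cell["source"] = as_source(lines[:idx])
            cells.append(cell)
            cells.append(
                {
                    "cell_type": "markdown",
                    "metadata": {},
                    "source": as_source(lines[idx:]),
                }
            )
        else:
            cells.append(cell)
    result["cells"] = cells
    return result


# Hypothetical minimal notebook with the header stuck at the end of a cell.
notebook = {
    "cells": [
        {
            "cell_type": "markdown",
            "metadata": {},
            "source": ["### Data Representation\n", "Some text.\n", "### Sharding"],
        }
    ]
}
fixed = split_header_into_own_cell(notebook, "### Sharding")
```

Note that the last line of the first cell loses its trailing newline after the split, since it becomes the final line of that cell.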

Additional information

  • Affected modules and functionalities:
    CI scripts
    Docs
  • Key points relevant for the review:
    Dependencies in CI scripts?

Checklist

Tests

  • Existing tests apply
  • New tests added
    • Python tests
    • GTests
    • Benchmark
    • Other
  • N/A

Documentation

  • Existing documentation applies
  • Documentation updated
    • Docstring
    • Doxygen
    • RST
    • Jupyter
    • Other
  • N/A

DALI team only

Requirements

  • Implements new requirements
  • Affects existing requirements
  • N/A

REQ IDs: N/A

JIRA TASK: N/A

Signed-off-by: Krzysztof Lecki <klecki@nvidia.com>

@klecki
Contributor Author

klecki commented Jul 19, 2021

!build

@jantonguirao jantonguirao self-assigned this Jul 19, 2021
@dali-automaton
Collaborator

CI MESSAGE: [2610399]: BUILD STARTED

@dali-automaton
Collaborator

CI MESSAGE: [2610804]: BUILD STARTED

@dali-automaton
Collaborator

CI MESSAGE: [2610399]: BUILD PASSED

  "metadata": {},
  "source": [
  "## Introduction\n",
  "### Data Representation\n",
- "Web Dataset is a dataset representation that heavily optimizes networked accessed storage performance. At its simplest, it stores the whole dataset in one tarball file, where individual samples are kept under the files with the same names but different extensions. This approach improves drive access caching on the RAM, since the data is represented sequentially.\n",
+ "Web Dataset is a dataset representation that heavily optimizes networked accessed storage performance. At its simplest, it stores the whole dataset in one tarball file, where individual samples are kept under the files with the same names but different extensions. This approach improves drive access caching on the RAM, since the data is represented sequentially."
Contributor

Not being very well versed in WebDataset, I find this a bit unclear. Does it mean this?

Suggested change
- "Web Dataset is a dataset representation that heavily optimizes networked accessed storage performance. At its simplest, it stores the whole dataset in one tarball file, where individual samples are kept under the files with the same names but different extensions. This approach improves drive access caching on the RAM, since the data is represented sequentially."
+ "Web Dataset is a dataset representation that heavily optimizes networked accessed storage performance. At its simplest, it stores the whole dataset in one tarball file, where each sample is represented by one or more entries with the same name but different extensions. This approach improves drive access caching in RAM, since the data is represented sequentially."
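
To make the layout in the suggested wording concrete, here is a minimal sketch using only Python's standard `tarfile` module. It is not DALI's reader or the `webdataset` library; the sample keys, extensions, payloads and helper names (`write_samples`, `group_by_sample`) are hypothetical. It assumes that everything before the first dot in an entry name is the sample key and the rest is the extension.

```python
import io
import tarfile
from collections import defaultdict


def write_samples(fileobj, samples):
    """Write each sample as consecutive tar entries named <key>.<extension>."""
    with tarfile.open(fileobj=fileobj, mode="w") as tar:
        for key, components in samples:
            for ext, payload in components.items():
                info = tarfile.TarInfo(name=f"{key}.{ext}")
                info.size = len(payload)
                tar.addfile(info, io.BytesIO(payload))


def group_by_sample(fileobj):
    """Read the archive front to back and group entries that share a key."""
    grouped = defaultdict(dict)
    with tarfile.open(fileobj=fileobj, mode="r") as tar:
        for member in tar:
            # Assumption: the key is the part of the name before the first dot.
            key, _, ext = member.name.partition(".")
            grouped[key][ext] = tar.extractfile(member).read()
    return dict(grouped)


# Hypothetical two-sample dataset: an image payload and a class label each.
samples = [
    ("000000", {"jpg": b"fake-image-0", "cls": b"3"}),
    ("000001", {"jpg": b"fake-image-1", "cls": b"7"}),
]
buffer = io.BytesIO()
write_samples(buffer, samples)
buffer.seek(0)
restored = group_by_sample(buffer)
```

Because the entries of one sample sit next to each other in the archive, a reader can reconstruct samples in a single sequential pass, which is the access pattern the notebook text refers to.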

@dali-automaton
Copy link
Collaborator

CI MESSAGE: [2610804]: BUILD PASSED

@klecki klecki merged commit 6f3e231 into NVIDIA:main Jul 19, 2021
5 participants