[request] HPC filesystem to Cloud #924

Open
TomNicholas opened this issue Oct 24, 2023 · 1 comment
Labels
enhancement New feature or request

Comments

@TomNicholas

Please describe problem to be solved
A lot of scientists (particularly climate scientists) are now sharing their data via public cloud buckets (e.g. ERA-5 on GCP, see this paper for rationale). However, transferring data from large numerical simulations produced on HPC systems is often really challenging. I want to use Skyplane to transfer large amounts of data from an HPC local filesystem to the cloud.

(Optional): Suggest a solution
I want to be able to use Skyplane locally on an HPC filesystem (e.g. Lustre) to transfer TBs (or even PBs) of data to the cloud. Even better would be if I could call Skyplane from Python, to integrate with existing tooling for moving scientific datasets to the cloud (which currently can only pull data from other public data portals elsewhere on the internet).

@TomNicholas added the enhancement (New feature or request) label Oct 24, 2023
@sarahwooders
Contributor

Hi @TomNicholas - thanks for the suggestion! I believe this should be possible if a StorageInterface is implemented that interacts with Lustre (or whatever other on-prem filesystem/object storage you have).
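
For illustration only, here is a minimal sketch of the operations such an interface would probably need for a POSIX-mounted filesystem like Lustre: listing files as object-like keys and reading byte ranges so large files can be sent in chunks. The class and method names (`PosixStorageSketch`, `list_objects`, `read_range`, `iter_chunks`) are hypothetical and are not Skyplane's actual StorageInterface API; a real implementation would have to follow whatever methods that base class requires.

```python
import os
from dataclasses import dataclass
from typing import Iterator, Tuple


@dataclass
class LocalObject:
    key: str   # path relative to the root, using "/" separators
    size: int  # size in bytes


class PosixStorageSketch:
    """Hypothetical sketch; NOT Skyplane's StorageInterface."""

    def __init__(self, root: str):
        self.root = os.path.abspath(root)

    def list_objects(self, prefix: str = "") -> Iterator[LocalObject]:
        # Walk the directory tree and expose each file as an object-like key.
        for dirpath, _dirnames, filenames in os.walk(self.root):
            for name in sorted(filenames):
                full = os.path.join(dirpath, name)
                key = os.path.relpath(full, self.root).replace(os.sep, "/")
                if key.startswith(prefix):
                    yield LocalObject(key=key, size=os.path.getsize(full))

    def read_range(self, key: str, offset: int, length: int) -> bytes:
        # Read `length` bytes starting at `offset` (fewer at end of file).
        with open(os.path.join(self.root, key), "rb") as f:
            f.seek(offset)
            return f.read(length)

    def iter_chunks(
        self, key: str, chunk_size: int = 64 * 1024 * 1024
    ) -> Iterator[Tuple[int, bytes]]:
        # Yield (offset, data) pairs so a large file can be uploaded in parts.
        size = os.path.getsize(os.path.join(self.root, key))
        for offset in range(0, size, chunk_size):
            yield offset, self.read_range(key, offset, chunk_size)
```

The chunked reads are there because multi-TB simulation outputs cannot be held in memory; the 64 MiB default is an arbitrary placeholder, not a tuned value.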

@lynnliu030 could VM-to-VM transfers be extended to support on-prem?
