Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow more recent version of pandas to be used #488

Closed
rtitle opened this issue Jul 17, 2018 · 7 comments
Closed

Allow more recent version of pandas to be used #488

rtitle opened this issue Jul 17, 2018 · 7 comments

Comments

@rtitle
Copy link
Collaborator

rtitle commented Jul 17, 2018

The AoU pyclient is currently pegging their pandas requirements because of an old version installed in Leo. See all-of-us/pyclient#27

  1. what is the user error when pandas is upgraded from within a notebook?
  2. can the Leo notebooks image include a more recent version of pandas?
@rtitle
Copy link
Collaborator Author

rtitle commented Jul 17, 2018

Note: the AoU pyclient has this requirement:
https://github.com/all-of-us/pyclient/blob/366f4c7b7f9200d52fdb91ef4d3219ee63c8ec03/py/requirements.txt
If upgrading past 0.21 it would break AoU. Should coordinate with AoU when working on this.

@rtitle
Copy link
Collaborator Author

rtitle commented Jul 18, 2018

@akarukappadath is running into some pandas issues for #401 which might be related.

@rtitle
Copy link
Collaborator Author

rtitle commented Jul 18, 2018

May be relevant: pandas-dev/pandas#20697

@rtitle
Copy link
Collaborator Author

rtitle commented Jul 18, 2018

According to the above python 3.4 is not supported by the latest versions of pandas. We might need to upgrade to python 3.5+ (a good idea anyway).

@akarukappadath
Copy link
Contributor

As Rob mentioned above, python 3.4 isn't supported by the latest version of pandas. To upgrade pandas, we would need to upgrade debian jessie to stretch (#445) in order to upgrade to Python 3.5. However, this step means that we need a newer Spark (currently 2.0.2), which would also imply upgrading Dataproc to 1.2 (#144) and Hail to 0.2 (#193). Since all of these are interwoven, it would likely be smartest (and easiest) to upgrade all of these at the same time.

@danqrodney
Copy link

@rtitle @akarukappadath

(CC @calbach)

Is there an update on when this upgrade will occur? (I'm running into more problems with running an old version of Pandas...)

@akarukappadath
Copy link
Contributor

@danqrodney we're currently working on this, and hoping to get it out by this quarter!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants