Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enhance the data viewer for Pandas dataframes in VSCode #9986

Closed
rchiodo opened this issue May 11, 2022 Discussed in #9867 · 7 comments
Closed

Enhance the data viewer for Pandas dataframes in VSCode #9986

rchiodo opened this issue May 11, 2022 Discussed in #9867 · 7 comments
Labels
data-viewer feature-request Request for new features or functionality

Comments

@rchiodo
Copy link
Contributor

rchiodo commented May 11, 2022

Discussed in #9867

Originally posted by xareelee April 28, 2022
When you run Jupyter/IPython in VSCode, you can view the Pandas dataframes variable in the data viewer:

Screen Shot 2022-04-29 at 05 31 15

Feature requests for data viewer:

1. add info as info() in the data viewer

  • dtype for each column
  • non-null count
  • unique count
  • total memory usage

as df.info():

Screen Shot 2022-04-29 at 05 36 31

2. add a button to view describe() for columns

  • you can select columns to show stat description.
  • when pressing the button, display a popup view to show the results of describe() for selected columns (or all columns if there is no selection)

as df.describe():

Screen Shot 2022-04-29 at 05 36 41

3. provide a textfield to input the expression to apply onto the dataframe

You can add fluent dot syntax onto the dataframe and view the result on the fly. It's useful for data tidying and cleaning.

The textfield looks like:

# The below line is the input in the textfield; `(df)` is a prompt for the variable name
(df) {this is a placeholder for hints}

For example, you can type .max() in the textfield, it will update the data viewer for the output:

(df) .max()

Screen Shot 2022-04-29 at 05 50 13

You can chain the methods the with fluent dot syntax:

(df) .astype({'Open': 'int32', 'High': 'int32', 'Low': 'int32', 'Close': 'int32'}).drop(columns=['Volume'])

Screen Shot 2022-04-29 at 05 56 51

Those would be very useful to view the Pandas dataframes in VSCode.

@github-actions github-actions bot added the triage-needed Issue needs to be triaged label May 11, 2022
@IanMatthewHuff IanMatthewHuff added enhancement data-viewer and removed triage-needed Issue needs to be triaged labels May 12, 2022
@jmew
Copy link
Contributor

jmew commented May 12, 2022

This feature might be covered by Data Wrangler. Every feature that was requested here can be done in Data Wrangler today already.

@rchiodo
Copy link
Contributor Author

rchiodo commented May 12, 2022

I thought it might :)

Do you have any public information you can share about an ETA?

@jmew
Copy link
Contributor

jmew commented May 12, 2022

@rchiodo We are hoping to get an VS Code Insiders release in June but cannot share the repo at this time as it will be private until the insiders release. I think we can use this issue or the original discussion issue if users have any other comments or feedback for what else they'd like to see in this "enhanced data viewer".

@Philip-S-Martin
Copy link

@jmew is there any way I can track the progress on the data wrangler?

@nickdos
Copy link

nickdos commented Oct 31, 2022

This feature might be covered by Data Wrangler. Every feature that was requested here can be done in Data Wrangler today already.

I'm trying to solve the same issue and I can't find "Data Wrangler" via Google (just seems to give me the AWS version of "data wrangler" or general blogs on Python "data wrangling"). Can you provide a link to this, please?

@EHardiR
Copy link

EHardiR commented Oct 31, 2022

I know multiple teams (myself included) that refuse to start using VS Code until there is a better way to view and filter dataframes in Pandas. For now, they stick to PyCharm and RStudio.

@IanMatthewHuff IanMatthewHuff removed their assignment Nov 10, 2022
@rebornix rebornix added feature-request Request for new features or functionality and removed enhancement labels Dec 6, 2022
@DonJayamanne
Copy link
Contributor

Duplicate of #1148

@DonJayamanne DonJayamanne marked this as a duplicate of #1148 Dec 8, 2022
@github-actions github-actions bot locked as resolved and limited conversation to collaborators Sep 15, 2023
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
data-viewer feature-request Request for new features or functionality
Projects
None yet
Development

No branches or pull requests

8 participants