Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[REQUEST] Show multiple GPUs #865

Open
adrianlyjak opened this issue May 30, 2024 · 6 comments
Open

[REQUEST] Show multiple GPUs #865

adrianlyjak opened this issue May 30, 2024 · 6 comments
Assignees
Labels
enhancement New feature or request

Comments

@adrianlyjak
Copy link

This was discussed fairly thoroughly in #426, the issue to add GPU support, but that's been fixed, so figured I'd open a new issue to keep this particular feature visible.

I have multiple GPUs, and expect that a lot of people using btop are futzing around on servers doing ML and want to monitor GPU usage. btop currently supports GPUs (yay!), however it just shows 1

image

My personal preference would be to use the CPU style of UI, but modified a little to support GPUs

  • treat each GPU like the CPU UI treats a core. It would show current GPU usage, current Memory usage, and current power usage.
  • the historical graph would show average total GPU utilization over time like CPU does

While this leaves some historical information out, this approach seems like it should be simple and fit within the current btop design.

There were some other alternatives discussed in #426, such as binding digits 5-0 to display different gpus. This seems perhaps like it might make the feature more complicated to implement well. I personally don't have strong opinion though, and just want to see my other GPU

@adrianlyjak adrianlyjak added the enhancement New feature or request label May 30, 2024
@actionless
Copy link

actionless commented Jul 5, 2024

i can see 2 GPUs on the left side of the app (with the graph), but on the right side it shows only GPU0 and a blank line following it

here's a comparison, of 1gpu vs 2gpu setup:

2024-07-05--1720139716_3809x774_scrot

not sure if that's a separate issue or the same as on topic

@PorkyC
Copy link

PorkyC commented Jul 23, 2024

i can see 2 GPUs on the left side of the app (with the graph), but on the right side it shows only GPU0 and a blank line following it

here's a comparison, of 1gpu vs 2gpu setup:

2024-07-05--1720139716_3809x774_scrot

not sure if that's a separate issue or the same as on topic

Try pressing 5, 6 to toggle GPU settings.

@actionless
Copy link

if i press 5 it shows separately info for gpu 0 and if 6 - for gpu 1

but then this detailed info is hidden - in the overview it shows only gpu 0 and a blank line

@reukiodo
Copy link

I came here with the same issue! Thanks for the 5/6 tip - I didn't know that, but that uses a LOT more vertical space as there are 4 GPUs on this server:

024-09-11 13:34:02 root@ads-vs18:~# nvidia-smi 
Wed Sep 11 13:34:07 2024       
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 550.90.07              Driver Version: 550.90.07      CUDA Version: 12.4     |
|-----------------------------------------+------------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id          Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |           Memory-Usage | GPU-Util  Compute M. |
|                                         |                        |               MIG M. |
|=========================================+========================+======================|
|   0  NVIDIA A100 80GB PCIe          Off |   00000000:01:00.0 Off |                    0 |
| N/A   75C    P0            258W /  300W |   62881MiB /  81920MiB |    100%      Default |
|                                         |                        |             Disabled |
+-----------------------------------------+------------------------+----------------------+
|   1  NVIDIA A100 80GB PCIe          Off |   00000000:41:00.0 Off |                    0 |
| N/A   31C    P0             43W /  300W |       1MiB /  81920MiB |      0%      Default |
|                                         |                        |             Disabled |
+-----------------------------------------+------------------------+----------------------+
|   2  NVIDIA A100 80GB PCIe          Off |   00000000:81:00.0 Off |                    0 |
| N/A   33C    P0             42W /  300W |       1MiB /  81920MiB |      0%      Default |
|                                         |                        |             Disabled |
+-----------------------------------------+------------------------+----------------------+
|   3  NVIDIA A100 80GB PCIe          Off |   00000000:C1:00.0 Off |                    0 |
| N/A   68C    P0             83W /  300W |   10759MiB /  81920MiB |      8%      Default |
|                                         |                        |             Disabled |
+-----------------------------------------+------------------------+----------------------+
                                                                                         
+-----------------------------------------------------------------------------------------+
| Processes:                                                                              |
|  GPU   GI   CI        PID   Type   Process name                              GPU Memory |
|        ID   ID                                                               Usage      |
|=========================================================================================|
|    0   N/A  N/A     61951      C   python                                      62872MiB |
|    3   N/A  N/A    116494      C   python                                      10750MiB |
+-----------------------------------------------------------------------------------------+
2024-09-11 13:34:08 root@ads-vs18:~# 

If I try to view all 4 (press 5, 6, 7, 8) then the dashboard starts to look a lot more cluttered than just showing all 4 under the CPU core list.
image

@reukiodo
Copy link

I think this enhancement would be quite welcome to show all GPU quickstats under the CPU core list, as there is still space under the first GPU in what would appear should be a list:
image

@aristocratos
Copy link
Owner

Yeah, it's supposed to be showing, the empty room has been left there for the additional GPU's.

The current "draw" code for GPU is a bit bugged, including this particular issue.
Haven't had time to go through it yet.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

5 participants