Comments (10)
There are additional memory sensors available for Radeon VII, but the amdgpu utilities currently only read/report the memory loading. You can exam the other memory parameters and determine if any may provide insight into your issue. First, use amdgpu-ls
to get the card path
of the GPU in question. If you examine the contents of the directory you will see several memory related driver files. You can cat each one to examine their contents. Let me know if you find that any of the additional memory information is useful and perhaps I can add visibility to in in a future release.
from gpu-utils.
Tom raises a good point. The RX 5600 XT also has these sensors, and from looking at its files in the device directory, I can find explanation for some task performance issues I've seen for E@H gravitational wave crunching.
For my two cents, I'd think it would be handy to have amdgpu-monitor list, right below Mem Load %, an entry for Mem Use %
, which would be mem_info_gtt_used divided by mem_info_gtt_total.
It also would be handy if amdgpu-ls reported, in the last section, following Current Memory Loading
:
Current Memory Used (GB): <mem_info_gtt_used>
Total Memory (GB): <mem_info_gtt_total>
Current Memory VRAM Used (GB): <mem_info_vram_used>
Total Memory VRAM (GB): < mem_info_vram_total>
Current memory values vary depending on the number of concurrent tasks, the data in the current task, and the app being used by the boinc-client. There has been a bit of discussion on the E@H forums about GPU memory and VRAM use for certain tasks. I don't see the need to plot any of the memory values because they don't seem to be that dynamic.
from gpu-utils.
I started working on this. Only visible in amdgpu-ls
and formatting improvements needed. Available in the latest on master.
from gpu-utils.
That's a nice feature addition for amdgpu-ls.
from gpu-utils.
I am now including usage and improved formatting in amdgpu-ls
. Will work on monitor next.
from gpu-utils.
Latest on master includes memory in amdgpu-monitor
from gpu-utils.
One concern is that gtt memory appears to be system memory on my Radeon VII system, but is the same as GPU VRAM for Fiji and Vega64 systems. Maybe there is a better description of GTT memory.
from gpu-utils.
Thank you Rick. I will download/update my utilities tomorrow and see if that helps.
I fired up the Radeon VII I have under Windows where it "cheerfully" ran 7 Gamma Ray (E@H) tasks without running out of gpu memory. But the memory controller couldn't quite manage 8. Under Linux the same tasks seem to be limited to 4-5 on a Radeon VII.
I hope all this will help me figure out how far I can push these gpus. I probably will end up only with the R5700's rather than keeping the Radeon VII.
That will leave me with 6-9 RX 5XX gpus to stare at on one system.
Tom M
from gpu-utils.
Downloaded the latest Master. From my RX 5600 XT, amdgpu-ls lists Total GTT Memory (GB): 5.984
, but the content of mem_info_gtt_total is 6425673728 (6.425 GB). Similarly, for my RX 570, total GTT is listed as 4.000 GB, but mem_info_gtt_total has 4294967296 (4.294 GB). Why the difference?
The RX5600xt and RX 570 are the same as your Fiji and Vega64 cards; total VRAM is the same as total GTT.
When running the RX5600xt with E@H grav. wave tasks, my VRAM Usage is ~83%, depending on the tasks being run, but GTT Usage is 0.763%, which I don't understand. Seems like a mighty low GPU memory usage. On the RX570s running E@H pulsar tasks, VRAM Usage is ~35%, and GTT usage is ~3%. Again, unexpectedly low.
from gpu-utils.
Downloaded the latest Master. From my RX 5600 XT, amdgpu-ls lists
Total GTT Memory (GB): 5.984
, but the content of mem_info_gtt_total is 6425673728 (6.425 GB). Similarly, for my RX 570, total GTT is listed as 4.000 GB, but mem_info_gtt_total has 4294967296 (4.294 GB). Why the difference?
Iām using 1024 instead of 1000 for conversions. It seems to match advertised capacity better. I researched and found there was a standard published in 1998 that 1000 should be used for KB and 1024 for KiB, but it is not widely followed.
from gpu-utils.
Related Issues (20)
- Feature Request: Support values from sysfs gpu_metrics
- index out of bound error HOT 4
- Can not read parameter and list index out of range HOT 5
- Tool doesn't see GPUs when PCI address has multiple domains HOT 9
- Add GDDR6X memory temps to the sensor readouts for Nvidia cards
- unsupported operand type(s) for *: 'float' and 'NoneType' in GPUmodule.py HOT 29
- Ubuntu 22.04 Debian Package - Invalid Icon Path HOT 1
- Warning: apt-key is deprecated in Ubuntu 22.04 HOT 1
- env.py: conversion of hexadecimal to int fails HOT 2
- Wrong vendor for on-board GPU HOT 1
- AMD Ryzen 5 5700G with Cezanne not working HOT 5
- gpu-mon requires gtk modules even when --gui is not used HOT 10
- ValueError: invalid literal for int() with base 10 at GPUmodules.py line 1236 in read_gpu_pstates HOT 10
- RX7900 (gfx11) Cards Fan Control is Not Functional HOT 10
- When using gpu-pac I receive an error HOT 2
- Support for MacOS-X ? HOT 2
- ValueError: invalid literal for int() with base 10: HOT 12
- VRAM and GTT memory usage reported as None HOT 5
- Q: had it been tested w/ latest Debian bpo Kernel? HOT 14
- PermissionError: [Errno 1] Operation not permitted HOT 9
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
š Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. ššš
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ā¤ļø Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from gpu-utils.