How much memory can GDR pin? Are there differences between Tesla and Quadro? #301
Comments
Both devices use the same driver: NVIDIA-SMI 530.30.02 Driver Version: 530.30.02 CUDA Version: 12.1.
I have tried disabling the CPU PA 46-bit limitation in the BIOS, but I can still only pin less than 220MB of GPU memory.
Here is the error log. Does error 12 mean out of memory? Is there any information about this? GPU id:0; name: NVIDIA RTX A4000; Bus id: 0000:51:00
When I try to gdr_pin 221MB, it fails. Here is the information from dmesg: [64513.944169] gdrdrv:gdrdrv_open:minor=0 filep=0xff2796ccd17ff600
I noticed that the V100's BAR is about 32GB, but the A4000's is only 256MB. Could the A4000 get a 16GB BAR in compute mode? How could I switch the mode?
Hi @Notherthing, as you have already figured out, the limitation is your GPU BAR size. This is a GPU hardware characteristic, and there is not much we can do here. You cannot map the entire GPU memory at once because of the small GPU BAR. But you can use a sliding window technique to map the region you want to use. When you need to access a different region, you free the current mapping first and then map the new region.
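The sliding window idea described above can be sketched roughly as follows. This is only an illustration of the control flow in Python, not gdrcopy code: `plan_windows`, `for_each_window`, and the `pin_and_map` / `unmap_and_unpin` / `work` callbacks are hypothetical names. In a real program the callbacks would wrap `gdr_pin_buffer` + `gdr_map` and `gdr_unmap` + `gdr_unpin_buffer`. The sketch assumes the buffer base address is aligned to the 64 KiB GPU page size and that the window is chosen small enough to fit in BAR1.

```python
from typing import Any, Callable, Iterator, Tuple

# 64 KiB GPU page granularity (GPU_PAGE_SIZE in gdrapi.h).
GPU_PAGE_SIZE = 64 * 1024


def plan_windows(total_bytes: int, window_bytes: int) -> Iterator[Tuple[int, int]]:
    """Yield (offset, length) windows that cover [0, total_bytes) in order.

    Every window starts on a GPU page boundary because window_bytes is
    required to be a multiple of GPU_PAGE_SIZE. The last window may be shorter.
    """
    if total_bytes < 0:
        raise ValueError("total_bytes must be non-negative")
    if window_bytes < GPU_PAGE_SIZE or window_bytes % GPU_PAGE_SIZE:
        raise ValueError("window_bytes must be a positive multiple of GPU_PAGE_SIZE")
    offset = 0
    while offset < total_bytes:
        length = min(window_bytes, total_bytes - offset)
        yield offset, length
        offset += length


def for_each_window(
    total_bytes: int,
    window_bytes: int,
    pin_and_map: Callable[[int, int], Any],      # hypothetical: pin + map one window
    unmap_and_unpin: Callable[[Any], None],      # hypothetical: release that window
    work: Callable[[Any, int, int], None],       # hypothetical: use the mapping
) -> None:
    """Map one window at a time, releasing it before the next one is mapped."""
    for offset, length in plan_windows(total_bytes, window_bytes):
        mapping = pin_and_map(offset, length)
        try:
            work(mapping, offset, length)
        finally:
            # Free the current mapping so the BAR space can be reused.
            unmap_and_unpin(mapping)
```

With a 256MB BAR, a window of, say, 128 MiB leaves headroom for whatever else occupies BAR1. The right window size for a given system is something to measure rather than assume.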
Thank you, my friend. It's a pity to learn that the cheaper Quadro devices have a small BAR size. I noticed that this displaymode is used to switch the GPU mode to get a larger BAR size, but it doesn't mention the A4000 (only the A5000 and higher-spec devices). Will it work for the A4000? And thank you sincerely for your valuable advice. If we cannot enlarge the BAR size, I think special designs are necessary when using GDR.
I am not sure what that script does. Because the A4000 is not in the support list, I would not advise you to try it. Generally, small-BAR GPUs remain small-BAR. You may be able to disable the graphics mode using
@Notherthing depending on your motherboard, you might also be able to get a larger BAR1 by taking advantage of the PCIe "Resizable BAR" feature.
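To check whether a change such as Resizable BAR actually enlarged BAR1, one option is to read the BAR1 total from `nvidia-smi -q -d MEMORY`. The helper below is a small sketch with hypothetical function names. It assumes the report contains a `BAR1 Memory Usage` section whose next line is `Total : <N> MiB`, which should be verified against the output on your own system, and it only looks at the first GPU in the report.

```python
import re
import subprocess
from typing import Optional


def bar1_total_mib(report: str) -> Optional[int]:
    """Return the BAR1 'Total' value in MiB from nvidia-smi -q text, or None.

    Assumes the 'Total' line directly follows the 'BAR1 Memory Usage' header.
    Only the first matching section (first GPU) is considered.
    """
    match = re.search(r"BAR1 Memory Usage\s*\n\s*Total\s*:\s*(\d+)\s*MiB", report)
    return int(match.group(1)) if match else None


def query_bar1_total_mib() -> Optional[int]:
    """Run nvidia-smi and parse its memory report (requires the NVIDIA driver)."""
    result = subprocess.run(
        ["nvidia-smi", "-q", "-d", "MEMORY"],
        capture_output=True,
        text=True,
        check=True,
    )
    return bar1_total_mib(result.stdout)
```

Comparing the value before and after the BIOS change shows whether the BAR was actually resized.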
Thanks. I am going to try it.
When I use a V100, I can gdr_pin nearly all of the device memory (about 32GB).
But when I use an A4000, I can only pin about 220MB (the device has about 16GB of memory).
Are there differences between Tesla and Quadro?