[REQUEST] Support for Nvidia 50 Series GPUs: Pytorch >=2.6 and CUDA 12.8 required #7144
Comments
This is all on Windows 11, fully up to date. Hardware is a 9800X3D, 64GB RAM, and an RTX 5090.

CUDA is: nvcc: NVIDIA (R) Cuda compiler driver

The environment is Python 3.11.7 via Conda. Exact Torch versions are: torch 2.7.0.dev20250311+cu128

PyTorch CUDA is working: python -c "import torch; print(f'PyTorch is installed: {torch.__version__}'); print(f'CUDA available: {torch.cuda.is_available()}')"

The DeepSpeed code is the latest master branch from GitHub. I have tried various combinations, such as turning off AIO and SPARSE_ATTN, and none of them has produced a successful build. I'm not new to building from source, and I've read that building DeepSpeed is supposed to be pretty straightforward, but I've had no luck.

With a fresh command prompt (no additional environment variables set yet), the build fails in the following manner. I'm at a loss as to how to continue.

(speech_env) C:\AI-Content\xtts-api-server\DeepSpeed>pip install .
× python setup.py egg_info did not run successfully.
note: This error originates from a subprocess, and is likely not a problem with pip.
× Encountered error while generating package metadata.
note: This is an issue with the package mentioned above, not pip.
Also, if I install DeepSpeed via pip, deepspeed-0.16.4 gets installed, but the app throws this error on launch. The last line is what led me to believe Torch 2.6 and/or CUDA 12.8 were not currently supported.

[2025-03-17 19:43:43,963] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
FYI, I've manually grabbed the 0.16.4 wheel for Python 3.11 from https://pypi.org/project/deepspeed/#files and still get the same errors as in my second post above after installing it. The error seems to indicate that DeepSpeed was compiled with PyTorch 2.3, which is not going to work on an RTX 5090 that needs a minimum of 2.6, and I'm running 2.7.
@elkay - this is because the Windows whls come with all DeepSpeed ops pre-built. The unfortunate side effect is that we need to pick a torch version at build time, and we had previously chosen 2.3 (due to the changes with libuv and its lack of easy Windows support after that). So that's why you are seeing the mismatch here:
If you set your local environment to use torch 2.3, you should see no issues with the pre-built wheels. I assume you need a newer torch in order to support these 50 series devices? In that case you would either need to build the wheel locally or wait for a future release, and we can try publishing wheels with a newer torch version.
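To make the mismatch concrete, here is a minimal sketch of a major.minor comparison between the torch version a wheel was built against and the torch version in the local environment. `major_minor` and `prebuilt_ops_match` are hypothetical helpers written for illustration; this is not DeepSpeed's actual check, and treating only major.minor as significant is an assumption.

```python
import re


def major_minor(version):
    """Extract (major, minor) from strings like '2.3' or '2.7.0.dev20250311+cu128'."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    return int(match.group(1)), int(match.group(2))


def prebuilt_ops_match(build_torch, runtime_torch):
    """Hypothetical helper: True when build-time and runtime torch agree on major.minor."""
    return major_minor(build_torch) == major_minor(runtime_torch)


if __name__ == "__main__":
    # Build-time version as stated in this thread; runtime version from the report above.
    print(prebuilt_ops_match("2.3", "2.7.0.dev20250311+cu128"))  # prints False
```

With the versions from this thread the check fails, which matches the reported behaviour: the pre-built ops expect 2.3 while the environment runs a 2.7 nightly.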
Yes, as mentioned, the 50 series cards are only supported on PyTorch >= 2.6 and CUDA >= 12.8. This will prevent anyone with the consumer 5070-5090 cards, the new Blackwell workstation cards coming out next month, or the new Blackwell server cards also due next month from using DeepSpeed in any of their solutions unless they can get it self-compiled successfully. I did try to build locally, but that is not working either. I'm attaching a stack trace.
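For anyone wanting to check their own environment against the minimums stated above, a rough sketch follows. `meets_minimums`, `MIN_TORCH`, and `MIN_CUDA` are illustrative names, not part of any library; `torch.__version__` and `torch.version.cuda` are the standard PyTorch attributes, and the latter is `None` on CPU-only builds.

```python
# Minimums as stated in this thread for 50 series cards.
MIN_TORCH = (2, 6)
MIN_CUDA = (12, 8)


def leading_pair(version):
    """Return the first two dot-separated fields as ints, e.g. '12.8' -> (12, 8)."""
    major, minor = version.split(".")[:2]
    return int(major), int(minor)


def meets_minimums(torch_version, cuda_version):
    """Illustrative check; cuda_version may be None when torch has no CUDA support."""
    if cuda_version is None:
        return False
    return (leading_pair(torch_version) >= MIN_TORCH
            and leading_pair(cuda_version) >= MIN_CUDA)


if __name__ == "__main__":
    try:
        import torch
    except ImportError:
        print("torch is not installed")
    else:
        ok = meets_minimums(torch.__version__, torch.version.cuda)
        print(f"torch {torch.__version__}, CUDA {torch.version.cuda}: "
              f"{'meets' if ok else 'below'} the stated minimums")
```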
@elkay - this will prevent building on Windows for the 5070-5090 cards; we have already verified support for these generations on Linux. We can look into building future DeepSpeed Windows whls with PyTorch 2.7; this will just be a change for existing users of those whls. For your stack trace, the error is here:
This means that the VC vars aren't set correctly. Please run this command to set them before trying to build:
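Independently of the exact command, one way to sanity-check the shell before starting a long build is to see whether the MSVC compiler is reachable. This is only a sketch: it assumes that a correctly initialized Visual C++ environment puts `cl` on PATH, and `msvc_compiler_path` is a hypothetical helper, not something DeepSpeed provides.

```python
import shutil


def msvc_compiler_path(which=shutil.which):
    """Return the path of the MSVC compiler `cl` if it is on PATH, else None.

    `which` is injectable so the lookup can be exercised without MSVC installed.
    """
    return which("cl")


if __name__ == "__main__":
    path = msvc_compiler_path()
    if path:
        print(f"MSVC compiler found: {path}")
    else:
        # Assumption: a missing `cl` suggests the VC environment was never set up
        # in this prompt.
        print("cl not found on PATH; the Visual C++ environment is probably not initialized")
```

Run it from the same command prompt that will run `pip install .`, since environment variables set in one prompt do not carry over to another.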
Well, that definitely got further! It built for a while, but did eventually choke again. I've attached the latest build log.
Removing those two lines also worked for me.

deepspeed build time = 387.7761631011963 secs
Installing collected packages: deepspeed

Also happy to report that not only did it successfully build and install, it appears to be working as it did before without any problems on my 5090!
Great, thanks @elkay - I'll announce that we will publish the Windows wheels for our next release with PyTorch 2.7 to support this. And glad it is tested, thanks.
Is your feature request related to a problem? Please describe.
DeepSpeed cannot be used on a system with an Nvidia 50 Series GPU (e.g. 5080 or 5090) because these GPUs are only supported on PyTorch 2.6 and above with CUDA 12.8 or above.
Describe the solution you'd like
Add support for PyTorch >= 2.6 (2.7 is what most are already using) and CUDA 12.8.
Describe alternatives you've considered
There is no alternative solution.
Additional context
This is affecting projects that use TTS, among others.