Change the sparse attention API to be compatible with latest changes of triton #902
…on the triton side
Hi @RezaYazdaniAminabadi, will this update also make the sparse kernels compatible with different GPU architectures (e.g., A100s)?
Hi @sdtblck, yes. I think the main intention of these changes on the Triton side is to support A100. I have already tested it on V100 and will run more tests on A100 as well. Please feel free to try them out and let me know if there are any issues.
Hi @RezaYazdaniAminabadi I updated our fork to the
But I did notice that the triton version in requirements-sparse-attn.txt is
The issue is that there are several places in the installation and JIT-compile paths where we guard against using a higher version of Triton. I am going to resolve them soon!
@RezaYazdaniAminabadi I can confirm this runs on the A100s 🚀. I had to make some changes to the op builder to pass the compatibility steps: https://github.com/EleutherAI/DeeperSpeed/blob/eb9a6a8201215307ba071357a06a9b03c03af3be/op_builder/sparse_attn.py
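For readers following the thread, the kind of version guard being discussed could look roughly like the sketch below. It is only an illustration under stated assumptions: the function names `major_minor` and `is_triton_compatible` are hypothetical, the choice to compare only the major and minor components is an assumption, and none of this is taken from the DeepSpeed or DeeperSpeed op builder.

```python
import re


def major_minor(version: str) -> tuple:
    """Extract (major, minor) from a version string such as '1.0.0.dev20210329'.

    Hypothetical helper for illustration; not part of DeepSpeed or Triton.
    """
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    return int(match.group(1)), int(match.group(2))


def is_triton_compatible(installed: str, required: str = "1.0") -> bool:
    """Return True when the installed major.minor matches the required one.

    Assumption: dev builds of the same major.minor series are treated as
    compatible. A real guard may need to be stricter, since individual
    dev builds can differ in their API.
    """
    return major_minor(installed) == major_minor(required)


if __name__ == "__main__":
    for candidate in ("0.4.0", "1.0.0.dev20210329", "1.0.0.dev20210510"):
        print(candidate, is_triton_compatible(candidate))
```

Matching on the major.minor series rather than an exact pin is a design choice that tolerates a dev build being withdrawn from pip, at the cost of accepting builds that may not actually work.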
@RezaYazdaniAminabadi
Hi @RezaYazdaniAminabadi - it seems triton has totally changed its API and removed
Same problem. I cannot run ruGPT3 on my 3090 because I have to use the newest CUDA and therefore the newest triton. But it seems that triton==1.0.0.dev20210329 was removed from pip, and I don't know what to do. The newest version, triton==1.0.0.dev20210510, does not seem to work for me. UPD:
UPD2: You can get this error if you install the older triton==0.4.0, set up DeepSpeed, and import this:
UPD3: the problem has been solved.
This PR changes the parameters passed to the Triton kernel so that they are compatible with the latest Triton.
This addresses #900 and #838.