feat: Add FP8 support for SM 120 #3248
pamelap-nvidia commented on Apr 2, 2025
- Added FP8 support for SM 120.
- Avoided SM 120 for FP4 in a few places.
- Cubins were already updated in previous MRs.
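The kind of gating described in the list above can be sketched as a lookup from data type to allowed SM versions. This is a minimal illustration under stated assumptions, not the code in this PR: the table `SUPPORTED_SM`, the function `is_supported`, and the specific version sets are all hypothetical, chosen only to show FP8 being accepted on SM 120 while one FP4 path skips it.

```python
# Hypothetical capability table; names and contents are illustrative
# assumptions, not the project's actual API.
SUPPORTED_SM = {
    "fp8": {89, 90, 100, 120},  # assumption: 120 is the kind of entry this PR adds
    "fp4": {100},               # assumption: this particular path skips SM 120
}


def is_supported(sm_version: int, dtype: str) -> bool:
    """Return True if the hypothetical kernel path accepts this SM/dtype pair."""
    try:
        return sm_version in SUPPORTED_SM[dtype]
    except KeyError:
        # Unknown dtypes are rejected loudly rather than treated as unsupported.
        raise ValueError(f"unknown dtype: {dtype!r}") from None


print(is_supported(120, "fp8"))
print(is_supported(120, "fp4"))
```

Keeping the check in one table makes it easy to see which paths exclude a given architecture, which is the sort of per-path exclusion the second bullet refers to.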
/bot run
edb80f7 to 97e6fb3
/bot run
/bot run
PR_Github #1143 [ run ] triggered by Bot
PR_Github #1143 [ run ] completed with state
2bd610c to 4006bfc
/bot run
PR_Github #1144 [ run ] triggered by Bot
PR_Github #1144 [ run ] completed with state
/bot run
PR_Github #1152 [ run ] triggered by Bot
PR_Github #1152 [ run ] completed with state
4006bfc to 7676af8
/bot run
PR_Github #1681 [ run ] triggered by Bot
PR_Github #1681 [ run ] completed with state
/bot run --stage-list "DGX_H100-4_GPUs-PyTorch-1, DGX_H100-4_GPUs-PyTorch-2, DGX_H100-4_GPUs-TensorRT-2"
PR_Github #1803 [ run ] triggered by Bot
PR_Github #1803 [ run ] completed with state
@pamelap-nvidia FYI - @EmmaQiaoCh is also adding the automation testing for gb20x.
Signed-off-by: Pamela Peng <[email protected]>
7676af8 to 452d770
/bot run --stage-list "DGX_H100-4_GPUs-PyTorch-1, DGX_H100-4_GPUs-PyTorch-2, DGX_H100-4_GPUs-TensorRT-2"
PR_Github #1951 [ run ] triggered by Bot
PR_Github #1951 [ run ] completed with state
Hi Pamela, will you also add the test list YAML file for L0 under 'tests/integration/test_lists/test-db'? Alternatively, tell me which tests you want to run on SM120; I also need to change some CI scripts. :)
Resolved offline.
Minor comments. Changes look good to me.
Signed-off-by: Pamela Peng <[email protected]>
Bypassing some checks to merge since comments addressed after PR_Github #1951 were very minor.
* Allow FP8 on SM120

Signed-off-by: Pamela Peng <[email protected]>

* fix sm121

Signed-off-by: Pamela Peng <[email protected]>

* fix

Signed-off-by: Pamela Peng <[email protected]>

* fix pre-commit

Signed-off-by: Pamela Peng <[email protected]>

* review update

Signed-off-by: Pamela Peng <[email protected]>

---------

Signed-off-by: Pamela Peng <[email protected]>
Co-authored-by: Sharan Chetlur <[email protected]>
Signed-off-by: Luis Vega <[email protected]>