-
Notifications
You must be signed in to change notification settings - Fork 1.9k
Issues: triton-lang/triton
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Bad performance in InfoNCE loss between triton vs pytorch implementation
#6605
opened Apr 25, 2025 by
rangehow
tl.dot on transposed matrix tries to rearrange matrix in shared memory
performance
#6569
opened Apr 23, 2025 by
saagarjha
Triton does not detect libcuda.so inside Conda environment
bug
#6558
opened Apr 22, 2025 by
BrandonStudio
Crash when MLIR_DUMP_PATH is set to a directory instead of a file
bug
#6548
opened Apr 21, 2025 by
peymanbr
Autotune bug: autotune result not equal to non-autotuned result
bug
#6524
opened Apr 17, 2025 by
huangrt01
The position of
if-else
significantly affect performance, which is unexpected.
performance
#6491
opened Apr 15, 2025 by
wenhaoli-xmu
Triton BF16 mode is slower compared to FP32 mode.
performance
#6427
opened Apr 8, 2025 by
wenhaoli-xmu
[AMD] BufferOps require additional annotation for grid dependent information
performance
#6398
opened Apr 4, 2025 by
njriasan
[FeatureRequest] Support passing triton.jit functions to triton.jit functions as arguments
#6367
opened Apr 2, 2025 by
alexdremov
[AMD] Triton Kernel generates suboptimal
s_waitcnt vmcnt
when mixing buffer_load and global_load
performance
#6310
opened Mar 26, 2025 by
njriasan
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.