-
Notifications
You must be signed in to change notification settings - Fork 52
Pull requests: ikawrakow/ik_llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Remove what appears to be unnecessary asserts in ggml_cuda_cpy
#560
opened Jun 26, 2025 by
ikawrakow
Loading…
Use cuBLAS for large batches and quants with block size 16
#559
opened Jun 26, 2025 by
ikawrakow
Loading…
Update README.md to add quickstart section
#554
opened Jun 25, 2025 by
jwinpbe
Loading…
2 of 4 tasks
Add DRY and fix the server to use other new samplers.
#504
opened Jun 7, 2025 by
Ph0rk0z
Loading…
2 of 4 tasks
Check if ffn_up and ffn_gate are of the same type before using fmoe
#495
opened Jun 6, 2025 by
ikawrakow
Loading…
ProTip!
Follow long discussions with comments:>50.