Skip to content

Extreme Performance Issue & Unwanted reasoning when using Revisual-R1 #14466

Answered by 0cc4m
arcseedai asked this question in Q&A
Discussion options

You must be logged in to vote

Please post the llama-server log, but my guess would be that you need to set -ngl to a high value like 999 (assuming it still defaults to 0, not sure what behaviour you get with your -1 value). If that doesn't work, you might not have the CUDA binaries and it's running on CPU, but the log would show that.

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@arcseedai
Comment options

Answer selected by arcseedai
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants