Is your feature request related to a problem? Please describe it
Beam search is a well-known decoding technique that can improve LLM output quality, especially in tasks requiring reasoning, at a significant computational cost (slower inference and higher memory use). ChatGPT has been suspected of using it.
Describe the solution
llama.cpp already supports it. All that would be needed is an option to enable it somewhere in the inference config.
What is the motivation / use case for changing the behavior?
Example: mathematical problems. Maximizing the probability of the whole sequence, rather than of each token in isolation, allows the model to catch and prune erroneous paths, since they eventually lead to less probable chains of thought than their correct counterparts.
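To illustrate the pruning idea, here is a minimal sketch of beam search over a toy next-token table. It is not llama.cpp's implementation; `TOY_MODEL`, `next_log_probs`, and `beam_search` are hypothetical names made up for this example, and the probabilities are invented. It has no end-of-sequence handling and simply stops after `max_len` steps.

```python
import math

# Hypothetical toy "model": maps a prefix (tuple of tokens) to
# next-token probabilities. The numbers are invented for illustration.
TOY_MODEL = {
    (): {"A": 0.6, "B": 0.4},
    ("A",): {"x": 0.5, "y": 0.5},
    ("B",): {"z": 0.9, "w": 0.1},
}


def next_log_probs(prefix):
    """Return log-probabilities of each possible next token for a prefix."""
    probs = TOY_MODEL.get(tuple(prefix), {})
    return {tok: math.log(p) for tok, p in probs.items()}


def beam_search(beam_width, max_len):
    """Keep the `beam_width` most probable partial sequences at each step."""
    beams = [((), 0.0)]  # (sequence, cumulative log-probability)
    for _ in range(max_len):
        candidates = []
        for seq, score in beams:
            for tok, lp in next_log_probs(seq).items():
                candidates.append((seq + (tok,), score + lp))
        if not candidates:
            break  # no beam can be extended any further
        # Rank by total sequence log-probability and prune the rest.
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = candidates[:beam_width]
    return beams


# beam_width=1 is equivalent to greedy decoding.
greedy = beam_search(beam_width=1, max_len=2)[0]
best = beam_search(beam_width=2, max_len=2)[0]
print(greedy[0], math.exp(greedy[1]))
print(best[0], math.exp(best[1]))
```

With a width of 1 the search commits to "A" because it is the most probable first token, and ends with a sequence of probability about 0.3. With a width of 2 it keeps "B" alive for one more step and finds "B", "z" at about 0.36, which is the kind of globally better path greedy decoding misses.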
Teachability, documentation, adoption, migration strategy
No response