Description
In the current speculative.cpp implementation, params.sparams.temp is forced to -1.0f:
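For reference, a minimal sketch of the line in question (the exact location and comment in speculative.cpp may differ):

```cpp
// examples/speculative/speculative.cpp (sketch; exact line and comment may vary)
params.sparams.temp = -1.0f; // hard-coded: forces greedy sampling for the draft model
```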
However, if I change this value to 0:
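i.e. roughly this change (illustrative):

```cpp
params.sparams.temp = 0.0f; // changed from the hard-coded -1.0f
```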
draft sampling seems to fail completely (see the attached speculative.log).
Is this intended behavior?
I'm working on #5625, which removes the temperature limit, so I'd like to get this fixed.