Streaming creates a lot of repetitions, behavior does not resemble example #2338

Closed
chengyjonathan opened this issue Aug 6, 2024 · 7 comments

Comments

chengyjonathan commented Aug 6, 2024

./stream -m ./models/ggml-base.en.bin -t 8 --step 500 --length 5000

Not sure if I'm doing something wrong; I just tried to run it out of the box.

But instead of incrementally adding text to the transcription, I get repeating text, as shown below:

[Start speaking]
Testing to see if the log is there any mor

Testing to see if the log is there anymore

Testing to see if the log is there anymore

Testing to see if the log is there anymore

Testing to see if the log is there anymore

Testing to see if the log is there anymore. Okay, it's not there anymore.
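
For context, here is a minimal sketch of the kind of sliding-window loop the stream example implements, written against the public whisper.h API. This is illustrative only, not the actual examples/stream/stream.cpp, and capture_step() is a made-up stand-in for the SDL microphone capture. With --step 500 --length 5000, every half-second tick re-decodes almost the same 5-second window, which is one plausible reason the same sentence keeps getting re-printed as it grows:

```cpp
// Illustrative sketch only - not the real examples/stream/stream.cpp.
// capture_step() is a hypothetical stand-in for the SDL microphone capture;
// here it just appends silence so the sketch compiles on its own.
#include <cstdio>
#include <vector>
#include "whisper.h"

static void capture_step(std::vector<float> & pcm, int n_samples) {
    pcm.insert(pcm.end(), n_samples, 0.0f);
}

int main() {
    const int step_ms   = 500;                                    // --step 500
    const int length_ms = 5000;                                   // --length 5000
    const int n_step    = (step_ms   * WHISPER_SAMPLE_RATE) / 1000;
    const int n_length  = (length_ms * WHISPER_SAMPLE_RATE) / 1000;

    whisper_context * ctx = whisper_init_from_file_with_params(
            "models/ggml-base.en.bin", whisper_context_default_params());

    whisper_full_params params = whisper_full_default_params(WHISPER_SAMPLING_GREEDY);
    params.n_threads      = 8;     // -t 8
    params.single_segment = true;  // one segment per window
    params.print_progress = false;

    std::vector<float> window;
    while (true) {
        capture_step(window, n_step);                 // 0.5 s of new audio
        if ((int) window.size() > n_length) {         // keep only the last 5 s
            window.erase(window.begin(), window.end() - n_length);
        }

        // every tick decodes the *whole* 5 s window again, so roughly 90% of
        // the audio overlaps with the previous tick and its text is re-printed
        if (whisper_full(ctx, params, window.data(), (int) window.size()) == 0) {
            for (int i = 0; i < whisper_full_n_segments(ctx); ++i) {
                printf("%s\n", whisper_full_get_segment_text(ctx, i));
            }
        }
    }

    whisper_free(ctx);
    return 0;
}
```

In other words, some of the repeated lines above may simply be the same overlapping window being decoded again, separate from genuine hallucination on silence.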

@chengyjonathan (Author)

Reading through some other similar issues, I tried:

./stream -m ./models/ggml-large-v2.bin -t 8 --step 500 --length 5000

But I still got the same repetitions:

[Start speaking]
All right, we're going to prove it h

Alright, we're going to prove it her

All right, we're going to improvemen

Alright, we're gonna prove it here. I don't think things are better, but.

Alright, we're gonna prove it here. I don't think things are better, but I'm noticing...

@chengyjonathan (Author)

https://github.com/ggerganov/whisper.cpp/tree/master/examples/stream

I'm mostly going off of this README, where the functionality seems quite effective.

@chengyjonathan (Author)

The VAD transcriptions are still quite nice, even with the base model.

So it might just be a matter of hallucinations on silences?

@chengyjonathan (Author)

The only problem with the VAD approach is that it seems to wait for a pause before running inference, which means it's not quite streaming anymore.
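
As a side note on why that happens: a VAD gate only triggers once the trailing audio looks silent, so inference runs once per utterance instead of once per step. Below is a small, self-contained sketch of such an energy-based gate; it is a hypothetical helper for illustration, not the VAD code the stream example actually uses.

```cpp
// Hypothetical energy-based VAD gate, for illustration only (this is not
// the VAD implementation whisper.cpp's stream example uses).
#include <cmath>
#include <vector>

// Returns true once the last `tail_ms` of captured audio is quieter than
// `rms_thold`, i.e. the speaker has paused.
bool speaker_paused(const std::vector<float> & pcm, int sample_rate,
                    int tail_ms, float rms_thold) {
    const int n_tail = (tail_ms * sample_rate) / 1000;
    if ((int) pcm.size() < n_tail) {
        return false;                     // not enough audio captured yet
    }
    double sum = 0.0;
    for (size_t i = pcm.size() - n_tail; i < pcm.size(); ++i) {
        sum += pcm[i] * pcm[i];
    }
    const double rms = std::sqrt(sum / n_tail);
    return rms < rms_thold;
}

// Usage sketch: buffer audio continuously, and only call whisper_full() on
// the buffered utterance when speaker_paused(...) returns true, then clear
// the buffer. Transcription therefore arrives only after each pause.
```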

@chengyjonathan (Author)

Oh, interesting. When I do:

./stream -m ./models/ggml-base.en.bin --step 0 --step 1000

I start seeing the issue, even though with a step of 10000 the VAD seems to solve the hallucination problem.

So something about the smaller context is causing... repetitions, I guess? Or, in the one-second silences, we're getting repetitions in the inference.

[Start speaking]

Now here's where we do one second based inferences

Now here's where we do one second based inferences

Now here's where we do one second base inferences a

Now here's where we do one second based inferences

Now here's where we do one second base inferences a

Now here's where we do one second based inferences and see how the reaction speed looks like.

There's going to be a lot of repetition now. I wond

there's going to be a lot of repetition now. I wond

there's going to be a lot of repetition now. I wonder why that is.
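
One knob that may be worth ruling out here (an assumption, not a confirmed diagnosis): whether text from the previous window is being fed back in as a prompt. In the C API that is controlled by the no_context flag in whisper_full_params; a small sketch of parameters to experiment with:

```cpp
// Sketch of decode parameters to experiment with in a small-step loop.
// These are real whisper_full_params fields, but treating them as the fix
// for the repetitions above is an assumption, not a confirmed diagnosis.
#include "whisper.h"

whisper_full_params make_small_step_params(int n_threads) {
    whisper_full_params p = whisper_full_default_params(WHISPER_SAMPLING_GREEDY);
    p.n_threads      = n_threads;
    p.no_context     = true;   // don't reuse the previous window's text as a
                               // prompt; carried-over text is easy to re-emit
                               // when consecutive windows contain mostly the
                               // same audio (or mostly silence)
    p.single_segment = true;   // treat each short window as a single segment
    p.print_realtime = false;  // print from the segment loop instead
    return p;
}
```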

@chengyjonathan (Author)

I tried the advice in #1507 (comment) to adjust max_context down, but I still get:

[Start speaking]

Testing one, two, three. Hmm, okay, well let'

Testing one, two, three. Hmm, okay, well let'

Testing one, two, three. Hmm, okay, well let'

Testing one, two, three. Hmm, okay, well let's see how this goes and let's see whether ther

Testing one, two, three. Hmm, okay, well let's see how this goes and let's see whether there's repetitions, there's all these things.

a lot of repetitions.
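
For completeness, the corresponding knob in the C API is the n_max_text_ctx field of whisper_full_params. The field itself is real; whether lowering it, alone or together with no_context, actually cures these repetitions is exactly what remains unclear in this issue. A minimal sketch:

```cpp
// Minimal sketch of lowering the text context via the C API. The value 64
// is arbitrary, chosen only for illustration.
#include "whisper.h"

whisper_full_params make_low_context_params() {
    whisper_full_params p = whisper_full_default_params(WHISPER_SAMPLING_GREEDY);
    p.n_max_text_ctx = 64;    // cap how many past text tokens are kept as context
    p.no_context     = true;  // or drop cross-window text context entirely
    return p;
}
```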

@chengyjonathan (Author)

I tried the large v2 model, as per #1507 (comment), and I still have repetitions. I can't tell if I'm misunderstanding something.
