Is beam search not efficient enough? It's slower than Faster Whisper. #3521

twmht · 2025-04-14T05:39:37Z

twmht
Apr 14, 2025

When I set the beam size to 1, TensorRT-LLM is about 50% faster than Faster Whisper. However, when I set the beam size to 5, the speeds are roughly the same. TensorRT-LLM's latency increase is significantly greater than Faster Whisper's. Any thoughts?

lfr-0531 · 2025-04-14T07:05:57Z

lfr-0531
Apr 14, 2025
Collaborator

When setting beam size to 5, we will predict different output tokens for differnet beams. For such a case, in the decode phase, if batch_size=1, the equivalent batch will be changed from 1 to 5. So the latency will increase.

0 replies

twmht · 2025-04-14T12:11:14Z

twmht
Apr 14, 2025
Author

So i need to set batch size to 5 to reduce the increased latency?

3 replies

lfr-0531 Apr 14, 2025
Collaborator

Nonono, I mean you need to set beam size to 5 or batch size to 5 to Faster Whisper for fair perf comparison.

twmht Apr 14, 2025
Author

yeah I set beam size to 5 for faster whisper, but the latency of TensorRT-LLM increased a lot as compared to faster whisper

lfr-0531 Apr 15, 2025
Collaborator

Got it. I think you can fill a GitHub issue to report this large perf drop.

MahmoudAshraf97 · 2025-04-16T13:29:48Z

MahmoudAshraf97
Apr 16, 2025

TRT-LLM is 3 times faster than CT2 for whisper in almost all cases, you just need to give it enough inputs to see the difference, if you are using it for personal usage and not production, then stick to faster-whiper.
Source: I'm the maintainer of faster whisper

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Is beam search not efficient enough? It's slower than Faster Whisper. #3521

{{title}}

Replies: 3 comments 3 replies

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

Select a reply

Is beam search not efficient enough? It's slower than Faster Whisper. #3521

twmht Apr 14, 2025

Replies: 3 comments · 3 replies

lfr-0531 Apr 14, 2025 Collaborator

twmht Apr 14, 2025 Author

lfr-0531 Apr 14, 2025 Collaborator

twmht Apr 14, 2025 Author

lfr-0531 Apr 15, 2025 Collaborator

MahmoudAshraf97 Apr 16, 2025

twmht
Apr 14, 2025

Replies: 3 comments 3 replies

lfr-0531
Apr 14, 2025
Collaborator

twmht
Apr 14, 2025
Author

lfr-0531 Apr 14, 2025
Collaborator

twmht Apr 14, 2025
Author

lfr-0531 Apr 15, 2025
Collaborator

MahmoudAshraf97
Apr 16, 2025