How to map timestamp to seconds referring to Issue #3627 #3925

shahzebali42 · 2021-10-01T11:38:36Z

How to map timestamp to seconds Issue #3627

How to map timestamps to seconds using the formula :

segment_start + timestep/total_frames * segment_duration:

I am getting correct timestamps but i want to find respecting seconds of the audio.
Can anyone explain this formulae?

What do we have to put in these --> segment_start, segment_duration <-- variables to find respective seconds?

My audio's sample rate=16000 , sample width = 2

stale · 2022-03-02T23:32:13Z

This issue has been automatically marked as stale. If this issue is still affecting you, please leave any comment (for example, "bump"), and we'll keep it open. We are sorry that we haven't been able to prioritize it yet. If you have any new additional information, please include it with your comment!

stale · 2022-04-17T23:20:38Z

Closing this issue after a prolonged period of inactivity. If this issue is still present in the latest release, please create a new issue with up-to-date information. Thank you!

abarcovschi · 2023-12-06T21:09:12Z

@shahzebali42 did you ever figure out where to get the values for segment_start and segment_duration?

abarcovschi · 2023-12-17T14:13:27Z

I figured out how to map from timesteps to seconds for each non-blank token. The formula I used is the following:

sec_time = frame_num * (audio_len / (num_frames * sample_rate))

where:

frame_num = the timestep of the symbol, as returned in the 'timesteps' field of Wl2Decoder.decode() outputs.
audio_len = the number of samples in the loaded audio file corresponding to the transcript (if using batched w2v2 acoustic model inference, will be zero padded to the length of the longest loaded audio file in the batch).
num_frames = the number of frames in the emission matrix returned by the w2v2 acoustic model inference for that audio file (if using batched inference, the number of frames for each audio file will be the same as in this case all loaded audio files are padded to the length of the longest audio file in the batch).
sample_rate = sample rate of loaded audio files (usually 16000 Hz).

shahzebali42 added needs triage question labels Oct 1, 2021

stale bot added the stale label Mar 2, 2022

stale bot closed this as completed Apr 17, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to map timestamp to seconds referring to Issue #3627 #3925

How to map timestamp to seconds referring to Issue #3627 #3925

shahzebali42 commented Oct 1, 2021

stale bot commented Mar 2, 2022

stale bot commented Apr 17, 2022

abarcovschi commented Dec 6, 2023

abarcovschi commented Dec 17, 2023

How to map timestamp to seconds referring to Issue #3627 #3925

How to map timestamp to seconds referring to Issue #3627 #3925

Comments

shahzebali42 commented Oct 1, 2021

How to map timestamp to seconds Issue #3627

How to map timestamps to seconds using the formula :

segment_start + timestep/total_frames * segment_duration:

What do we have to put in these --> segment_start, segment_duration <-- variables to find respective seconds?

stale bot commented Mar 2, 2022

stale bot commented Apr 17, 2022

abarcovschi commented Dec 6, 2023

abarcovschi commented Dec 17, 2023