Verbose transcription doesn't work #738

DryHam · 2024-03-11T01:31:57Z

Hi

In the original OpenAI Whisper in Python you can set verbose=True, e.g.
result = model.transcribe(input_file,
                          language='en',
                          initial_prompt=prompt,
                          fp16=False,
                          verbose=True)

This prints the transcription output as the program runs so you can observe any errors in the transcription before it finalises, e.g.
[00:00.000 --> 00:07.000] Okay.
[00:30.000 --> 00:43.680] I'm making inquiries into a workplace dispute that
[00:43.680 --> 00:50.160] has reportedly occurred on Saturday 6 January
[00:50.160 --> 00:51.160] 2024.

Judging from what I can see in transcribe.py, WhisperX appears to have verbose=True set by default. However, nothing is printed while the program is running, so you can't observe the transcription before it finalises. I tried setting print_progress=True in model.transcribe(), e.g.
result = model.transcribe(audio, batch_size=batch_size, print_progress=True)

However, this just prints out a percentage completion rate, not the transcription itself, e.g.
Progress: 1.43%...
Progress: 2.86%...
Progress: 4.29%...
Progress: 5.71%...
Progress: 7.14%...
Progress: 8.57%...

Is there a way to print out the transcription in Python as the program is running, or is this an error or an intended change with WhisperX?
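
As a stopgap, here is a minimal sketch that prints segments in the same timestamped layout as the original Whisper output, but only once transcribe() has returned, so it is not live output. It assumes the result is a dict with a "segments" list whose entries carry "start", "end" and "text" keys. format_timestamp, format_segment and print_segments are hypothetical helper names for illustration, not part of WhisperX.

```python
def format_timestamp(seconds: float) -> str:
    """Format seconds as MM:SS.mmm, or HH:MM:SS.mmm from one hour onwards."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1_000)
    prefix = f"{hours:02d}:" if hours else ""
    return f"{prefix}{minutes:02d}:{secs:02d}.{millis:03d}"


def format_segment(segment: dict) -> str:
    """Render one segment as '[start --> end] text'."""
    start = format_timestamp(segment["start"])
    end = format_timestamp(segment["end"])
    return f"[{start} --> {end}] {segment['text'].strip()}"


def print_segments(result: dict) -> None:
    """Print every segment of a finished transcription result."""
    # Assumption: result["segments"] is a list of dicts with
    # "start", "end" and "text" keys.
    for segment in result["segments"]:
        print(format_segment(segment))


# Hand-written stand-in for a transcription result, so the sketch
# runs without a model or an audio file.
example_result = {
    "segments": [
        {"start": 0.0, "end": 7.0, "text": " Okay."},
        {"start": 43.68, "end": 50.16,
         "text": " has reportedly occurred on Saturday 6 January"},
    ]
}
print_segments(example_result)
```

With a real run this would be called as print_segments(result) after model.transcribe(...) returns, which still doesn't show the text while the transcription is in progress.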

H4CK3Rabhi mentioned this issue Mar 27, 2024

Add ultization to verbose flag #759

Merged

DryHam closed this as completed Jan 15, 2025
