Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add ultization to verbose flag #759

Merged
merged 2 commits into from
Jan 1, 2025
Merged

Add ultization to verbose flag #759

merged 2 commits into from
Jan 1, 2025

Conversation

H4CK3Rabhi
Copy link
Contributor

#738
had the issue with non-functionality of verbose flag and suggested handling it like openai-whisper

Quote from #738

In the orginal Openai-Whisper in Python you can set verbose=True, e.g.
result = model.transcribe(input_file,
language='en',
initial_prompt=prompt,
fp16=False,
verbose=True)

This prints the transcription output as the program runs so you can observe any errors in the transcription before it finalises, e.g.
[00:00.000 --> 00:07.000] Okay.
[00:30.000 --> 00:43.680] I'm making inquiries into a workplace dispute that
[00:43.680 --> 00:50.160] has reportedly occurred on Saturday 6 January
[00:50.160 --> 00:51.160] 2024.

Judging from what I can see in transcribe.py, WhisperX has verbose=True set by default. However, nothing is printed

Handled the same

Copy link
Collaborator

@Barabazs Barabazs left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you for the contribution!

@Barabazs Barabazs merged commit 51da227 into m-bain:main Jan 1, 2025
yuki-opus pushed a commit to opus-pro/whisperX that referenced this pull request Jan 2, 2025


---------

Co-authored-by: Abhishek Sharma <[email protected]>
Co-authored-by: Barabazs <[email protected]>
yuki-opus added a commit to opus-pro/whisperX that referenced this pull request Jan 6, 2025
* local vad model

* move model to assets

* Remove typo in error message

* Fix link in README.md

* Added Romanian phoneme-based ASR model (m-bain#791)

Co-authored-by: Barabazs <[email protected]>

* feat: add new align models (m-bain#922)

Co-authored-by: Barabazs <[email protected]>

* feat: update Norwegian models (m-bain#687)

Updated Norwegian Bokmål and Norwegian Nynorsk models

Co-authored-by: Barabazs <[email protected]>

* fix: Force ctranslate to version 4.4.0

Force ctranslate to version 4.4.0 due libcudnn_ops_infer.so.8:
SYSTRAN/faster-whisper#729

Co-authored-by: Icaro Bombonato <[email protected]>

* Update MANIFEST.in to include necessary files

* chore: bump version

* feat: update faster-whisper to 1.0.2 (m-bain#814)

* Update faster-whisper to 1.0.2 to enable model distil-large-v3

* feat: add hotwords option to default_asr_options

---------

Co-authored-by: Barabazs <[email protected]>

* feat: add support for faster-whisper 1.0.3 (m-bain#875)




---------

Co-authored-by: Barabazs <[email protected]>

* feat: update versions for pyannote:3.3.2 and faster-whisper:1.1.0 (m-bain#936)

* chore: bump faster-whisper to 1.1.0

* chore: bump pyannote to 3.3.2

* feat: add multilingual option in load_model function

---------

Co-authored-by: Barabazs <[email protected]>

* feat: add verbose output (m-bain#759)




---------

Co-authored-by: Abhishek Sharma <[email protected]>
Co-authored-by: Barabazs <[email protected]>

* feat: add local_files_only option on whisperx.load_model for offline mode (m-bain#867)

Adds the parameter local_files_only (default False for consistency) to whisperx.load_model so that the user can avoid downloading the file and return the path to the local cached file if it exists.

---------

Co-authored-by: Barabazs <[email protected]>

* feat: use model_dir as cache_dir for wav2vec2 (m-bain#681)

* feat: add Python compatibility testing workflow

feat: restrict Python versions to 3.9 - 3.12

* feat: add build and release workflow

* chore: clean up MANIFEST.in by removing unnecessary asset inclusions

* chore: update gitignore

* fix: update README image source and enhance setup.py for long description

* docs: update installation instructions in README

* chore: update license in setup.py

* fix: add UTF-8 encoding when reading README.md

* chore: update ctranslate2 version to restrict <4.5.0

* chore: bump whisperX to 3.3.0

* fix: update import statement for conjunctions module

* refactor: simplify imports for better type inference

* refactor: add type hints

* feat: include speaker information in  WriteTXT when diarizing

* refactor: replace NamedTuple with TranscriptionOptions in FasterWhisperPipeline

---------

Co-authored-by: Max Bain <[email protected]>
Co-authored-by: Max Bain <[email protected]>
Co-authored-by: Alex Zamoshchin <[email protected]>
Co-authored-by: Jim O’Regan <[email protected]>
Co-authored-by: Ruhollah Majdoddin <[email protected]>
Co-authored-by: Barabazs <[email protected]>
Co-authored-by: Ismael Ruiz Ranz <[email protected]>
Co-authored-by: pere <[email protected]>
Co-authored-by: Icaro Bombonato <[email protected]>
Co-authored-by: Frost Ming <[email protected]>
Co-authored-by: moritzbrantner <[email protected]>
Co-authored-by: Hasan Naseer <[email protected]>
Co-authored-by: Abhishek Sharma <[email protected]>
Co-authored-by: Abhishek Sharma <[email protected]>
Co-authored-by: Roque Giordano <[email protected]>
Co-authored-by: bnitsan <[email protected]>
Co-authored-by: Philippe Anel <[email protected]>
croquies pushed a commit to fika-dev/x-custom that referenced this pull request Jan 20, 2025


---------

Co-authored-by: Abhishek Sharma <[email protected]>
Co-authored-by: Barabazs <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants