Support lazily generate intermediate files(.wav) and clear them once the corresponding transcribe task is completed #50

MaleicAcid · 2024-07-08T06:15:22Z

Currently, it seems that intermediate files(.wav) are generated uniformly before the all transcription task starts, and cleared after all transcription tasks are completed.

This results in huge temporary disk usage.
In my example, the task of transcribing of 7.8GB MP4 audios (total 120 files) generates about 50GB intermediate files, which is unfriendly to the gpu server in the cloud environment.

zh-plus · 2024-07-08T07:25:05Z

I'm working on refactoring the intermediate file generation process, which is currently buggy and poorly structured.
During the refactor:

I will maintain the lazy generation strategy. The project's time bottleneck lies in transcription and translation, so a consumer-worker pattern is used to mitigate this. All intermediate files (preprocessing) should be generated before transcription since both require CPU usage.
I will consider clearing these files as soon as transcription is completed.

I expect to finish the refactor by late July. Feel free to submit a PR if you want to help speed up the process.

zh-plus added the enhancement New feature or request label Jul 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support lazily generate intermediate files(.wav) and clear them once the corresponding transcribe task is completed #50

Support lazily generate intermediate files(.wav) and clear them once the corresponding transcribe task is completed #50

MaleicAcid commented Jul 8, 2024 •

edited

Loading

zh-plus commented Jul 8, 2024

Support lazily generate intermediate files(.wav) and clear them once the corresponding transcribe task is completed #50

Support lazily generate intermediate files(.wav) and clear them once the corresponding transcribe task is completed #50

Comments

MaleicAcid commented Jul 8, 2024 • edited Loading

zh-plus commented Jul 8, 2024

MaleicAcid commented Jul 8, 2024 •

edited

Loading