Add Real-Time-Voice-Cloning? #18

Closed
htadashi opened this issue Jan 21, 2020 · 21 comments

Comments

@htadashi

htadashi commented Jan 21, 2020

Thanks for this fantastic resource. I think this Colab notebook on real-time voice cloning could also be a nice addition to the list:
https://colab.research.google.com/drive/1ts-KRxOuuAHmb-AtjU1XTM_lIFOepAN2

@tugstugi
Owner

tugstugi commented Feb 1, 2020

Thanks, I have updated the demo notebook a little bit and uploaded it: https://colab.research.google.com/github/tugstugi/dl-colab-notebooks/blob/master/notebooks/RealTimeVoiceCloning.ipynb

@tugstugi tugstugi closed this as completed Feb 1, 2020
@400lbhacker

In the last step I get this error: "operands could not be broadcast together with shapes (2200,) (4000,) (2200,)"
Is this due to the NumPy version or the Python version (2/3)? Thanks.
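
For readers hitting the same message: it is what NumPy reports when two arrays of different lengths are combined element-wise, so it points to a length mismatch rather than to Python 2 versus 3 as such. The thread does not say which arrays in the notebook were involved or which change fixed it. The following is only a minimal sketch that reproduces this class of error with made-up arrays; the names `buffer`, `chunk` and `try_inplace_add` are hypothetical and do not come from the notebook.

```python
import numpy as np


def try_inplace_add(a, b):
    """Add b into a in place; return the error text if the shapes do not match."""
    try:
        a += b
    except ValueError as exc:
        return str(exc)
    return None


# Hypothetical arrays with the two lengths quoted in the error message.
buffer = np.zeros(2200)
chunk = np.zeros(4000)

message = try_inplace_add(buffer, chunk)
print(message)

# One possible workaround (an assumption, not the maintainer's fix):
# only combine the part that both arrays have in common.
n = min(len(buffer), len(chunk))
buffer[:n] += chunk[:n]
```

Whether trimming is appropriate depends on where the mismatch comes from; often the real cause is an upstream step producing a different number of samples than expected.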

@tugstugi
Owner

@400lbhacker fixed.

@aakashkumar6868

Hey, can you make it work with whole paragraphs?

@400lbhacker

Hey tugstugi, I just wanted to say thank you for making this wonderful project, and also for resolving that issue. That is the fastest fix response time I've ever had on GitHub. Keep up the wonderful project!

@400lbhacker

I really want to talk to you. I'm impressed with a lot of your projects. I'm new to GitHub and still learning how to use it. My email is here: [email protected] and my Facebook is: https://www.facebook.com/profile.php?id=100015734717430

@himanshuragtah1

Hey can u do it for adding paragraphs.

@400lbhacker fixed.

@tugstugi The error "operands could not be broadcast together with shapes (2200,) (4000,) (2200,)" is occurring again. How can this be fixed?

@deanrussell

deanrussell commented Mar 7, 2022

Hi @tugstugi. On colab, I'm getting two errors in the final "Synthesize" step:

FileNotFoundError Traceback (most recent call last) <ipython-input-4-6e155181f8d0> in <module>()

and

FileNotFoundError: [Errno 2] No such file or directory: 'Real-Time-Voice-Cloning/saved_models/default/synthesizer.pt'

Any thoughts on what I'm doing wrong?
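
A quick way to tell a failed download apart from a mistake in the Synthesize cell is to check whether the model file exists before running that cell. The sketch below assumes only the path quoted in the error above; the helper name `missing_files` is hypothetical, and any other model files the notebook needs would have to be added to the list by hand.

```python
from pathlib import Path


def missing_files(paths):
    """Return the entries of `paths` that do not exist as regular files."""
    return [str(p) for p in paths if not Path(p).is_file()]


# Path taken from the FileNotFoundError above; extend the list as needed.
expected = ["Real-Time-Voice-Cloning/saved_models/default/synthesizer.pt"]

missing = missing_files(expected)
if missing:
    print("Missing model files (the download step probably failed):")
    for path in missing:
        print("  ", path)
else:
    print("All expected model files are present.")
```

If the file is reported missing, re-running the setup cell that downloads the models is the first thing to try.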

@tugstugi
Owner

tugstugi commented Mar 7, 2022

@deanrussell Downloading from Google Drive currently doesn't work. Until gdown fixes it, we can't do anything; see: wkentaro/gdown#162

@tugstugi
Owner

tugstugi commented Mar 7, 2022

@deanrussell you can try again, it is now hopefully fixed.

@deanrussell

deanrussell commented Mar 8, 2022

@deanrussell you can try again, it is now hopefully fixed.

Thank you, @tugstugi! It works.

@Cybernite21

Hi, this is a really great tool, but if I disconnect and reconnect, is there a way to reload the state the AI was in, so that I do not have to re-upload the audio and retrain?

@KaliHUB10

THANK YOU

@KaliHUB10

KaliHUB10 commented Aug 12, 2022 via email

@souvikg544

Will giving big audio files of maybe 5 hours help in getting a good output?

@lucidigital

tnx, I have updated the demo notebook little bit and uploaded it: https://colab.research.google.com/github/tugstugi/dl-colab-notebooks/blob/master/notebooks/RealTimeVoiceCloning.ipynb

Google Colab says it no longer supports TensorFlow 1.x and requires updating to TensorFlow 2, so the Colab notebook has become useless :( Can you update it? Thanks

@jujpenabe

The notebook must be upgraded to TensorFlow 2 for compatibility with Colab.

@AlbinoGeek

AlbinoGeek commented Feb 18, 2023

For all the people mentioning tensorflow 2 support....

Just delete the line that mentions tensorflow 1.x -- it works.

@jujpenabe @lucidigital

@Abbienew

Abbienew commented May 1, 2023

My first attempt was successful, but subsequent executions failed with these errors:
ModuleNotFoundError: No module named 'unidecode'
NameError: name 'encoder' is not defined

any solutions? tips?
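
The thread contains no answer to this one. A plausible reading, offered only as an assumption, is that the runtime was reset and the setup cell was not re-run: the missing `unidecode` package makes the import cell fail, and the `NameError` for `encoder` then follows because that name was never defined. A small sketch for checking which packages are missing before running the later cells (the helper name `missing_modules` is hypothetical):

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names that cannot be found in this runtime."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# 'unidecode' is the module named in the error; add others as needed.
required = ["unidecode"]

not_found = missing_modules(required)
if not_found:
    # In a notebook, installing is typically done with: pip install <name>
    print("Install these before continuing:", ", ".join(not_found))
else:
    print("All required modules are importable.")
```

After installing anything that is reported, re-run the notebook cells from the top so that names such as `encoder` are defined again.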

@imvbhuvan


TypeError Traceback (most recent call last)
in <cell line: 23>()
28 #button = widgets.Button(description="Upload Voice File")
29 #button.on_click(_upload_audio)
---> 30 _upload_audio("")

2 frames
/content/Real-Time-Voice-Cloning/encoder/audio.py in preprocess_wav(fpath_or_wav, source_sr, normalize, trim_silence)
40 # Resample the wav if needed
41 if source_sr is not None and source_sr != sampling_rate:
---> 42 wav = librosa.resample(wav, source_sr, sampling_rate)
43
44 # Apply the preprocessing: normalize volume and shorten long silences

TypeError: resample() takes 1 positional argument but 3 were given

I am getting this error constantly. Any tips on how to resolve this, @tugstugi?
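
The thread has no confirmed fix for this. The wording of the `TypeError` is what Python produces when a function accepts its extra parameters by keyword only and is called with positional arguments, which matches recent librosa releases where `orig_sr` and `target_sr` of `librosa.resample` are keyword-only. The sketch below uses a hypothetical stand-in function rather than librosa itself, so it runs without any audio libraries; its toy body just picks samples and is not real resampling.

```python
def resample(y, *, orig_sr, target_sr):
    """Hypothetical stand-in with keyword-only rates (not librosa's implementation)."""
    step = orig_sr / target_sr
    n_out = int(len(y) * target_sr / orig_sr)
    return [y[int(i * step)] for i in range(n_out)]


wav = [0.0, 0.1, 0.2, 0.3]

# Positional call, as on line 42 of encoder/audio.py in the traceback above.
try:
    resample(wav, 16000, 8000)
except TypeError as exc:
    error = str(exc)
    print(error)

# Keyword call: accepted by a keyword-only signature.
fixed = resample(wav, orig_sr=16000, target_sr=8000)
print(fixed)

# Assumed equivalent change in encoder/audio.py (untested here):
# wav = librosa.resample(wav, orig_sr=source_sr, target_sr=sampling_rate)
```

An alternative, also unverified in this thread, would be to pin an older librosa version that still accepts the positional form.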

@roy-sub

roy-sub commented Mar 13, 2024

Hey imvbhuvan, were you able to fix the TypeError: resample() takes 1 positional argument but 3 were given?

If yes, kindly provide the solution!
