Implementation of Tamás Gábor Csapó, László Tóth, Gábor Gosztolya, Alexandra Markó, "Speech Synthesis from Text and Ultrasound Tongue Image-based Articulatory Input", ISCA 11th Speech Synthesis Workshop (SSW11), 2021, accepted, arXiv:2107.02003
txt+ult2wav extends the original Merlin toolkit for articulatory-to-acoustic mapping (ultrasound-to-speech) purposes.
For data, the UltraSuite-TaL corpus is used.
Additional requirement: ultrasuite-tools.
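A minimal sketch of setting up the ultrasuite-tools dependency; the repository URL, the presence of a requirements.txt, and the PYTHONPATH approach are assumptions rather than part of this README, so adjust them to your own setup:

```bash
# Assumed repository URL for ultrasuite-tools; adjust if your copy lives elsewhere.
git clone https://github.com/UltraSuite/ultrasuite-tools.git
cd ultrasuite-tools
pip install -r requirements.txt          # assumed dependency file; install numpy/scipy etc. manually if absent
export PYTHONPATH="$(pwd):$PYTHONPATH"   # make the package importable from the txt+ult2wav recipes
```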
txt+ult2wav recipes:
txt+ult2wav pre-trained models:
- ultrasound-to-speech pre-trained model (Apr 21, 2021; 3 GB)
- text-to-speech pre-trained model (Apr 21, 2021; 3 GB)
- text&ultrasound-to-speech pre-trained model (Apr 22, 2021; 4 GB)
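As an illustration only, a hedged sketch of unpacking one of the downloaded pre-trained models; the archive name and target directory below are hypothetical and should be replaced with the file you actually downloaded and the path your recipe expects:

```bash
# Hypothetical archive name and target directory; substitute the file and path you actually use.
tar -xzf txt_ult2wav_ult2speech_pretrained.tar.gz -C egs/
ls egs/    # verify that the pre-trained model directories appeared where the recipe expects them
```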
This repository contains the Neural Network (NN) based Speech Synthesis System
developed at the Centre for Speech Technology Research (CSTR), University of
Edinburgh.
Merlin is a toolkit for building Deep Neural Network models for statistical parametric speech synthesis. It must be used in combination with a front-end text processor (e.g., Festival) and a vocoder (e.g., STRAIGHT or WORLD).
The system is written in Python and relies on the Theano numerical computation library.
Merlin comes with recipes (in the spirit of the Kaldi automatic speech recognition toolkit) to show you how to build state-of-the-art systems.
Merlin is free software, distributed under an Apache License Version 2.0, allowing unrestricted commercial and non-commercial use alike.
Read the documentation at cstr-edinburgh.github.io/merlin.
Merlin is compatible with Python 2.7-3.6.
Merlin uses the following dependencies:
- numpy, scipy
- matplotlib
- bandmat
- theano
- tensorflow (optional, required if you use tensorflow models)
- sklearn, keras, h5py (optional, required if you use keras models)
To install Merlin, cd into the merlin directory and run the steps below:
- Install some basic tools in Merlin:
  bash tools/compile_tools.sh
- Install the Python dependencies:
  pip install -r requirements.txt
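For copy-paste convenience, the installation steps above can be run in one go; this sketch assumes a fresh checkout of this repository and a working Python environment:

```bash
# Consolidated installation sketch (assumes a fresh checkout of this repository).
cd merlin
bash tools/compile_tools.sh       # compile the bundled signal-processing tools
pip install -r requirements.txt   # install the Python dependencies listed above
```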
For detailed instructions on building the toolkit, see INSTALL and the CSTR blog post.
These instructions are valid for UNIX systems, including various flavors of Linux.
To run the example system builds, see egs/README.txt
As a first demo, please follow the scripts in egs/slt_arctic
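As a hedged sketch of kicking off that first demo, the commands below follow the usual Merlin recipe layout; check egs/slt_arctic in your checkout for the exact script names before relying on them:

```bash
# Sketch of running the first demo; paths follow the standard Merlin recipe layout.
cd egs/slt_arctic/s1
./run_demo.sh    # builds a small demo voice and synthesises a few test sentences
```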
You can also follow Josh Meyer's blog post for detailed instructions
on how to install Merlin and build the SLT demo voice.
For a more in-depth tutorial about building voices with Merlin, you can check out:
- Deep Learning for Text-to-Speech Synthesis, using the Merlin toolkit (Interspeech 2017 tutorial)
- Arctic voices
- Build your own voice
Listen to synthetic speech samples from our SLT arctic voice.
- Create a personal fork of the main Merlin repository in GitHub.
- Make your changes in a named branch different from master, e.g. create a branch my-new-feature.
- Generate a pull request through the Web interface of GitHub.
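A minimal command-line sketch of that workflow; the clone URL is a placeholder for your personal fork:

```bash
# Placeholder URL: replace <your-username> with your GitHub account.
git clone https://github.com/<your-username>/merlin.git
cd merlin
git checkout -b my-new-feature          # work on a branch other than master
# ...edit files, then commit and push the branch to your fork...
git add -A
git commit -m "Describe your change"
git push origin my-new-feature          # then open the pull request from the GitHub web interface
```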
Post your questions, suggestions, and discussions to GitHub Issues.
If you publish work based on Merlin, please cite:
Zhizheng Wu, Oliver Watts, Simon King, "Merlin: An Open Source Neural Network Speech Synthesis System" in Proc. 9th ISCA Speech Synthesis Workshop (SSW9), September 2016, Sunnyvale, CA, USA.