[TTS] Replace diagram with github release link
Signed-off-by: Ryan <[email protected]>
rlangman committed Apr 17, 2024
1 parent b172ed5 commit 0bd1387
Showing 2 changed files with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion tutorials/tts/Audio_Codec_Training.ipynb
@@ -44,7 +44,7 @@
"Neural audio codecs are deep learning models that compress audio into a low bitrate representation. The compact embedding space created by these models can be useful for various speech tasks, such as TTS and ASR.\n",
"\n",
"<div>\n",
-"<img src=\"https://raw.githubusercontent.com/NVIDIA/NeMo/main/tutorials/tts/images/audio_codec_diagram.png\" width=\"800\", height=\"400\"/>\n",
+"<img src=\"https://github.com/NVIDIA/NeMo/releases/download/v1.22.0/nemo_audio_codec.png\" width=\"800\", height=\"400\"/>\n",
"</div>\n",
"\n",
"Audio codec models typically have an *encoder-quantizer-decoder* structure. The **encoder** takes an input audio signal and encodes it into a sequence of embeddings. The **quantizer** discretizes the embeddings to create a lookup table known as a **codebook**. The embeddings saved in the codebook are referred to as **audio codes**. The **decoder** takes the audio codes as input and attempts to reconstruct the original audio signal.\n",
Binary file removed tutorials/tts/images/audio_codec_diagram.png
Binary file not shown.
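
The notebook text in the hunk above describes an *encoder-quantizer-decoder* structure. As a rough illustration only, that flow might be sketched in plain Python as below. This is not the NeMo implementation: in a real codec the encoder and decoder are trained neural networks and the codebook is learned, whereas here the encoder simply frames the signal, the codebook is random, and the decoder is a table lookup. All names (`encode`, `quantize`, `decode`, `FRAME`, `CODEBOOK_SIZE`) are hypothetical.

```python
import random

FRAME = 4          # samples per embedding (hypothetical toy value)
CODEBOOK_SIZE = 8  # number of entries in the lookup table (hypothetical)


def encode(audio, frame=FRAME):
    """Toy encoder: split the signal into fixed-size frames.

    Each frame stands in for one embedding; trailing samples that do not
    fill a whole frame are dropped.
    """
    usable = len(audio) - len(audio) % frame
    return [audio[i:i + frame] for i in range(0, usable, frame)]


def quantize(embeddings, codebook):
    """Toy quantizer: replace each embedding by the index of the nearest
    codebook vector, using squared Euclidean distance."""
    def dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    return [
        min(range(len(codebook)), key=lambda k: dist(emb, codebook[k]))
        for emb in embeddings
    ]


def decode(indices, codebook):
    """Toy decoder: look up each codebook vector and concatenate them."""
    out = []
    for k in indices:
        out.extend(codebook[k])
    return out


rng = random.Random(0)
# Random stand-in for a learned codebook.
codebook = [[rng.uniform(-1, 1) for _ in range(FRAME)] for _ in range(CODEBOOK_SIZE)]
# Random stand-in for an audio signal (18 samples, so 2 are dropped by framing).
audio = [rng.uniform(-1, 1) for _ in range(18)]

indices = quantize(encode(audio), codebook)
reconstruction = decode(indices, codebook)
print(indices, len(reconstruction))
```

The compression in this sketch comes from storing one small integer per frame instead of `FRAME` floating-point samples; the reconstruction is lossy because each frame is replaced by its nearest codebook entry.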
