diff --git a/docs/source/quicktour.rst b/docs/source/quicktour.rst
index 3cb726dc1..42dd6e0fd 100644
--- a/docs/source/quicktour.rst
+++ b/docs/source/quicktour.rst
@@ -73,7 +73,7 @@ Training the tokenizer
 
 In this tour, we will build and train a Byte-Pair Encoding (BPE) tokenizer. For more information
 about the different type of tokenizers, check out this `guide
-`__ in the 🤗 Transformers
+`__ in the 🤗 Transformers
 documentation. Here, training the tokenizer means it will learn merge rules by:
 
 - Start with all the characters present in the training corpus as tokens.