How to add tokenizer of fine-tune whisper model for converting of ctranslater2 #220

ILG2021 · 2023-05-10T10:26:04Z

Hello, everyone, I have fine-tune a whisper model by the methond of this:
https://huggingface.co/blog/fine-tune-whisper

But my model can not be converted to ctranslate2. It seems lacks tokenizer. But the tutorial doesn't reference it. So anyone know how to solve it? This is my model.

funboarder13920 · 2023-05-10T14:38:32Z

You need to add vocab.json, tokenizer.json, special_token_map.json and merges.txt in the checkpoint folder that you want to convert.

You should be able to find them here : https://huggingface.co/openai/whisper-large/tree/main

ILG2021 · 2023-05-10T18:15:59Z

You need to add vocab.json, tokenizer.json, special_token_map.json and merges.txt in the checkpoint folder that you want to convert.

Just copy and upload it? I am fine-tuning a large-v2.

ILG2021 closed this as completed May 14, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to add tokenizer of fine-tune whisper model for converting of ctranslater2 #220

How to add tokenizer of fine-tune whisper model for converting of ctranslater2 #220

ILG2021 commented May 10, 2023

funboarder13920 commented May 10, 2023

ILG2021 commented May 10, 2023 •

edited

Loading

How to add tokenizer of fine-tune whisper model for converting of ctranslater2 #220

How to add tokenizer of fine-tune whisper model for converting of ctranslater2 #220

Comments

ILG2021 commented May 10, 2023

funboarder13920 commented May 10, 2023

ILG2021 commented May 10, 2023 • edited Loading

ILG2021 commented May 10, 2023 •

edited

Loading