The status of NLLB support has been unclear since the switch from OpenNMT to Eole.
This PR facilitates the conversion of official HF NLLB models (e.g. https://huggingface.co/facebook/nllb-200-distilled-1.3B).
This should also facilitate re-enabling support of pre-trained seq2seq models such as BART or T5.
That being said, the current structure of `convert_HF` needs to be reviewed to better support the encoder/decoder duality. #156 was a first major step in making `convert_HF` more modular, and #153 introduced support for encoder keys, but we now need to meld all of this into a more robust logic (see the key-remapping sketch below).
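To make the duality concrete, here is a minimal sketch of the kind of regex-based key remapping this involves. The left-hand patterns follow the HF M2M100/NLLB checkpoint layout; the right-hand Eole-side names are hypothetical placeholders, not the actual naming used in `convert_HF`:

```python
import re

KEY_MAP = [
    # (HF pattern, Eole-style replacement) -- right-hand names are assumptions
    (r"^model\.encoder\.layers\.(\d+)\.self_attn\.(q|k|v)_proj\.",
     r"encoder.transformer_layers.\1.self_attn.linear_\2."),
    (r"^model\.decoder\.layers\.(\d+)\.encoder_attn\.(q|k|v)_proj\.",
     r"decoder.transformer_layers.\1.context_attn.linear_\2."),
]

def remap(hf_key: str) -> str:
    """Map one HF checkpoint key to its (assumed) Eole counterpart."""
    for pattern, repl in KEY_MAP:
        if re.match(pattern, hf_key):
            return re.sub(pattern, repl, hf_key)
    return hf_key  # fall through: key kept as-is

print(remap("model.encoder.layers.0.self_attn.q_proj.weight"))
# -> encoder.transformer_layers.0.self_attn.linear_q.weight (assumed target name)
```

The point is that decoder-only conversion could ignore the `model.encoder.*` and `encoder_attn` namespaces entirely; a robust seq2seq path has to handle all three attention flavors (encoder self-attn, decoder self-attn, cross-attn) explicitly.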
Also, we should probably define a better "HF settings deduction waterfall" in `build_config_dict` (see the sketch below), but I'm not sure there is any centralized repository of all the possible values, as they are defined per model.
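A minimal sketch of what such a centralized waterfall could look like. The candidate key lists are illustrative: HF configs do name the same hyperparameter differently per architecture (`hidden_size` for BERT-likes, `d_model` for BART/T5/NLLB, `n_embd` for GPT-2), and the toy config dict below is not the actual NLLB `config.json`:

```python
def deduce(hf_config: dict, candidates: list[str], default=None):
    """Return the value of the first candidate key present in an HF config dict."""
    for key in candidates:
        if key in hf_config:
            return hf_config[key]
    return default

# Toy NLLB-style config (d_model rather than hidden_size)
hf_config = {"d_model": 1024, "encoder_attention_heads": 16}

hidden_size = deduce(hf_config, ["hidden_size", "d_model", "n_embd"])  # -> 1024
heads = deduce(
    hf_config, ["num_attention_heads", "encoder_attention_heads", "n_head"]
)  # -> 16
```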
Note: HF tokenization was not tested yet; there might be some edge cases to tackle with the prefix transform.

EDIT: HF tokenization patched in 3fd59e6.
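For reference, a quick sanity-check snippet (HF `transformers`, not Eole code) showing the prefix behavior the prefix transform has to reproduce: the NLLB tokenizer emits the source language code as a prefix token and closes the sequence with `</s>`. The exact subword split shown in the comment is approximate:

```python
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained(
    "facebook/nllb-200-distilled-1.3B", src_lang="eng_Latn"
)
ids = tok("Hello world")["input_ids"]
print(tok.convert_ids_to_tokens(ids))
# expected shape: ['eng_Latn', '▁Hello', '▁world', '</s>']
```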