Llama3.1 #56

Merged: 18 commits merged into main from farzad-llama3.1 on Jul 25, 2024

Conversation

farzadab (Contributor) commented Jul 25, 2024:

This PR enables Llama 3.1 training and loading.

Using Llama 3.1 required upgrading transformers. The new version changes how caching works, so this PR also includes changes related to that.
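
For context, a minimal sketch of what the cache-related change typically looks like, assuming it refers to `past_key_values` moving from the legacy tuple format to the `DynamicCache` class in the transformers release that added Llama 3.1 support; the model ID and conversion calls below are illustrative, not this repo's actual code.

```python
# Hedged sketch: handling the newer transformers cache API (assumption: the
# "caching" change means past_key_values is now a Cache object rather than
# the legacy tuple-of-tuples format).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from transformers.cache_utils import DynamicCache

model_id = "meta-llama/Meta-Llama-3.1-8B-Instruct"  # needs a recent transformers release
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)

inputs = tokenizer("Hello", return_tensors="pt")
outputs = model(**inputs, use_cache=True)

past = outputs.past_key_values
if isinstance(past, DynamicCache):
    # Convert to the old tuple format for code that still expects it, and back again.
    legacy = past.to_legacy_cache()
    past = DynamicCache.from_legacy_cache(legacy)
```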

farzadab changed the title from [WIP] Llama3.1 to Llama3.1 on Jul 25, 2024
farzadab marked this pull request as ready for review on July 25, 2024 18:28
farzadab requested review from juberti and zqhuang211 on July 25, 2024 18:29
farzadab (Contributor, Author) commented:

Huh, the transformers version change was not pushed.

zqhuang211 self-requested a review on July 25, 2024 21:49
farzadab merged commit 01960b1 into main on Jul 25, 2024
1 check passed
farzadab deleted the farzad-llama3.1 branch on July 25, 2024 23:57
akshat0311 pushed a commit to jiviai/audio-llm that referenced this pull request Jan 30, 2025
* upgrade transformers for llama 3.1
* fix cache on new version
* TRAIN_ARGS instead of UV_CONFIG
* remove matchtrain validation
zqhuang211 added a commit that referenced this pull request Feb 12, 2025
- add fleurs data set
- update WER calculation to use whisper normalization to be consistent with the whisper paper (see the sketch after this list)
- update several datasets to include proper lang_id that is needed for whisper-normalization
- add new eval configs for commonvoice, fleurs, and covost2
- add test cases for wer and bleu
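
A hedged sketch of the WER-with-Whisper-normalization idea from that commit, assuming the openai-whisper package's text normalizers and jiwer for the WER computation; the helper name and signature here are hypothetical, not this repo's actual eval code. The lang_id matters because only English gets the full normalizer from the Whisper paper, while other languages fall back to the basic one.

```python
# Hedged sketch: WER with Whisper-style text normalization (assumes the
# openai-whisper and jiwer packages; not the repo's actual implementation).
from jiwer import wer
from whisper.normalizers import BasicTextNormalizer, EnglishTextNormalizer


def normalized_wer(reference: str, hypothesis: str, lang_id: str = "en") -> float:
    # English uses the full normalizer described in the Whisper paper; other
    # languages fall back to the basic normalizer, which is why each dataset
    # needs a proper lang_id.
    normalizer = EnglishTextNormalizer() if lang_id == "en" else BasicTextNormalizer()
    return wer(normalizer(reference), normalizer(hypothesis))


print(normalized_wer("It's twenty-two degrees outside.", "its 22 degrees outside"))
```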