Create Italian to all other languages translation process #75

Closed
2 tasks done
andrewtavis opened this issue Feb 24, 2024 · 5 comments
Labels: feature, good first issue, help wanted

Comments

@andrewtavis
Member

Terms

Description

The goal of this issue is to create a process in which a single file translates all words in Italian/translations/words_to_translate.json into all other Scribe languages. To achieve this we'll use m2m100_418M, with the output being a JSON file that maps each source string to its translations, keyed by language. This can then be transferred to an SQLite database table in which each string is a row index and each language has its own column.
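As a rough sketch of the JSON-to-SQLite step described above: the snippet below assumes the translation output maps each Italian word to a dictionary keyed by language code. The sample data, the `translations` table name, the `word` column, and the helper `load_translations_into_sqlite` are all hypothetical and only meant to illustrate the shape of the transfer, not Scribe's actual schema.

```python
import json
import sqlite3

# Hypothetical output of the translation step: each Italian source word maps
# to its translations keyed by language code. The real file layout may differ.
translations_json = """
{
  "casa": {"de": "Haus", "fr": "maison", "es": "casa"},
  "libro": {"de": "Buch", "fr": "livre", "es": "libro"}
}
"""


def load_translations_into_sqlite(conn, translations, table="translations"):
    """Write one row per source word and one column per target language."""
    # Collect every language code that appears so each one gets its own column.
    languages = sorted({lang for row in translations.values() for lang in row})
    columns = ", ".join(f'"{lang}" TEXT' for lang in languages)
    conn.execute(
        f'CREATE TABLE IF NOT EXISTS "{table}" (word TEXT PRIMARY KEY, {columns})'
    )
    placeholders = ", ".join("?" for _ in range(len(languages) + 1))
    # Missing translations become NULL via dict.get returning None.
    rows = [
        (word, *(row.get(lang) for lang in languages))
        for word, row in translations.items()
    ]
    conn.executemany(
        f'INSERT OR REPLACE INTO "{table}" VALUES ({placeholders})', rows
    )
    conn.commit()
    return languages


conn = sqlite3.connect(":memory:")
languages = load_translations_into_sqlite(conn, json.loads(translations_json))
```

Using the source word as the primary key keeps lookups simple, and a word that lacks a translation for some language simply ends up with a NULL in that column.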

Of particular importance is getting a metric for the accuracy of each translation and applying a cutoff so that low-quality translations are no longer included in Scribe applications :)
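One way the cutoff could look, as a minimal sketch: the issue does not say which accuracy metric to use, so the snippet assumes each translation already comes with some score between 0 and 1. The scores, the `0.7` threshold, and the helper `apply_quality_cutoff` are hypothetical placeholders for illustration only.

```python
from typing import Dict, Tuple

# Hypothetical shape: word -> language -> (translation, score in [0, 1]).
# How the score is computed is an open question in this issue.
ScoredTranslations = Dict[str, Dict[str, Tuple[str, float]]]


def apply_quality_cutoff(
    scored: ScoredTranslations, min_score: float = 0.7
) -> Dict[str, Dict[str, str]]:
    """Keep only translations whose score is at least min_score."""
    kept: Dict[str, Dict[str, str]] = {}
    for word, by_lang in scored.items():
        good = {
            lang: text
            for lang, (text, score) in by_lang.items()
            if score >= min_score
        }
        # Skip words for which no translation passed the cutoff.
        if good:
            kept[word] = good
    return kept


# Illustrative values only, not real model output.
scored = {
    "casa": {"de": ("Haus", 0.93), "fr": ("maison", 0.91)},
    "libro": {"de": ("Buch", 0.88), "fr": ("libre", 0.42)},
}
filtered = apply_quality_cutoff(scored, min_score=0.7)
```

Filtering per language rather than per word means one weak translation does not discard the good ones for the same source word.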

Contribution

Happy to work on this or support someone with interest in working on it!

@andrewtavis added the feature, help wanted, and good first issue labels on Feb 24, 2024
@ikeadeoyin
Contributor

Hello @andrewtavis,
I am interested in working on this.

@andrewtavis
Member Author

Sounds good, @ikeadeoyin! Let us merge in another process that's for English and then you can use that as a basis. Should be merged by Wednesday 😊

@ikeadeoyin
Contributor

Alright, that is okay.

@andrewtavis
Member Author

Hey @ikeadeoyin 👋 The process has been set up and we're ready to implement here :) It's actually quite streamlined now. If you make a version of scribe_data/extract_transform/languages/English/translations/translate_words.py that replaces SRC_LANG with Italian we should be good to go here 😊

@andrewtavis
Member Author

Hey @ikeadeoyin 👋 I went ahead and sent along the change in 2b72e64 as I had a few other things that I needed to get done, and this needed to get finished up :) Hope all's well!

github-project-automation (bot) moved this from Todo to Done in Scribe Board on Mar 24, 2024