Creating a French Llama version by translating RedPajama dataset
Meta's LLaMA models were trained on a massive amount of data: 1.0T tokens for the 7B/13B variants and 1.4T tokens for the larger ones, using 2048 A100 (80GB) GPUs over a period of roughly 5 months. Continuing the pre-training of LLaMA on a French corpus is definitely a promising way to improve its performance on French. However, this option is still quite expensive and requires significant computational resources. I'm currently pre-training it on a small French dataset to see how much it actually helps. Stay tuned!
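For reference, here is a minimal sketch of what continued pre-training could look like with the Hugging Face `transformers` Trainer. The checkpoint (`huggyllama/llama-7b`), the French corpus (OSCAR's deduplicated French split), and all hyperparameters are illustrative assumptions, not the actual setup used here:

```python
from datasets import load_dataset
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

model_name = "huggyllama/llama-7b"  # assumption: any LLaMA checkpoint would do
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token  # LLaMA's tokenizer ships without a pad token
model = AutoModelForCausalLM.from_pretrained(model_name)

# Stream a French corpus so it never has to fit in memory or on disk at once.
corpus = load_dataset(
    "oscar", "unshuffled_deduplicated_fr", split="train", streaming=True
)

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=2048)

tokenized = corpus.map(tokenize, batched=True, remove_columns=["id", "text"])

# Standard causal-LM objective: labels are the input ids themselves,
# and the model shifts them internally (mlm=False).
collator = DataCollatorForLanguageModeling(tokenizer, mlm=False)

args = TrainingArguments(
    output_dir="llama-7b-fr-continued",
    per_device_train_batch_size=1,
    gradient_accumulation_steps=16,
    learning_rate=2e-5,
    max_steps=10_000,  # a streaming dataset needs an explicit step budget
    bf16=True,
    logging_steps=50,
)

Trainer(
    model=model,
    args=args,
    train_dataset=tokenized,
    data_collator=collator,
).train()
```

To keep the cost down, parameter-efficient methods such as LoRA could replace the full-model update sketched above; the data pipeline would stay the same.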
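As for the idea in the title, a rough sketch of translating RedPajama into French could look like the following. The `togethercomputer/RedPajama-Data-1T-Sample` dataset and the `Helsinki-NLP/opus-mt-en-fr` MarianMT model are assumptions chosen for illustration, not a confirmed pipeline from this thread:

```python
from datasets import load_dataset
from transformers import pipeline

# English -> French translation model; an assumption for this sketch.
translator = pipeline(
    "translation",
    model="Helsinki-NLP/opus-mt-en-fr",
    device=0,  # first GPU; drop this argument to run on CPU
)

# Stream the RedPajama sample split rather than downloading the full 1T corpus.
redpajama = load_dataset(
    "togethercomputer/RedPajama-Data-1T-Sample", split="train", streaming=True
)

def translate_batch(batch):
    # Crude character-level truncation keeps the sketch simple; a real
    # pipeline would split each document into sentences before translating.
    texts = [t[:1000] for t in batch["text"]]
    outputs = translator(texts, max_length=512)
    return {"text_fr": [o["translation_text"] for o in outputs]}

french = redpajama.map(translate_batch, batched=True, batch_size=8)

# Peek at the first couple of translated documents.
for example in french.take(2):
    print(example["text_fr"][:200])
```

Note that translating anywhere near the full 1T-token corpus would itself be a substantial compute job, which is part of why continued pre-training on existing French text looks attractive by comparison.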