RadioGPT is an educational project that demonstrates how to build and train language models from scratch. Through a series of progressive notebooks, you'll create increasingly sophisticated language models, from a basic character-level model to a more advanced chat-capable LLM.
The project consists of three main notebooks, each building upon the previous one:
- `RadioGPT_1_Generateur_de_Moliere.ipynb`
- `RadioGPT_2_Larger_LLM_Chat.ipynb`
- `RadioGPT_3_Finally_RadioGPT.ipynb`
- **GPTlite**: a lightweight GPT-style transformer
  - Modular attention mechanism
  - Scalable architecture with configurable parameters
  - Support for both training and generation
- Multiple dataset classes:
  - `TinyShakespeare`: character-level dataset
  - `AlpacaDataset`: chat-oriented dataset
- Training and evaluation functions
- Model saving and loading
- Text generation utilities
- Performance monitoring
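The character-level idea behind the `TinyShakespeare` dataset can be illustrated with a short sketch. This is not the actual GPTlite or dataset API, just a minimal stand-alone example of how a character vocabulary maps text to the integer ids a model trains on:

```python
# Minimal character-level tokenization sketch, in the spirit of a
# TinyShakespeare-style dataset: build a vocabulary from the raw text,
# then map characters to integer ids and back.
text = "To be, or not to be"

# Vocabulary: every distinct character in the corpus, sorted for stability.
chars = sorted(set(text))
stoi = {ch: i for i, ch in enumerate(chars)}  # char -> id
itos = {i: ch for ch, i in stoi.items()}      # id -> char

def encode(s):
    """Turn a string into a list of integer token ids."""
    return [stoi[c] for c in s]

def decode(ids):
    """Turn a list of token ids back into a string."""
    return "".join(itos[i] for i in ids)

ids = encode("to be")
print(decode(ids) == "to be")  # round-trip is lossless
```

The model then learns to predict the next id in such a sequence; the later notebooks swap this character vocabulary for richer tokenization and chat-formatted data.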
```shell
pip install torch datasets tqdm transformers
```

- A GPU is recommended (e.g. a T4 or better on Google Colab)
- Clone the repository:

  ```shell
  git clone https://github.com/yourusername/RadioGPT.git
  ```

- Install dependencies:

  ```shell
  pip install torch datasets tqdm transformers
  ```

- Open the notebooks in order:
  - Start with `RadioGPT_1_Generateur_de_Moliere.ipynb`
  - Progress to `RadioGPT_2_Larger_LLM_Chat.ipynb`
  - Finally, explore `RadioGPT_3_Finally_RadioGPT.ipynb`
Contributions are welcome! Please feel free to submit pull requests or open issues for improvements and bug fixes. This project was developed in less than six days, so there is plenty of room for improvement. 😉
- Developed by Chloé Lavrat for Radio France
- All rights reserved by Radio France and Chloé Lavrat
- Thanks to Marc Yefimchuk and Jade Moillic for their help in processing the data and improving the notebooks
- Thanks to the "Attention Is All You Need" authors and the GPT-2 authors for the inspiration 🥰
- Thanks to the PyTorch team for the excellent deep learning framework
- Special thanks to contributors and the community