NotebookLM is a self-hosted platform designed to create, manage, and enhance audiobooks. This platform leverages various tools and models to parse, transcribe, and improve manuscripts, ultimately providing high-quality audio content.
- python audiobook.py
- converts script pdf into structured scene json using mistral-large API.
- triggers Text to Speech server for speech/dialogs and store audio files in resources folder
- triggers Text to Sound server for background sound/music and store audio files in resources folder
- Combines generated audio from speech and sound modules and creates audibook for the provided script
- Prompts for Audiobook: Creating a storyline using prompts Read more
- Script parser - convert to scene structured json
- TTS server - Parler-tts server to generate speech for dialogs in scenes
- AudioGen module - Audiocraft/magnet background sound creator
- Basic audiobook - linear worflow for full audio creation without scene logic
- Challenge: project challenge. Read more
- Approach: Approach in solving the problem. Read more
- Workflow: Worflow for audiobook generation. Read more
- Setup: Follow the setup guide to build an End-to-End platform for Audibook Generation/Notebook LLM.
- UX: UX design for Audiobook generation Read more
- Database Schema: Schema for Scene Editing and Media content Read more
An experiment to build a production grade audiobook content generator system to help publishers build on their IP and reach larger audience.
- NotebookLlama: Additional resources and use cases. Learn more
- WishList - RTX 4090 GPU based computer for building End to End product inhouse. Buy Me Razer Blade 18 or ROG Zephyrus M16