Podcast Generation

This project generates a podcast episode using different APIs and technologies. The main topic of this podcast is birds so each episode will be about a specific bird. This also means that the prompts were written with this in mind.

Text Generation

We use Wikipedia library to get the text about a bird, and Langchain with Openai to generate a structured podcast dialogue.

Audio Generation

We use ElevenLabs API to generate the voices for the podcast. We also create an audio for the podcast intro using Audiocraft. This audio is a song generated with MusicGen and some sound effects generated using AudioGen. We put everything together using pydub library.

Image Generation

We use the Stable Diffusion XL model (stable-diffusion-xl-base-1.0) as a base model and Refiner XL (stable-diffusion-xl-refiner-1.0) as a refiner model to generate a podcast cover.

This project was part of the Uplimit course "Prompt Design & Building AI Products". You can see the Colab notebooks here:

Week 1: Geting the text about your podcast topic, generating the podcast dialogue using OpenAI, and generating the voices with Elevenlabs.
Week 2: Generating a cover for our podcast using Stable Diffusion XL model with a refiner. In this notebook, I tried different prompts to generate a cover in different art styles.
Week 3: Generating music for the intro using AudioCraft. I also used Langchain to create a sequential chain summarising and generating the podcast dialogue. And the same process using Langchain agents.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
audios		audios
covers		covers
src		src
.gitignore		.gitignore
README.md		README.md
example_podcast_generation.ipynb		example_podcast_generation.ipynb
extentions.md		extentions.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Podcast Generation

Text Generation

Audio Generation

Image Generation

About

Releases

Packages

Contributors 2

Languages

lyssascherer/podcast_generation

Folders and files

Latest commit

History

Repository files navigation

Podcast Generation

Text Generation

Audio Generation

Image Generation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages