Skip to content

Latest commit

 

History

History
12 lines (9 loc) · 632 Bytes

README.md

File metadata and controls

12 lines (9 loc) · 632 Bytes

Podcasts-Pipeline

This Project was done using Apache Airflow to build a pipeline for downloading podcast episodes, the episodes are stored in a SQLite Database.

Pros of using Airflow:

  • The project runs daily, downloading the new episodes everyday.
  • Each task runs independently, and logs can be monitored.
  • Each task can be run in an order and can also parallelize.
  • This project can be extened eaily using Airflow.

This is the DAG(Directed Acyclic Graph) that shows how the tasks are ordered in the pipeline: image