Skip to content
View airscholar's full-sized avatar
💭
Do hard things!
💭
Do hard things!

Highlights

  • Pro

Block or report airscholar

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
airscholar/README.md

Hey there 👋, I'm Yusuf!

LinkedIn Medium Stackoverflow Dev.to

👨🏻‍🎓 Academic experience:

📝 I regularly write articles:

  • On Medium about programing, data science and AI
  • On HackerNoon about programing, data science and AI
  • On Dev.to about programing, data science and AI

📺 Latest Youtube Videos

Realtime Logs Processing with Apache Airflow, Kafka and Elasticsearch - PART 1 Realtime Log Processing with #ApacheAirflow, #ApacheKafka and #Elasticsearch Indexing High Throughout Systems on Elasticsearch End to End Monitoring with Prometheus, Grafana, Apache Kafka and Spark - A Data Engineering Project End to End Monitoring of High Performance Systems - A Data Engineering Project PART 1 1.2 Billion Records Per Hour High Performance Kafka and Spark - End to End Data Engineering Project The 1.2Billion Records Architecture Per Hour with #ApacheKafka and #ApacheSpark Building a High Performance Real-Time Analytics Database - End to End Data Engineering Project #Apache Frameworks for #DataEngineering - Building High Performance Realtime Systems

📚 Latest Medium Stories

airscholar

Pinned Loading

  1. e2e-data-engineering Public

    An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All comp…

    Python 230 108

  2. RedditDataEngineering Public

    This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data warehouse. The pipeline leverages a combination of tools and serv…

    Python 117 58

  3. changecapture-e2e Public

    This project shows how to capture changes from postgres database and stream them into kafka

    Python 35 18

  4. RealtimeStreamingEngineering Public

    This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenAI LLM, Kafka and Elasticsearch. It covers each stage from da…

    Python 32 24

  5. FootballDataEngineering Public

    An end-to-end data engineering pipeline that fetches data from Wikipedia, cleans and transforms it with Apache Airflow and saves it on Azure Data Lake. Other processing takes place on Azure Data Fa…

    Python 19 19

  6. ApacheFlink-SalesAnalytics Public

    This repository contains an end-to-end data engineering project using Apache Flink, focused on performing sales analytics. The project demonstrates how to ingest, process, and analyze sales data, s…

    Java 11 7

1,127 contributions in the last year

Contribution Graph
Day of Week February March April May June July August September October November December January February
Sunday
Monday
Tuesday
Wednesday
Thursday
Friday
Saturday
Less
No contributions.
Low contributions.
Medium-low contributions.
Medium-high contributions.
High contributions.
More

Activity overview

Loading A graph representing airscholar's contributions from February 18, 2024 to February 19, 2025. The contributions are 91% commits, 9% pull requests, 0% code review, 0% issues.   Code review   Issues 9% Pull requests 91% Commits

Contribution activity

February 2025

Created 1 commit in 1 repository
Reviewed 1 pull request in 1 repository
airscholar/e2e-data-engineering 1 pull request
13 contributions in private repositories Feb 3 – Feb 14
Loading

Seeing something unexpected? Take a look at the GitHub profile guide.