Skip to content

The mini project for the course Database Technologies. The task is to take in data via a pipeline built using spark-streaming and kafka, and store the processed data into a SQLite database for further manipulation

License

Notifications You must be signed in to change notification settings

Shreyas-s14/McFlAi-OTPMS

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

McFlAi-OTPMS

The mini project for the course Database Technologies.

Technologies / Frameworks to be exercised:

    1. Apache Spark Streaming, Spark SQL [execute multiple workloads e.g., Spark SQL queries to carry out action, transformation or aggregation on the input data]
    1. Apache Kafka Streaming [have to publish/subscribe the results or produce/consume choosing >=3 topics].
    1. Store the data in a DBMS of your choice like postgres,MySQL ( Ofcourse we use postgres, myswl is a dumpster fire)
    1. Make use of any other tool/s as required like Zookeepr.

Run the same queries in a batch mode on the same/whole data from the database (#3

above).

Compare the above results/accuracy/performance with the streaming mode of

execution.

Name is a pun on Marty McFly . McFlAirlines On Time Performance Monitoring System

About

The mini project for the course Database Technologies. The task is to take in data via a pipeline built using spark-streaming and kafka, and store the processed data into a SQLite database for further manipulation

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published