The project focuses on detection of fake news from news articles online, the Machine Learning model is trained on the LIAR dataset which has been collated from a fact-checking website PolitiFact. It has 12.8K human labeled short statements collected from PolitiFact.
Implementation is done using spark, pandas and dask for distributed processing.