Fake News Detection POC on large datasets using distributed computing

The project focuses on detecting fake news in online news articles. The machine learning model is trained on the LIAR dataset, which contains 12.8K human-labeled short statements collected from the fact-checking website PolitiFact.
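As a rough illustration of what working with LIAR records can look like, the sketch below parses tab-separated rows and derives a binary target. It assumes the 14-column, header-less TSV layout of the public LIAR release; the column names, the `read_liar` helper, the `FAKE_LABELS` grouping, and the sample rows are all hypothetical and are not taken from this project.

```python
import csv
import io

# Assumed column layout of the public LIAR TSV files (no header row).
LIAR_COLUMNS = [
    "id", "label", "statement", "subject", "speaker", "job_title",
    "state", "party", "barely_true_count", "false_count",
    "half_true_count", "mostly_true_count", "pants_on_fire_count", "context",
]

# Hypothetical grouping of the six LIAR labels into "fake" vs. "not fake".
# The project may group them differently.
FAKE_LABELS = {"pants-fire", "false", "barely-true"}


def read_liar(handle):
    """Parse LIAR-style TSV rows into dicts and add a boolean `is_fake` field."""
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    rows = []
    for fields in reader:
        if len(fields) != len(LIAR_COLUMNS):
            continue  # skip rows that do not match the assumed layout
        row = dict(zip(LIAR_COLUMNS, fields))
        row["is_fake"] = row["label"] in FAKE_LABELS
        rows.append(row)
    return rows


# Made-up rows in the assumed layout, for demonstration only.
SAMPLE = (
    "1.json\tfalse\tExample claim A.\ttaxes\tspeaker-a\tSenator\tOhio\tnone"
    "\t0\t1\t0\t0\t0\ta speech\n"
    "2.json\tmostly-true\tExample claim B.\thealth\tspeaker-b\tGovernor\tTexas\tnone"
    "\t0\t0\t1\t2\t0\tan interview\n"
)

rows = read_liar(io.StringIO(SAMPLE))
print([(r["label"], r["is_fake"]) for r in rows])
```

To read a real split, the same function could be given an open file handle instead of the in-memory sample. Note that this sketch silently skips rows that do not have exactly 14 fields.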

The implementation uses Spark, pandas, and Dask for distributed processing.
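The idea behind this kind of processing can be sketched on a single machine with pandas alone: split the input into partitions, aggregate each partition independently, then combine the partial results. The example below is only an illustration under stated assumptions, not the project's pipeline; the two-column layout, the `count_labels_in_chunks` helper, and the sample data are hypothetical.

```python
import io

import pandas as pd

# Made-up, simplified two-column data (label, statement) for demonstration.
SAMPLE_TSV = (
    "false\tClaim one.\n"
    "true\tClaim two.\n"
    "false\tClaim three.\n"
    "half-true\tClaim four.\n"
    "false\tClaim five.\n"
)


def count_labels_in_chunks(handle, chunksize=2):
    """Count labels per chunk, then merge the partial counts."""
    partials = []
    chunks = pd.read_csv(
        handle,
        sep="\t",
        header=None,
        names=["label", "statement"],
        dtype=str,              # keep "true"/"false" as text, not booleans
        keep_default_na=False,  # do not turn strings such as "NA" into NaN
        chunksize=chunksize,    # yields DataFrames of at most `chunksize` rows
    )
    for chunk in chunks:
        # Per-partition step: each chunk is aggregated independently.
        partials.append(chunk["label"].value_counts())
    # Combine step: sum the partial counts that share the same label.
    return pd.concat(partials).groupby(level=0).sum()


label_counts = count_labels_in_chunks(io.StringIO(SAMPLE_TSV))
print(label_counts.to_dict())
```

Here the chunks are processed one after another. Dask and Spark follow the same split, aggregate, and combine pattern, but schedule the per-partition work across cores or machines.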