Fake News Detection POC on large datasets using distributed computing

The project focuses on detection of fake news from news articles online, the Machine Learning model is trained on the LIAR dataset which has been collated from a fact-checking website PolitiFact. It has 12.8K human labeled short statements collected from PolitiFact.

Implementation is done using spark, pandas and dask for distributed processing.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
README.md		README.md
fak_nws_ntbk.ipynb		fak_nws_ntbk.ipynb
liar_dataset.zip		liar_dataset.zip

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fake News Detection POC on large datasets using distributed computing

About

Releases

Packages

Languages

ishwarinalgirkar/fake-news

Folders and files

Latest commit

History

Repository files navigation

Fake News Detection POC on large datasets using distributed computing

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages