COVID vCues Dataset

This is a dataset research project developed to assist Professor Ankur Chattopadhyay's COVID vCues research by creating a multi-modal dataset containing images sourced from reliable and unreliable sources on COVID-19. This dataset will be used to train multiple AI models: reliable vs unreliable images, and identify memes, ads, claims, fact-checks, or logos.

To-Do

Sources

The dataset based on CoAID: COVID-19 Healthcare Misinformation Dataset, ReCovery, and MM-Covid.

Citations:
@misc {
cui2020coaid,
title={CoAID: COVID-19 Healthcare Misinformation Dataset},
author={Limeng Cui and Dongwon Lee},
year={2020},
eprint={2006.00885},
archivePrefix={arXiv},
primaryClass={cs.SI}
}
https://github.com/apurvamulay/ReCOVery/tree/master
https://github.com/bigheiniu/MM-COVID/blob/main/README.md

Usage

This dataset is still underdevelopment and not yet ready for use.

Authors

Sarah Ogden
Shreetika Poudel

Helpful Tutorials

Scrapy in 30 minutes (start here.): https://www.youtube.com/watch?v=r7pMqU2kYqc
CNN in Python: https://medium.com/metakratos-studio/python-based-ai-powered-by-tensorflow-and-keras-52140b1495e3
Twikit Example: https://github.com/d60/twikit/blob/main/examples/download_tweet_media.py
Remove duplicate images: https://pyimagesearch.com/2020/04/20/detect-and-remove-duplicate-images-from-a-dataset-for-deep-learning/
SVM model for image classification: https://medium.com/analytics-vidhya/image-classification-using-machine-learning-support-vector-machine-svm-dc7a0ec92e01
Fastdup: https://medium.com/visual-layer/fastdup-a-powerful-tool-to-manage-clean-curate-visual-data-at-scale-on-your-cpu-for-free-12e850946ead

Name		Name	Last commit message	Last commit date
Latest commit History 119 Commits
CoAID		CoAID
covid_crawler		covid_crawler
data_cleaning		data_cleaning
image_categories/categories		image_categories/categories
large_news_model		large_news_model
model1		model1
reliable_MM		reliable_MM
reliable_coaid		reliable_coaid
reliable_recovery		reliable_recovery
svm_model		svm_model
unreliable_MM		unreliable_MM
unreliable_coaid		unreliable_coaid
unreliable_recovery		unreliable_recovery
urls_MM		urls_MM
urls_coaid		urls_coaid
urls_recovery		urls_recovery
.DS_Store		.DS_Store
README.md		README.md
parser_coaid.py		parser_coaid.py
parser_mm.py		parser_mm.py
parser_recovery.py		parser_recovery.py
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

COVID vCues Dataset

To-Do

Sources

Usage

Authors

Helpful Tutorials

About

Releases

Packages

Contributors 2

Languages

sarahogden2017/Covid-vCues

Folders and files

Latest commit

History

Repository files navigation

COVID vCues Dataset

To-Do

Sources

Usage

Authors

Helpful Tutorials

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages