USF-Zika-Research

A collection of utility R scripts for collecting and parsing Twitter discussion about the Zika virus.

Current parsed variables include:

timestamp_ms
datetime
tweet_id
text
retweet_count
favorite_count
expanded_url
friends_count
screen_name
user_id_str
in_reply_to_screen_name
in_reply_to_user_id
rt_screen_name
rt_screen_id
full_name
followers_count
place_lat
place_lon
lat
lon
mentioned_users (";"-separated multivalued column)
mentioned_id (";"-separated multivalued column)
hashes (";"-separated multivalued column)
parsed_media_type (";"-separated multivalued column)
parsed_media_url (";"-separated multivalued column)
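A cell from one of the multivalued columns (for example hashes) can be split back into individual values with standard shell tools. The following is a minimal sketch, assuming that the values themselves never contain a semicolon; split_multivalue is a hypothetical helper name and is not part of this repository, and the sample values are illustrative only.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of the repository): split one
# ";"-separated cell into one value per line, dropping empty entries.
split_multivalue() {
  printf '%s\n' "$1" | tr ';' '\n' | sed '/^$/d'
}

# Illustrative usage with made-up values:
#   split_multivalue 'zika;health;cdc'
```

Piping the result through `sort | uniq -c` would give a quick per-value count, for example when tallying hashtags.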

Parse workflow overview

Collect tweets with Python (tweepy) or R (streamR). The key requirement is that the collector produces streaming JSON output.

Run parse program as follows:

Rscript parse_twitter_zika.R ./zika_hagen_dat/data_0825.txt sample_output.csv 1

(The final argument controls the CSV header: 1 writes a header, 2 writes no header. The no-header mode is used for bulk output.)

parse_bulk_json.sh collates the 12 JSON files into one large CSV file, suppressing the CSV header when writing the output of the later files. For now, the folder must be changed manually when re-using this script; this could be made cleaner.
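The collation pattern described here (header on the first file, no header afterwards) can be sketched as follows. This is not the repository's actual parse_bulk_json.sh. It assumes that each parser run writes a fresh CSV to the path given as its second argument, and the names collate_tweets and PARSER are hypothetical.

```shell
#!/usr/bin/env bash
# Sketch only: PARSER is invoked as
#   $PARSER <input.json> <output.csv> <header_flag>
# where header_flag is 1 (write header) or 2 (no header).
PARSER=${PARSER:-"Rscript parse_twitter_zika.R"}

# collate_tweets <combined.csv> <input1> [<input2> ...]
collate_tweets() {
  local out_csv="$1"
  shift
  local header_flag=1          # only the first file writes the CSV header
  local tmp json
  tmp=$(mktemp) || return 1
  : > "$out_csv"               # start from an empty combined file
  for json in "$@"; do
    # PARSER is left unquoted on purpose so that it splits into
    # the command and its script argument.
    $PARSER "$json" "$tmp" "$header_flag" || { rm -f "$tmp"; return 1; }
    cat "$tmp" >> "$out_csv"
    header_flag=2              # later files omit the header
  done
  rm -f "$tmp"
}

# Illustrative usage (the glob is an assumption about the file layout):
#   collate_tweets all_tweets.csv ./zika_hagen_dat/*.txt
```

Taking the input files as arguments avoids the hard-coded folder, so the same function can be pointed at a different data directory without editing the script.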

