diff --git a/congress-age/README.md b/congress-age/README.md index be207968..ad5b3930 100644 --- a/congress-age/README.md +++ b/congress-age/README.md @@ -1,3 +1,7 @@ +# Congress Age + +This folder contains the data behind the story [Both Republicans And Democrats Have an Age Problem](https://fivethirtyeight.com/features/both-republicans-and-democrats-have-an-age-problem/) + `congress-terms.csv` has an entry for every member of congress who served at any point during a particular congress between January 1947 and Februrary 2014. House membership data is from the [@unitedstates project](http://theunitedstates.io/), with Congress meeting numbers added using code from [GovTrack](https://www.govtrack.us/developers/api): diff --git a/congress-generic-ballot/README.md b/congress-generic-ballot/README.md index 3364db98..9386107a 100644 --- a/congress-generic-ballot/README.md +++ b/congress-generic-ballot/README.md @@ -5,4 +5,4 @@ files: --- # Congress Generic Ballot Polls -This contains the raw data behind "[Are Democrats Winning The Race For Congress?](https://projects.fivethirtyeight.com/congress-generic-ballot-polls/)" +This readme contains links to the data behind [Are Democrats Winning The Race For Congress?](https://projects.fivethirtyeight.com/congress-generic-ballot-polls/). For the latest version of this updating data set, visit the links at the top of this README. diff --git a/congress-resignations/README.md b/congress-resignations/README.md index 938d38a9..95cb729b 100644 --- a/congress-resignations/README.md +++ b/congress-resignations/README.md @@ -1,6 +1,6 @@ # Congressional Resignations -Data behind the story [We’ve Never Seen Congressional Resignations Like This Before](https://fivethirtyeight.com/features/more-people-are-resigning-from-congress-than-at-any-time-in-recent-history/). +This folder contains data behind the story [We’ve Never Seen Congressional Resignations Like This Before](https://fivethirtyeight.com/features/more-people-are-resigning-from-congress-than-at-any-time-in-recent-history/). `congressional_resignations.csv` contains information about the 615 members of Congress who resigned or were removed from office from March 4, 1901 (the first day of the 57th Congress) through January 15, 2018, including the resigning member’s party and district, the date they resigned, the reason for their resignation and the source of the information about their resignation. diff --git a/cousin-marriage/README.md b/cousin-marriage/README.md index 4147b6e7..e3d192d0 100644 --- a/cousin-marriage/README.md +++ b/cousin-marriage/README.md @@ -1,6 +1,6 @@ -### Cousin Marriage Data +# Cousin Marriage -The raw data behind the story [Dear Mona: How Many Americans Are Married To Their Cousins?] +This folder contains data behind the story [Dear Mona: How Many Americans Are Married To Their Cousins?](https://fivethirtyeight.com/features/how-many-americans-are-married-to-their-cousins/). Header | Definition ---|--------- diff --git a/daily-show-guests/README.md b/daily-show-guests/README.md index 21bbff93..f3bde709 100644 --- a/daily-show-guests/README.md +++ b/daily-show-guests/README.md @@ -1,6 +1,6 @@ -### Daily Show Guests +# Daily Show Guests -The raw data behind the story [Every Guest Jon Stewart Ever Had On ‘The Daily Show’](http://fivethirtyeight.com/datalab/every-guest-jon-stewart-ever-had-on-the-daily-show/) +This folder contains data behind the story [Every Guest Jon Stewart Ever Had On ‘The Daily Show’](http://fivethirtyeight.com/datalab/every-guest-jon-stewart-ever-had-on-the-daily-show/). Header | Definition ---|--------- @@ -8,6 +8,6 @@ Header | Definition `GoogleKnowlege_Occupation` | Their occupation or office, according to Google's Knowledge Graph or, if they're not in there, how Stewart introduced them on the program. `Show` | Air date of episode. Not unique, as some shows had more than one guest `Group` | A larger group designation for the occupation. For instance, us senators, us presidents, and former presidents are all under "politicians" -`Raw_Guest_List` | The person or list of people who appeared on the show, according to Wikipedia. The GoogleKnowlege_Occupation only refers to one of them in a given row. +`Raw_Guest_List` | The person or list of people who appeared on the show, according to Wikipedia. The GoogleKnowlege_Occupation only refers to one of them in a given row. Source: Google Knowlege Graph, The Daily Show clip library, Wikipedia. diff --git a/democratic-bench/README.md b/democratic-bench/README.md index b9d0a7a5..ecd4bda2 100644 --- a/democratic-bench/README.md +++ b/democratic-bench/README.md @@ -1,4 +1,6 @@ -### Democratic bench +# Democratic bench + +This folder contains data behind the story [Some Democrats Who Could Step Up If Hillary Isn’t Ready For Hillary](https://fivethirtyeight.com/features/some-democrats-who-could-step-up-if-hillary-isnt-ready-for-hillary/). Header | Definition ---|--------- diff --git a/drug-use-by-age/README.md b/drug-use-by-age/README.md index a677971f..fc6b4129 100644 --- a/drug-use-by-age/README.md +++ b/drug-use-by-age/README.md @@ -1,8 +1,8 @@ -### Drug Use By Age +# Drug Use By Age -This directory contains the data behind the story [How Baby Boomers Get High](http://fivethirtyeight.com/datalab/how-baby-boomers-get-high/). It covers 13 drugs across 17 age groups. +This directory contains data behind the story [How Baby Boomers Get High](http://fivethirtyeight.com/datalab/how-baby-boomers-get-high/). It covers 13 drugs across 17 age groups. -Source: [National Survey on Drug Use and Health from the Substance Abuse and Mental Health Data Archive](http://www.icpsr.umich.edu/icpsrweb/content/SAMHDA/index.html). +Source: [National Survey on Drug Use and Health from the Substance Abuse and Mental Health Data Archive](http://www.icpsr.umich.edu/icpsrweb/content/SAMHDA/index.html). Header | Definition ---|--------- diff --git a/early-senate-polls/README.md b/early-senate-polls/README.md new file mode 100644 index 00000000..3c551580 --- /dev/null +++ b/early-senate-polls/README.md @@ -0,0 +1,3 @@ +# Early Senate Polls + +This folder contains data behind the story [Early Senate Polls Have Plenty to Tell Us About November](https://fivethirtyeight.com/features/early-senate-polls-have-plenty-to-tell-us-about-november/). \ No newline at end of file diff --git a/elo-blatter/README.md b/elo-blatter/README.md index 7a20e98f..79630e7a 100644 --- a/elo-blatter/README.md +++ b/elo-blatter/README.md @@ -1,6 +1,6 @@ -### FIFA teams under Blatter +# FIFA teams under Blatter -The raw data behind the story [Blatter’s Reign At FIFA Hasn’t Helped Soccer’s Poor](http://fivethirtyeight.com/features/blatters-reign-at-fifa-hasnt-helped-soccers-poor/) +This folder contains data behind the story [Blatter’s Reign At FIFA Hasn’t Helped Soccer’s Poor](http://fivethirtyeight.com/features/blatters-reign-at-fifa-hasnt-helped-soccers-poor/). Header | Definition ---|--------- diff --git a/endorsements-june-30/README.md b/endorsements-june-30/README.md index db2bbaae..8e6af558 100644 --- a/endorsements-june-30/README.md +++ b/endorsements-june-30/README.md @@ -1,8 +1,8 @@ -### Endorsements through June 30 +# Endorsements through June 30 -The raw data behind the story [Pols And Polls Say The Same Thing: Jeb Bush Is A Weak Front-Runner](http://fivethirtyeight.com/features/pols-and-polls-say-the-same-thing-jeb-bush-is-a-weak-front-runner/) +This folder contains data behind the story [Pols And Polls Say The Same Thing: Jeb Bush Is A Weak Front-Runner](http://fivethirtyeight.com/features/pols-and-polls-say-the-same-thing-jeb-bush-is-a-weak-front-runner/). -This data includes something we call "endorsement points," an attempt to quantify the importance of endorsements by weighting each one according to the position held by the endorser: 10 points for each governor, 5 points for each senator and 1 point for each representative +This data includes something we call "endorsement points," an attempt to quantify the importance of endorsements by weighting each one according to the position held by the endorser: 10 points for each governor, 5 points for each senator and 1 point for each representative. Header | Definition ---|--------- diff --git a/fandango/README.md b/fandango/README.md index acc67bf8..2ed607c0 100644 --- a/fandango/README.md +++ b/fandango/README.md @@ -1,3 +1,5 @@ +# Fandango + This directory contains the data behind the story [Be Suspicious Of Online Movie Ratings, Especially Fandango’s](http://fivethirtyeight.com/features/fandango-movies-ratings/). `fandango_score_comparison.csv` contains every film that has a Rotten Tomatoes rating, a RT User rating, a Metacritic score, a Metacritic User score, and IMDb score, and at least 30 fan reviews on Fandango. The data from Fandango was pulled on Aug. 24, 2015. @@ -5,13 +7,13 @@ This directory contains the data behind the story [Be Suspicious Of Online Movie Column | Definition --- | ----------- FILM | The film in question -RottenTomatoes | The Rotten Tomatoes Tomatometer score for the film -RottenTomatoes_User | The Rotten Tomatoes user score for the film +RottenTomatoes | The Rotten Tomatoes Tomatometer score for the film +RottenTomatoes_User | The Rotten Tomatoes user score for the film Metacritic | The Metacritic critic score for the film Metacritic_User | The Metacritic user score for the film IMDB | The IMDb user score for the film Fandango_Stars | The number of stars the film had on its Fandango movie page -Fandango_Ratingvalue | The Fandango ratingValue for the film, as pulled from the HTML of each page. This is the actual average score the movie obtained. +Fandango_Ratingvalue | The Fandango ratingValue for the film, as pulled from the HTML of each page. This is the actual average score the movie obtained. RT_norm | The Rotten Tomatoes Tomatometer score for the film , normalized to a 0 to 5 point system RT_user_norm | The Rotten Tomatoes user score for the film , normalized to a 0 to 5 point system Metacritic_norm | The Metacritic critic score for the film, normalized to a 0 to 5 point system @@ -34,5 +36,5 @@ Column | Definiton --- | --------- FILM | The movie STARS | Number of stars presented on Fandango.com -RATING | The Fandango ratingValue for the film, as pulled from the HTML of each page. This is the actual average score the movie obtained. -VOTES | number of people who had reviewed the film at the time we pulled it. +RATING | The Fandango ratingValue for the film, as pulled from the HTML of each page. This is the actual average score the movie obtained. +VOTES | number of people who had reviewed the film at the time we pulled it. diff --git a/fifa/README.md b/fifa/README.md index 9a8cfe6e..f595c512 100644 --- a/fifa/README.md +++ b/fifa/README.md @@ -1,4 +1,4 @@ -### FIFA +# FIFA This directory contains the data behind the story [How To Break FIFA](http://fivethirtyeight.com/features/how-to-break-fifa/). diff --git a/flying-etiquette-survey/README.md b/flying-etiquette-survey/README.md index 03bef85e..ffe78132 100644 --- a/flying-etiquette-survey/README.md +++ b/flying-etiquette-survey/README.md @@ -1,3 +1,5 @@ -### Flying Etiquette Survey +# Flying Etiquette Survey -Results of a SurveyMonkey survey commissioned by FiveThirtyEight for the story [41 Percent of Fliers Say It’s Rude To Recline Your Airplane Seat](http://fivethirtyeight.com/datalab/airplane-etiquette-recline-seat) +This folder contains data behind the story [41 Percent of Fliers Say It’s Rude To Recline Your Airplane Seat](http://fivethirtyeight.com/datalab/airplane-etiquette-recline-seat). + +`flying-etiquette.csv` contains the results of a SurveyMonkey survey commissioned by FiveThirtyEight for the story. \ No newline at end of file diff --git a/food-world-cup/README.md b/food-world-cup/README.md new file mode 100644 index 00000000..d9c215d3 --- /dev/null +++ b/food-world-cup/README.md @@ -0,0 +1,16 @@ +# Food World Cup + +This folder contains data behind the stories: +* [The FiveThirtyEight International Food Association’s 2014 World Cup](https://fivethirtyeight.com/features/the-fivethirtyeight-international-food-associations-2014-world-cup/) +* [What is Americans’ Favorite Global Cuisine?](https://fivethirtyeight.com/features/what-is-americans-favorite-global-cuisine/) + +Anwser key for the responses to the "Please rate how much you like the traditional cuisine of X:" questions. + +Value | Description +------|-------------- +5 | I love this country's traditional cuisine. I think it's one of the best in the world. +4 | I like this country's traditional cuisine. I think it's considerably above average. +3 | I'm OK with this county's traditional cuisine. I think it's about average. +2 | I dislike this country's traditional cuisine. I think it's considerably below average. +1 | I hate this country's traditional cuisine. I think it's one of the worst in the world. +N/A | I'm unfamiliar with this country's traditional cuisine. \ No newline at end of file diff --git a/food-world-cup/readme.txt b/food-world-cup/readme.txt deleted file mode 100644 index bd424188..00000000 --- a/food-world-cup/readme.txt +++ /dev/null @@ -1,8 +0,0 @@ -Anwser key for the responses to the "Please rate how much you like the traditional cuisine of X:" questions. - -5: I love this country's traditional cuisine. I think it's one of the best in the world. -4: I like this country's traditional cuisine. I think it's considerably above average. -3: I'm OK with this county's traditional cuisine. I think it's about average. -2: I dislike this country's traditional cuisine. I think it's considerably below average. -1: I hate this country's traditional cuisine. I think it's one of the worst in the world. -N/A: I'm unfamiliar with this country's traditional cuisine. \ No newline at end of file diff --git a/forecast-methodology/README.md b/forecast-methodology/README.md index f7355ab5..987ac492 100644 --- a/forecast-methodology/README.md +++ b/forecast-methodology/README.md @@ -1,6 +1,6 @@ -### Historical FiveThirtyEight Senate Forecasts +# Historical FiveThirtyEight Senate Forecasts -The data behind the story [How The FiveThirtyEight Senate Forecast Model Works](http://fivethirtyeight.com/features/how-the-fivethirtyeight-senate-forecast-model-works/) +This folder contains the data behind the story [How The FiveThirtyEight Senate Forecast Model Works](http://fivethirtyeight.com/features/how-the-fivethirtyeight-senate-forecast-model-works/). Header | Definition ---|--------- diff --git a/goose/README.md b/goose/README.md index f71ec65d..7b8f80e6 100644 --- a/goose/README.md +++ b/goose/README.md @@ -1,6 +1,6 @@ -### Goose +# Goose -The raw data behind the stories: +The data behind the stories: * [The Save Ruined Relief Pitching. The Goose Egg Can Fix It](https://fivethirtyeight.com/features/goose-egg-new-save-stat-relief-pitchers/) * [Kenley Jansen Is The Model Of A Modern Reliever](https://fivethirtyeight.com/features/kenley-jansen-is-the-model-of-a-modern-reliever/) diff --git a/hate-crimes/README.md b/hate-crimes/README.md index 415b72e4..831dc3c2 100644 --- a/hate-crimes/README.md +++ b/hate-crimes/README.md @@ -1,6 +1,6 @@ -### Hate-crimes data +# Hate Crimes -The raw data behind the story [Higher Rates Of Hate Crimes Are Tied To Income Inequality](https://fivethirtyeight.com/features/higher-rates-of-hate-crimes-are-tied-to-income-inequality/) +This folder contains data behind the story [Higher Rates Of Hate Crimes Are Tied To Income Inequality](https://fivethirtyeight.com/features/higher-rates-of-hate-crimes-are-tied-to-income-inequality/). Header | Definition ---|--------- diff --git a/hip-hop-candidate-lyrics/README.md b/hip-hop-candidate-lyrics/README.md index 0014485d..ffdd5188 100644 --- a/hip-hop-candidate-lyrics/README.md +++ b/hip-hop-candidate-lyrics/README.md @@ -1,6 +1,8 @@ -### Every mention of the 2016 primary candidates in hip-hop songs +# Hip Hop Candidate Lyrics -The raw data behind the story [ Hip-Hop Is Turning On Donald Trump](http://projects.fivethirtyeight.com/clinton-trump-hip-hop-lyrics/) +This folder contains data behind the story [ Hip-Hop Is Turning On Donald Trump](http://projects.fivethirtyeight.com/clinton-trump-hip-hop-lyrics/). + +`genius_hip_hop_lyrics.csv` contains every mention of the 2016 primary candidates in hip-hop songs. Header | Definition ---|--------- @@ -8,10 +10,9 @@ Header | Definition `song` | Song name `artist` | Artist name `sentiment` | Positive, negative or neutral -`theme` | Theme of lyric +`theme` | Theme of lyric `album_release_date` | Date of album release `line` | Lyrics `url` | Genius link - Source: [Genius](http://genius.com/) \ No newline at end of file diff --git a/historical-ncaa-forecasts/README.md b/historical-ncaa-forecasts/README.md new file mode 100644 index 00000000..acc51b0e --- /dev/null +++ b/historical-ncaa-forecasts/README.md @@ -0,0 +1,3 @@ +# NCAA Bracket + +This folder contains data behind the story [The NCAA Bracket: Checking Our Work](https://fivethirtyeight.com/datalab/the-ncaa-bracket-checking-our-work). \ No newline at end of file diff --git a/inconvenient-sequel/README.md b/inconvenient-sequel/README.md index 0c8212ac..2663f783 100644 --- a/inconvenient-sequel/README.md +++ b/inconvenient-sequel/README.md @@ -1,5 +1,5 @@ # An Inconvenient Sequel -Raw data behind the story [Al Gore’s New Movie Exposes The Big Flaw In Online Movie Ratings](https://fivethirtyeight.com/features/al-gores-new-movie-exposes-the-big-flaw-in-online-movie-ratings/) +This folder contains data behind the story [Al Gore’s New Movie Exposes The Big Flaw In Online Movie Ratings](https://fivethirtyeight.com/features/al-gores-new-movie-exposes-the-big-flaw-in-online-movie-ratings/). -Data contains [IMDb ratings](http://www.imdb.com/title/tt6322922/ratings) for the film "An Inconvenient Sequel: Truth to Power" collected daily from July 17 to August 29, 2017. +`ratings.csv` contains [IMDb ratings](http://www.imdb.com/title/tt6322922/ratings) for the film "An Inconvenient Sequel: Truth to Power" collected daily from July 17 to August 29, 2017. diff --git a/infrastructure-jobs/README.md b/infrastructure-jobs/README.md new file mode 100644 index 00000000..3895be0d --- /dev/null +++ b/infrastructure-jobs/README.md @@ -0,0 +1,3 @@ +# Infrastructure Jobs + +This folder contains data behind the story [Using Infrastructure Jobs as a Measuring Stick For State-Level Spending](https://fivethirtyeight.com/features/using-infrastructure-jobs-as-a-measuring-stick-for-state-level-spending/). \ No newline at end of file diff --git a/librarians/README.md b/librarians/README.md index ec35e640..b74f6615 100644 --- a/librarians/README.md +++ b/librarians/README.md @@ -1,3 +1,3 @@ # Librarians -The data behind the story [Where Are America’s Librarians?](https://fivethirtyeight.com/features/where-are-americas-librarians/) +This folder contains data behind the story [Where Are America’s Librarians?](https://fivethirtyeight.com/features/where-are-americas-librarians/). diff --git a/love-actually/readme.md b/love-actually/readme.md index 02406b12..990b46b6 100644 --- a/love-actually/readme.md +++ b/love-actually/readme.md @@ -1,10 +1,10 @@ -### Love Actually +# Love Actually -This directory contains the data behind the story: [The Definitive Analysis Of ‘Love Actually,’ The Greatest Christmas Movie Of Our Time](https://fivethirtyeight.com/features/the-definitive-analysis-of-love-actually-the-greatest-christmas-movie-of-our-time/) +This directory contains the data behind the story [The Definitive Analysis Of ‘Love Actually,’ The Greatest Christmas Movie Of Our Time](https://fivethirtyeight.com/features/the-definitive-analysis-of-love-actually-the-greatest-christmas-movie-of-our-time/). -There are two data files: - * `love_actually_appearances.csv` - A table of the central actors in "Love Actually" and which scenes they appear in - * `love_actually_adjacencies.csv` - The adjacency matrix of which actors appear in the same scene together +`love_actually_appearances.csv` contains a table of the central actors in "Love Actually" and which scenes they appear in. + +`love_actually_adjacencies.csv` contains the adjacency matrix of which actors appear in the same scene together. You'll notice there are a lot of “Love Actually” actors who we didn’t track in the data. That’s because they rarely cross storylines. When they do, it’s in the company of the actor who we *did* include, the linchpin of that storyline. diff --git a/mad-men/README.md b/mad-men/README.md index 3d487519..d2e3e4eb 100644 --- a/mad-men/README.md +++ b/mad-men/README.md @@ -1,4 +1,4 @@ -### Mad Men +# Mad Men This directory contains the data behind the story [‘Mad Men’ Is Ending. What’s Next For The Cast?](http://fivethirtyeight.com/datalab/mad-men-is-ending-whats-next-for-the-cast/). diff --git a/male-flight-attendants/README.md b/male-flight-attendants/README.md index d38f31f4..fca403cf 100644 --- a/male-flight-attendants/README.md +++ b/male-flight-attendants/README.md @@ -1,12 +1,8 @@ -### Male flight attendants +# Male Flight Attendants -This repo contains the data from the article on the gender divide in various U.S. occupations +This folder contains the data behind the story [Dear Mona, How Many Flight Attendants Are Men?](http://fivethirtyeight.com/datalab/dear-mona-how-many-flight-attendants-are-men/). -[Dear Mona, How Many Flight Attendants Are Men?](http://fivethirtyeight.com/datalab/dear-mona-how-many-flight-attendants-are-men/) - -`male-flight-attendants.tsv`: - -The tab-separated text file contains the percentage of U.S. employees that are male in 320 different job categories. +`male-flight-attendants.tsv` contains the percentage of U.S. employees that are male in 320 different job categories. Source: [IPUMS](https://usa.ipums.org/usa/), 2012 diff --git a/march-madness-predictions-2015/README.md b/march-madness-predictions-2015/README.md index c6fe1df8..3173167c 100644 --- a/march-madness-predictions-2015/README.md +++ b/march-madness-predictions-2015/README.md @@ -1,4 +1,5 @@ -March Madness Predictions 2015 -============================== +# March Madness Predictions -Data files for [FiveThirtyEight's 2015 March Madness Predictions](http://fivethirtyeight.com/interactives/march-madness-predictions-2015/), updated each time we calculate new odds. +This folder contains data behind the [2015 March Madness Predictions](http://fivethirtyeight.com/interactives/march-madness-predictions-2015/). + +Data was updated each time we calculate new odds. diff --git a/march-madness-predictions/README.md b/march-madness-predictions/README.md index 2aca5e55..aa4c34eb 100644 --- a/march-madness-predictions/README.md +++ b/march-madness-predictions/README.md @@ -1 +1,3 @@ -http://fivethirtyeight.com/interactives/march-madness-predictions/ \ No newline at end of file +# March Madness Predictions + +This folder contains data behind the [2014 NCAA Tournament Predictions](http://fivethirtyeight.com/interactives/march-madness-predictions/). \ No newline at end of file diff --git a/marriage/README.md b/marriage/README.md index ce8f28ab..27e36e54 100644 --- a/marriage/README.md +++ b/marriage/README.md @@ -1,6 +1,10 @@ -These files contain data used in FiveThirtyEight's story on marriage trends.File names are self-explanatory. Source for all data is Decennial Census (years 1960 to 2000) and American Community Survey (years 2001-2012), via IPUMS USA. +# Marriage -Except in the divorce file, figures represent share of the relevant population that has never been married (MARST == 6 in the IPUMS data). Note that in the story, charts generally show the share that have ever been married, which is simply 1 - n. In the divorce file, figures are share of the relevant population that is currently divorced, conditional on having ever been married. +This folder contains data behind the story [Marriage Isn’t Dead — Yet](http://fivethirtyeight.com/features/marriage-isnt-dead-yet/). + +Source for all data is Decennial Census (years 1960 to 2000) and American Community Survey (years 2001-2012), via [IPUMS USA](https://usa.ipums.org/usa/cite.shtml). + +Except in the divorce file, figures represent share of the relevant population that has never been married (MARST == 6 in the IPUMS data). Note that in the story, charts generally show the share that have *ever* been married, which is simply 1 - n. In the divorce file, figures are share of the relevant population that is *currently* divorced, conditional on having ever been married. Variable names are as follows. Number in variable names are age ranges, so `all_2534` is the marriage rate for everyone ages 25 to 34. diff --git a/mayweather-mcgregor/README.md b/mayweather-mcgregor/README.md index 8844a6ea..b8f44990 100644 --- a/mayweather-mcgregor/README.md +++ b/mayweather-mcgregor/README.md @@ -1,6 +1,5 @@ -# Mayweather Vs McGregor +# Mayweather vs McGregor -Raw data behind the story [The Mayweather-McGregor Fight As Told Through Emojis -](https://fivethirtyeight.com/?post_type=fte_features&p=161615) +This folder contains data behind the story [The Mayweather-McGregor Fight As Told Through Emojis](https://fivethirtyeight.com/?post_type=fte_features&p=161615). -This data contains 12,118 tweets that contain one or more emojis and match one or more of the following hashtags: #MayMac, #MayweatherMcGregor, #MayweatherVMcGregor, #MayweatherVsMcGregor, #McGregor and #Mayweather. Data was collected on August 27, 2017 between 12:05 a.m. and 1:15 a.m. EDT using the Twitter streaming API. \ No newline at end of file +`tweets.csv` contains 12,118 tweets that contain one or more emojis and match one or more of the following hashtags: #MayMac, #MayweatherMcGregor, #MayweatherVMcGregor, #MayweatherVsMcGregor, #McGregor and #Mayweather. Data was collected on August 27, 2017 between 12:05 a.m. and 1:15 a.m. EDT using the Twitter streaming API. \ No newline at end of file diff --git a/mlb-allstar-teams/README.md b/mlb-allstar-teams/README.md index debe0fb2..4887e10c 100644 --- a/mlb-allstar-teams/README.md +++ b/mlb-allstar-teams/README.md @@ -1,6 +1,10 @@ +# MLB All-Star Teams + +This folder contains data behind the story [The Best MLB All-Star Teams Ever](http://fivethirtyeight.com/features/the-best-mlb-all-star-teams-ever/). + Estimates of most talented MLB All-Star teams, 1933-2015 -Team talent estimates: +`allstar_team_talent.csv` contains team talent estimates with the following headers: Header | Definition ---|--------- @@ -21,7 +25,7 @@ Header | Definition `no_1_player` | Best player according to combo of actual PA/IP and talent `no_2_player` | 2nd-best player according to combo of actual PA/IP and talent -Player talent estimates: +`allstar_player_talent.csv` contains team player estimates with the following headers: Header | Definition ---|--------- @@ -41,4 +45,3 @@ Header | Definition `PITper9innASG` | Expected pitching runs added above average (from talent) based on IP in ASG, scaled to a 9-inning game `TOTper9innASG` | Expected runs added above average (from talent) based on PA/IP in ASG, scaled to a 9-inning game -http://fivethirtyeight.com/features/the-best-mlb-all-star-teams-ever/ diff --git a/mlb-elo/README.md b/mlb-elo/README.md index cd82f742..e35214c4 100644 --- a/mlb-elo/README.md +++ b/mlb-elo/README.md @@ -4,6 +4,6 @@ files: --- # MLB Elo -This contains the raw data behind [The Complete History Of MLB](https://projects.fivethirtyeight.com/complete-history-of-mlb/) and our [MLB Predictions](https://projects.fivethirtyeight.com/2017-mlb-predictions/). +This readme contains links to the data behind [The Complete History Of MLB](https://projects.fivethirtyeight.com/complete-history-of-mlb/) and our [MLB Predictions](https://projects.fivethirtyeight.com/2017-mlb-predictions/). For the latest version of this updating data set, visit the links at the top of this README. -* `mlb_elo.csv` - Game-by-game Elo ratings and forecasts back to 1871. +`mlb_elo.csv` contains game-by-game Elo ratings and forecasts back to 1871. diff --git a/most-common-name/README.md b/most-common-name/README.md index d08960d1..11309ea2 100644 --- a/most-common-name/README.md +++ b/most-common-name/README.md @@ -1,8 +1,6 @@ -### Most Common Name +# Most Common Name -This directory contains the code and data behind the story: - -[Dear Mona, What’s The Most Common Name In America?](http://fivethirtyeight.com/features/whats-the-most-common-name-in-america/) +This directory contains the code and data behind the story [Dear Mona, What’s The Most Common Name In America?](http://fivethirtyeight.com/features/whats-the-most-common-name-in-america/). The main script file is `most-common-name.R` diff --git a/murder_2016/README.md b/murder_2016/README.md index d4903a4f..b7530cfe 100644 --- a/murder_2016/README.md +++ b/murder_2016/README.md @@ -1,8 +1,6 @@ -### 2016 murder data +# 2016 Murder Data -The raw data behind the story [A Handful Of Cities Are Driving 2016's Rise In Murder](http://fivethirtyeight.com/features/a-handful-of-cities-are-driving-2016s-rise-in-murders/) - -There are two files: +This folder contains data behind the story [A Handful Of Cities Are Driving 2016's Rise In Murder](http://fivethirtyeight.com/features/a-handful-of-cities-are-driving-2016s-rise-in-murders/). `murder_2016_prelim.csv` contains preliminary 2016 murder counts for 79 large U.S. cities. 2015 figures are counts through the same data a year ago. Sources are listed in the file. diff --git a/nba-carmelo/README.md b/nba-carmelo/README.md index 21bae8bf..6956632a 100644 --- a/nba-carmelo/README.md +++ b/nba-carmelo/README.md @@ -4,6 +4,6 @@ files: --- # NBA Elo -This contains the raw data behind [The Complete History Of The NBA](https://projects.fivethirtyeight.com/complete-history-of-the-nba/) and our [NBA Predictions](https://projects.fivethirtyeight.com/2018-nba-predictions/). +This contains the raw data behind [The Complete History Of The NBA](https://projects.fivethirtyeight.com/complete-history-of-the-nba/) and our [NBA Predictions](https://projects.fivethirtyeight.com/2018-nba-predictions/). For the latest version of this updating data set, visit the links at the top of this README. -* `nba_elo.csv` - Game-by-game Elo ratings and forecasts back to 1946. +* `nba_elo.csv` contains game-by-game Elo ratings and forecasts back to 1946. diff --git a/nba-draft-2015/README.md b/nba-draft-2015/README.md index 97b8c933..007a0a77 100644 --- a/nba-draft-2015/README.md +++ b/nba-draft-2015/README.md @@ -1,4 +1,8 @@ -Historical results of NBA draft projection model, 2001-2015. +# NBA Draft 2015 + +This folder contains data behind the story [Projecting The Top 50 Players In The 2015 NBA Draft Class](http://fivethirtyeight.com/features/projecting-the-top-50-players-in-the-2015-nba-draft-class/). + +`historical_projections.csv` contains historical results of the NBA draft projection model, 2001-2015. Header | Definition ---|--------- @@ -11,5 +15,3 @@ Header | Definition `Starter` | Probability of becoming a starting-caliber player (10 per draft, SPM >= +0.5) `Role Player` | Probability of becoming a role player (25 per draft, SPM >= -1.4) `Bust` | Probability of becoming a bust (everyone else, SPM < -1.4) - -http://fivethirtyeight.com/features/projecting-the-top-50-players-in-the-2015-nba-draft-class/ diff --git a/nba-elo/README.md b/nba-elo/README.md index 0a3473bd..075c7932 100644 --- a/nba-elo/README.md +++ b/nba-elo/README.md @@ -1,4 +1,4 @@ -### Historical NBA Elo +# Historical NBA Elo This directory contains the data behind the [Complete History Of The NBA](http://fivethirtyeight.com/interactives/the-complete-history-of-every-nba-team-by-elo) interactive. Data updated periodically. Game information is from [Basketball-Reference.com](http://www.basketball-reference.com/). diff --git a/nba-tattoos/README.md b/nba-tattoos/README.md new file mode 100644 index 00000000..0a94aca0 --- /dev/null +++ b/nba-tattoos/README.md @@ -0,0 +1,3 @@ +# NBA Tattoos + +This folder contains data behind the story [What Ethan Swan Learned From Tracking Every Tattoo in the NBA](https://fivethirtyeight.com/features/what-ethan-swan-learned-from-tracking-every-tattoo-in-the-nba/) \ No newline at end of file diff --git a/nba-winprobs/readme.md b/nba-winprobs/readme.md index 55f475cf..c845242b 100644 --- a/nba-winprobs/readme.md +++ b/nba-winprobs/readme.md @@ -1,10 +1,6 @@ -### NBA Win Probabilities +# NBA Win Probabilities -This directory contains the data behind the story: +This directory contains the data behind the story [Every NBA Team’s Chance Of Winning In Every Minute Across Every Game](https://fivethirtyeight.com/features/every-nba-teams-chance-of-winning-in-every-minute-across-every-game/). -[Every NBA Team’s Chance Of Winning In Every Minute Across Every Game](https://fivethirtyeight.com/features/every-nba-teams-chance-of-winning-in-every-minute-across-every-game/) - -There is one data file: - - * `nba.tsv` - The 2014-15 NBA season win probabilities for each team over the course of a game, as of February 18, 2015 +`nba.tsv` contains the 2014-15 NBA season win probabilities for each team over the course of a game, as of February 18, 2015. diff --git a/next-bechdel/README.md b/next-bechdel/README.md index 59379983..5f1d7539 100644 --- a/next-bechdel/README.md +++ b/next-bechdel/README.md @@ -1,32 +1,31 @@ -# The Next Bechdel Test -Data for [The Next Bechdel Test](https://projects.fivethirtyeight.com/next-bechdel/) story. +# The Next Bechdel Test -## Data included +This folder contains data behind the story [The Next Bechdel Test](https://projects.fivethirtyeight.com/next-bechdel/). -1. `nextBechal_allTests.csv` powers the graphics on the page, and shows the high-level breakdown of which movies passed and failed - - Each row is one of the 50 top-grossing movies from 2016 - - Each column is one of the tests. A `0` means the movie failed that test, a `1` means it passed. +`nextBechal_allTests.csv` and shows the high-level breakdown of which movies passed and failed + - Each row is one of the 50 top-grossing movies from 2016. + - Each column is one of the tests. A `0` means the movie failed that test, a `1` means it passed. -2. `nextBechal_castGender.csv` Estimated gender for the entire cast for every movie, including whether a role was supporting or main. Data obtained from [The Numbers](http://the-numbers.com) - -Variable | Definition ----|--------- -`MOVIE` | Title of the film -`ACTOR` | Full name of the actor -`CHARACTER` | All characters played by the actor in that movie -`TYPE` | Leading, Supporting, Cameo or Lead Ensemble Member -`BILLING` | Billing number -`GENDER` | Estimated gender of the actor +`nextBechal_castGender.csv` contains the estimated gender for the entire cast for every movie, including whether a role was supporting or main. Data was obtained from [The Numbers](http://the-numbers.com) + Variable | Definition + ---|--------- + `MOVIE` | Title of the film + `ACTOR` | Full name of the actor + `CHARACTER` | All characters played by the actor in that movie + `TYPE` | Leading, Supporting, Cameo or Lead Ensemble Member + `BILLING` | Billing number + `GENDER` | Estimated gender of the actor -3. `nextBechal_crewGender.csv` crew for every movie, by probablity that a give first name is male. - -Variable | Definition ----|--------- -`MOVIE` | Title of the film -`DEPARTMENT` | Full name of the actor -`FULL_NAME` | Actor's first and last name -`FIRST_NAME` | Just first name of actor -`IMDB` | Actor's IMDB page -`GENDER_PROB` | Percent chance that a given name is male -`GENDER_GUESS` | Based on the probablity, guess if the name is male or female \ No newline at end of file + +`nextBechal_crewGender.csv` contains data for the crew for every movie, by probablity that a give first name is male. + + Variable | Definition + ---|--------- + `MOVIE` | Title of the film + `DEPARTMENT` | Full name of the actor + `FULL_NAME` | Actor's first and last name + `FIRST_NAME` | Just first name of actor + `IMDB` | Actor's IMDB page + `GENDER_PROB` | Percent chance that a given name is male + `GENDER_GUESS` | Based on the probablity, guess if the name is male or female \ No newline at end of file