Being able to identify the predominant human fluxes and mobility patterns in the world is crucial for effectively modelling the spreading of diseases, establishing effective mobility connections and understanding the travelling habits of different populations. This project extension continues the exploration of human geographic movement by relating the latitude and longitude coordinates of check-ins (to the Gowalla and Brightkite social networks) to the respective countries where the localization took place. This is done with the purpose of investigating human migration patterns within and among countries. We are looking for eventual correlation between the users’ home location and their travelling destinations, possibly identifying what are the most frequent connections among countries.
- What are the main fluxes of people over the world ?
- Have some countries different trips habits than others ?
- What is the influence of factors like the language, the distance or the GDP on travlling habits ?
We were given several millions of check-ins and the friendship network coming from two social networks : Brightkite and Gowalla.
Here's a summary of this data :
Social network | Number of check-ins | Number of users | Number of countries and places | Number of nodes | Number of edges |
---|---|---|---|---|---|
Brightkite | 4 491 035 | 50 686 | 242 | 58 228 | 214 078 |
Gowalla | 6 442 728 | 107 069 | 196 | 196 591 | 950 327 |
You can see the distribution of these millions over the planet right there :
At the beginng, the only thing we had was a huge amount of check-ins. Before everything else, we had to define a house for each user, in order to know its nationality and the distances that the users travel when they leave their home.
To achieve this, we divided the all world into 25 km x 25 km cells. The home location of a user was the average location of his or her check-ins, within the cell into which he or she had the more check-ins.
You can see here the distribution of home location over the world :
Add the picture
Once this was done, we were able to give a nationality to each user.
As you can see on the distribution of check-ins and home locations, these lasts are not equally distributed on earth. These two social networks are more popular in the US and some European countries. Therefore, we had to set on threshold to apply our study only in places where we had enough data. In this analysis, we only kept the countries in which at leat 1 person over 100 000 was a user for both social networks.
The selected countries are represented on this map :Add the picture
For each check-in, we could compute the distance between the user's location while checking in and his or her home location. Therefore, we have been able to compute various kind of statistics. Given the geographical boundaries of our stidues, we have chosen to compare the US and the average results of European countries that are part of the study (namely Europe in what follows).
Moreover, as we are only interested in travelling habits, the following statistics are based on check-ins that were done at a distance from home greater than 200 km, which is how we define a travel.
Here are the results :
Please note that in order not to account twice some users that could use both Brightkite and Gowalla, we will always present you the results for both of them separately and analyse the similarities and differences between these two.
The following figure shows how far from their home are people when they leave their country.
You can see that American people go much further than European when they leave the US. That makes sense because the US is almost as big as all Europe in terms of surface.
Our team is composed of 3 students from EPFL :
Chiara ONGARO, Cyril CHAMBE and Matthieu SOUTTRE
Country Mapping - ISO, Continent, Region