Skip to content

Latest commit

 

History

History
112 lines (60 loc) · 5.53 KB

index.md

File metadata and controls

112 lines (60 loc) · 5.53 KB

Being able to identify the predominant human fluxes and mobility patterns in the world is crucial for effectively modelling the spreading of diseases, establishing effective mobility connections and understanding the travelling habits of different populations. This project extension continues the exploration of human geographic movement by relating the latitude and longitude coordinates of check-ins (to the Gowalla and Brightkite social networks) to the respective countries where the localization took place. This is done with the purpose of investigating human migration patterns within and among countries. We are looking for eventual correlation between the users’ home location and their travelling destinations, possibly identifying what are the most frequent connections among countries.

The main questions we asked ourselves

  1. What are the main fluxes of people over the world ?
  2. Have some countries different trips habits than others ?
  3. What is the influence of factors like the language, the distance or the GDP on travlling habits ?

What was our input data ?

We were given several millions of check-ins and the friendship network coming from two social networks : Brightkite and Gowalla.

Here's a summary of this data :

Social network Number of check-ins Number of users Number of countries and places Number of nodes Number of edges
Brightkite 4 491 035 50 686 242 58 228 214 078
Gowalla 6 442 728 107 069 196 196 591 950 327

You can see the distribution of these millions over the planet right there : Check-ins map

How do we shaped this data ?

1. Finding user's home

At the beginng, the only thing we had was a huge amount of check-ins. Before everything else, we had to define a house for each user, in order to know its nationality and the distances that the users travel when they leave their home.

To achieve this, we divided the all world into 25 km x 25 km cells. The home location of a user was the average location of his or her check-ins, within the cell into which he or she had the more check-ins.

You can see here the distribution of home location over the world :

Add the picture

Once this was done, we were able to give a nationality to each user.

2. Geographical boundaries of our study

As you can see on the distribution of check-ins and home locations, these lasts are not equally distributed on earth. These two social networks are more popular in the US and some European countries. Therefore, we had to set on threshold to apply our study only in places where we had enough data. In this analysis, we only kept the countries in which at leat 1 person over 100 000 was a user for both social networks.

The selected countries are represented on this map :

Add the picture

Results

1. Comparison between United-States and Europe

For each check-in, we could compute the distance between the user's location while checking in and his or her home location. Therefore, we have been able to compute various kind of statistics. Given the geographical boundaries of our stidues, we have chosen to compare the US and the average results of European countries that are part of the study (namely Europe in what follows).

Moreover, as we are only interested in travelling habits, the following statistics are based on check-ins that were done at a distance from home greater than 200 km, which is how we define a travel.

Here are the results :

Please note that in order not to account twice some users that could use both Brightkite and Gowalla, we will always present you the results for both of them separately and analyse the similarities and differences between these two.

The following figure shows how far from their home are people when they leave their country.

Distance EU vs US

You can see that American people go much further than European when they leave the US. That makes sense because the US is almost as big as all Europe in terms of surface.

Part of population EU vs US

Countries per traveller EU vs US

2. But where are they going ?!

Departures US Gowalla

Departures Sweden Gowalla

Departures US Brightkite

Departures Sweden Brightkite

3. What is influencing travellers ?

Regression Brightkite

Regression Gowalla

Conclusion

The team

Our team is composed of 3 students from EPFL :

Chiara ONGARO, Cyril CHAMBE and Matthieu SOUTTRE

Appendix

Datasets used

Check-ins from Gowalla

Check-ins from Brightkite

Country Mapping - ISO, Continent, Region

Countries of the world

Language list by country and place

Methods

Help for markdown

Help