Trinh Viet Doan (Technical University of Munich) | Vaibhav Bajpai (Technical University of Munich) | Sam Crawford (SamKnows)
IEEE INFOCOM 2020, July 6–9, 2020.
The dataset is collected from ~100 SamKnows probes:
The raw datasets are available at:
The data consists of two `sqlite3` databases: one for the measurements by the Netflix test (`netflix-data.db`), the other for the throughput measurements toward MLab (`mlab-data.db`).
The schemas of the tables can be found under `./data/netflix-schema.sql` and `./data/mlab-schema.sql`.
This repository contains (most of) the required code and metadata to reproduce the results; see below for further instructions.
To read from the databases (see above), `sqlite3` is needed.
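
For a quick look at the contents, a minimal sketch for inspecting one of the databases from Python is shown below; no table names are hard-coded, since they depend on the schemas in `./data/`.

```python
# Minimal sketch: list the tables in one of the measurement databases
# and preview a few rows. Table names are taken from the database itself,
# since they depend on the schemas in ./data/.
import sqlite3

conn = sqlite3.connect("./data/netflix-data.db")
cur = conn.cursor()

# Enumerate all tables defined in the database.
cur.execute("SELECT name FROM sqlite_master WHERE type = 'table'")
tables = [row[0] for row in cur.fetchall()]
print(tables)

# Preview the first rows of each table.
for table in tables:
    cur.execute("SELECT * FROM {} LIMIT 5".format(table))
    for row in cur.fetchall():
        print(table, row)

conn.close()
```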
The analyses were performed using `jupyter` notebooks on Python 2.7.
Required Python dependencies are listed in `requirements.txt` and can be installed using `pip install -r requirements.txt`.
For the calculation of CDFs and drawing of the corresponding plots, `Pmf.py` and `Cdf.py` from Think Stats are used.
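
For reference, the same kind of empirical CDF can also be computed without the Think Stats modules; the sketch below uses only `numpy` and `matplotlib`, with placeholder throughput values and an illustrative output filename rather than data from the actual databases.

```python
# Minimal sketch of an empirical CDF, equivalent in spirit to what
# Pmf.py/Cdf.py provide; uses only numpy and matplotlib.
import numpy as np
import matplotlib.pyplot as plt

def empirical_cdf(samples):
    """Return sorted sample values and their cumulative probabilities."""
    xs = np.sort(np.asarray(samples))
    ps = np.arange(1, len(xs) + 1) / float(len(xs))
    return xs, ps

# Placeholder throughput values (Mbit/s), purely illustrative.
xs, ps = empirical_cdf([12.3, 48.1, 25.7, 95.0, 33.4])
plt.step(xs, ps, where="post")
plt.xlabel("Throughput [Mbit/s]")
plt.ylabel("CDF")
plt.savefig("example-cdf.png")  # illustrative filename
```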
Move the required datasets and modules to the following locations (a small helper sketch follows this list):

- `netflix-data.db` → `./data/`
- `mlab-data.db` → `./data/`
- `Pmf.py` → `./notebooks/`
- `Cdf.py` → `./notebooks/`
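
The following is a small, purely illustrative helper that performs the same moves from Python (a plain `mv` works just as well); it assumes the four files sit in the repository root before running.

```python
# Illustrative helper: move the datasets and Think Stats modules into
# the locations expected by the notebooks. Assumes the four files are
# in the current directory before running.
import shutil

moves = {
    "netflix-data.db": "./data/",
    "mlab-data.db": "./data/",
    "Pmf.py": "./notebooks/",
    "Cdf.py": "./notebooks/",
}

for filename, destination in moves.items():
    shutil.move(filename, destination)
```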
Run the `aggregation.ipynb` notebook to process and aggregate the raw dataset, which will store the results in a separate database. After that, the other notebooks (`fig-*.ipynb`) can be used to draw the plots presented in the paper.
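
As an alternative to opening the notebooks interactively, they can be executed headlessly; the sketch below uses `nbformat` together with `nbconvert`'s `ExecutePreprocessor`, and assumes the notebooks live under `./notebooks/` and that a `python2` kernel is registered (adjust both if your setup differs).

```python
# Minimal sketch: execute the aggregation notebook without opening Jupyter.
# The notebook path and kernel_name are assumptions about the local setup.
import nbformat
from nbconvert.preprocessors import ExecutePreprocessor

with open("./notebooks/aggregation.ipynb") as f:
    nb = nbformat.read(f, as_version=4)

ep = ExecutePreprocessor(timeout=None, kernel_name="python2")
ep.preprocess(nb, {"metadata": {"path": "./notebooks/"}})

with open("./notebooks/aggregation.ipynb", "w") as f:
    nbformat.write(nb, f)
```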
All plots are saved under `./plots/`.
Note: the metadata lookup has already been performed; however, it can be repeated by running `./metadata/netflix-metadata-lookup.py` and `./metadata/probe-to-timezone.ipynb`.
Please feel welcome to contact the authors for further details.
- Trinh Viet Doan ([email protected]) (corresponding author)
- Vaibhav Bajpai ([email protected])
- Sam Crawford ([email protected])