- Extracting data: background color of cells in Excel files
- Reading in data from Google spreadsheet
- Converting from PDF using batch mode
- Discretize variables using numpy.matlib.repmat
- Data visualization via dimensionality-reduction
- Explore some ONC datasets
- Explore the YaleED dataset
- Explore clinical codes; calculate some clinical indices
- Logistic regression and LSTM using PIMA
- LR and MLP using PIMA
- Time-series classification (WISDM dataset)
- "Making Data Visual" by Danyel Fisher, Miriah Meyer
- Visualizing missing data quickly
- Visualization: bad examples
- Violinplot via Seaborn
- Cartograms (explained in first ten seconds)
- "Statistical Approaches to the Model Comparison Task in Learning Analytics," Gardner & Brooks, LAK2017
- Slides "Statistical testing for classification..." by B Evans
- Notes: "[...] ordinal regression is known to perform better than softmax on ordinal data (Cheng et al., 2008)"
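
To illustrate the note on ordinal data, here is a minimal sketch of a cumulative ("thermometer") target encoding, which lets a model with sigmoid outputs respect class order instead of treating ranks as unrelated softmax categories. The function names `ordinal_encode` and `ordinal_decode`, the use of `num_classes - 1` threshold outputs, and the 0.5 cutoff are assumptions for illustration; the exact scheme in Cheng et al. (2008) may differ.

```python
import numpy as np


def ordinal_encode(labels, num_classes):
    """Encode integer ranks 0..num_classes-1 as cumulative binary targets.

    Output column t answers "is the rank greater than t?", so rank 2 of 4
    classes becomes [1, 1, 0]. (Hypothetical helper, not from the source.)
    """
    labels = np.asarray(labels)
    thresholds = np.arange(num_classes - 1)
    return (labels[:, None] > thresholds[None, :]).astype(int)


def ordinal_decode(probs, cutoff=0.5):
    """Turn per-threshold probabilities back into a rank.

    The predicted rank is the number of leading outputs above the cutoff;
    counting stops at the first output that falls below it.
    """
    above = np.asarray(probs) > cutoff
    return np.cumprod(above, axis=1).sum(axis=1)


targets = ordinal_encode([0, 2, 3], num_classes=4)
ranks = ordinal_decode([[0.9, 0.8, 0.2], [0.1, 0.7, 0.9]])
```

With this encoding, predicting rank 3 for a true rank of 2 gets only one target bit wrong, whereas one-hot softmax targets penalize every misclassification equally regardless of distance.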