Data Preprocess, ML, Regression, Classification, Text Analysis
-
qunar_data_merged.csv: raw data
-
regression.csv: final regression data
-
classfication_all_14features.csv: final classification data
-
preprocessing.ipynb: the first step, data cleaning
-
cluster_variable_select.ipynb: the second step, dimensionality reduction
-
R_reg_preprocess.R: generate regression data by R
-
R_cla_preprocess.R: generate classification data by R
-
class_train_all.csv: classification train data used
-
class_test_all.csv: classification test data used
-
R_operate.Rmd: classification by ML with R