injury_prediction

Applying machine learning methods for predicting knee ligament injuries

Overview

I applied five machine learning models (Logistic Regression, Decision Tree, Random Forest, SVM and MLP) to predict whether a patient will suffer a new injury, based on several attributes.
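As a rough illustration of how the five model families can be set up side by side, here is a minimal scikit-learn sketch. It is not the code from models.ipynb: the data is a synthetic stand-in generated with make_classification, and every hyperparameter (hidden layer size, number of trees, split ratio) is an assumption chosen only to keep the example short.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for data.csv (assumption: the real columns are not reproduced here).
X, y = make_classification(n_samples=200, n_features=20, n_informative=5,
                           weights=[0.8, 0.2], random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0)

# Scale-sensitive models are wrapped in a pipeline with a StandardScaler.
models = {
    "Logistic Regression": make_pipeline(StandardScaler(),
                                         LogisticRegression(max_iter=1000)),
    "Decision Tree": DecisionTreeClassifier(random_state=0),
    "Random Forest": RandomForestClassifier(n_estimators=100, random_state=0),
    "SVM": make_pipeline(StandardScaler(), SVC()),
    "MLP": make_pipeline(StandardScaler(),
                         MLPClassifier(hidden_layer_sizes=(16,), max_iter=2000,
                                       random_state=0)),
}

# Fit each model and record its accuracy on the held-out split.
scores = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    scores[name] = model.score(X_test, y_test)
    print(f"{name}: test accuracy = {scores[name]:.2f}")
```

The numbers printed by this sketch come from synthetic data and say nothing about the results reported for the real dataset.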

This dataset was collected as part of a Master's thesis in physiotherapy, and the patients' names were removed from data.csv. More data are being collected, and this analysis will be updated as soon as they become available.

The attributes of the dataset are written in Portuguese, and the label is the feature "Você sofreu uma nova lesão do ligamento cruzado anterior?" (Have you suffered a new anterior cruciate ligament injury?).

PCA

PCA was used to decrease the number of features in the model. My approach was to reduce the number of features by 28, taking into account those that were highly correlated.
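A minimal sketch of this kind of reduction with scikit-learn is shown below. The original feature count is not stated here, so the 40-column synthetic matrix is a hypothetical stand-in; only the "28 fewer features" figure comes from the text. Standardizing before PCA is also an assumption, not something confirmed by the notebook.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)

# Hypothetical data: 40 correlated columns generated from 8 latent factors.
n_samples, n_features = 100, 40
latent = rng.normal(size=(n_samples, 8))
mixing = rng.normal(size=(8, n_features))
X = latent @ mixing + 0.1 * rng.normal(size=(n_samples, n_features))

# Keep 28 fewer components than there are original features.
n_components = n_features - 28

# PCA is sensitive to feature scale, so standardize first.
X_scaled = StandardScaler().fit_transform(X)
pca = PCA(n_components=n_components)
X_reduced = pca.fit_transform(X_scaled)

# Fraction of the total variance retained by the kept components.
explained = pca.explained_variance_ratio_.sum()
print(X_reduced.shape, round(float(explained), 3))
```

When the original columns are strongly correlated, as in this synthetic example, a small number of components retains most of the variance, which is what makes the reduction worthwhile.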

Imbalanced data

To address the class imbalance, I used SMOTE to oversample the minority class.
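To make the idea behind SMOTE concrete, the sketch below generates synthetic minority samples by interpolating between a minority point and one of its nearest minority neighbours. This is a simplified NumPy illustration, not the implementation used in the notebook; the function name smote_sketch, its parameters, and the toy data are all hypothetical. Libraries such as imbalanced-learn provide a complete implementation.

```python
import numpy as np

def smote_sketch(X_min, n_new, k=5, seed=0):
    """Create n_new synthetic minority samples by neighbour interpolation."""
    rng = np.random.default_rng(seed)
    X_min = np.asarray(X_min, dtype=float)
    n = len(X_min)
    k = min(k, n - 1)

    # Pairwise Euclidean distances between minority samples.
    dists = np.linalg.norm(X_min[:, None, :] - X_min[None, :, :], axis=2)
    np.fill_diagonal(dists, np.inf)  # a point is not its own neighbour
    neighbours = np.argsort(dists, axis=1)[:, :k]

    # Pick a random base point and one of its k nearest neighbours.
    base = rng.integers(0, n, size=n_new)
    picked = neighbours[base, rng.integers(0, k, size=n_new)]

    # Place the new sample at a random position on the segment between them.
    gap = rng.random((n_new, 1))
    return X_min[base] + gap * (X_min[picked] - X_min[base])

# Toy imbalanced data: 90 majority rows, 10 minority rows (made up).
rng = np.random.default_rng(1)
X_majority = rng.normal(0.0, 1.0, size=(90, 4))
X_minority = rng.normal(2.0, 1.0, size=(10, 4))

X_synth = smote_sketch(X_minority, n_new=len(X_majority) - len(X_minority))
X_balanced = np.vstack([X_majority, X_minority, X_synth])
y_balanced = np.array([0] * len(X_majority)
                      + [1] * (len(X_minority) + len(X_synth)))
print(np.bincount(y_balanced))
```

Oversampling is normally applied to the training split only, so that synthetic points do not leak into the data used for evaluation.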

Results

The models achieved between 86% and 96% accuracy. Some notes:

Logistic Regression had excellent results
Decision Tree had the worst accuracy
Random Forest performed poorly, which I attribute to the curse of dimensionality. It always predicts only one class
SVM and Neural Networks achieved the best results considering accuracy, recall, precision and F1-score
I implemented an MLP with TensorFlow, and it achieved results very similar to the sklearn implementation
I collected some percentage predictions from the outputs of the last layer (softmax activation function) of the TensorFlow implementation. Find below a screenshot of these results
This dataset is small and the results may change when more data becomes available
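
The notes above mention accuracy, recall, precision and F1-score, as well as a model that predicts only one class. The sketch below shows one way to compute those metrics with scikit-learn and to flag single-class predictions. The helper name summarize and the labels are made up for illustration; they are not results from this dataset.

```python
import numpy as np
from sklearn.metrics import accuracy_score, precision_recall_fscore_support

def summarize(y_true, y_pred):
    """Return accuracy, precision, recall, F1 and a single-class flag."""
    precision, recall, f1, _ = precision_recall_fscore_support(
        y_true, y_pred, average="binary", pos_label=1, zero_division=0)
    return {
        "accuracy": accuracy_score(y_true, y_pred),
        "precision": precision,
        "recall": recall,
        "f1": f1,
        # True when the model never predicts more than one class.
        "single_class": len(np.unique(y_pred)) == 1,
    }

# Made-up labels: 18 negatives, 2 positives.
y_true = np.array([0] * 18 + [1] * 2)
# A hypothetical model that only ever predicts class 0.
always_negative = np.zeros_like(y_true)

report = summarize(y_true, always_negative)
print(report)
```

In this toy case the accuracy is high even though the positive class is never detected, which is why recall, precision and F1-score are worth reporting alongside accuracy on imbalanced data.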

