Student Performance Prediction and Educational Interventions

This project aims to predict student performance and provide educational interventions based on influential factors. We use the Student Performance Data Set, combining math and Portuguese language datasets to build a predictive model and offer tailored recommendations.

Installation

Clone the repository:

git clone https://github.com/yourusername/student-performance-prediction.git

Navigate to the project directory:
```
cd student-performance-prediction
```
Install the required packages:
```
pip install -r requirements.txt
```

Usage

Ensure you have the dataset files (student-mat.csv and student-por.csv) in the project directory.
Run the script:
```
python main.py
```

Project Overview

This project involves the following steps:

Loading and combining the student performance datasets.
Cleaning and preprocessing the data.
Splitting the data into training and testing sets.
Training a RandomForestRegressor model to predict student performance.
Evaluating the model's performance using Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE).
Analyzing feature importance to understand influential factors.
Providing educational interventions based on model predictions.

Model Training

The model training involves data loading, preprocessing, feature and label separation, train-test split, and training the RandomForestRegressor model.

Evaluation

Evaluate the model using MAE and RMSE to measure its performance.

Feature Importance

Analyze feature importance to understand the influential factors that affect student performance.

Educational Interventions

Provide tailored recommendations based on influential features identified during the model training process.

Example Usage

Example of recommending interventions for a student, showcasing how the model's predictions can be used to provide personalized educational support.

Mathematical Concepts and Formulas

Mean Absolute Error (MAE)

The Mean Absolute Error is calculated as:

$$ MAE = rac{1}{n} \sum_{i=1}^{n} |y_i - \hat{y}_i| $$

where ( y_i ) is the actual value and ( \hat{y}_i ) is the predicted value.

Root Mean Squared Error (RMSE)

The Root Mean Squared Error is calculated as:

$$ RMSE = \sqrt{rac{1}{n} \sum_{i=1}^{n} (y_i - \hat{y}_i)^2} $$

where ( y_i ) is the actual value and ( \hat{y}_i ) is the predicted value.

Random Forest Regressor

Random Forest is an ensemble learning method that operates by constructing multiple decision trees during training and outputting the mean prediction of the individual trees. The model combines the predictions from many decision trees to improve the overall prediction accuracy and control overfitting.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
README.md		README.md
student-mat.csv		student-mat.csv
student-performance.py		student-performance.py
student-por.csv		student-por.csv
student.txt		student.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Student Performance Prediction and Educational Interventions

Table of Contents

Installation

Usage

Project Overview

Model Training

Evaluation

Feature Importance

Educational Interventions

Example Usage

Mathematical Concepts and Formulas

Mean Absolute Error (MAE)

Root Mean Squared Error (RMSE)

Random Forest Regressor

License

About

Releases

Packages

Languages

JacquesGariepy/student-performance

Folders and files

Latest commit

History

Repository files navigation

Student Performance Prediction and Educational Interventions

Table of Contents

Installation

Usage

Project Overview

Model Training

Evaluation

Feature Importance

Educational Interventions

Example Usage

Mathematical Concepts and Formulas

Mean Absolute Error (MAE)

Root Mean Squared Error (RMSE)

Random Forest Regressor

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages