Skip to content

JacquesGariepy/student-performance

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Student Performance Prediction and Educational Interventions

This project aims to predict student performance and provide educational interventions based on influential factors. We use the Student Performance Data Set, combining math and Portuguese language datasets to build a predictive model and offer tailored recommendations.

Table of Contents

Installation

  1. Clone the repository:
    git clone https://github.com/yourusername/student-performance-prediction.git
  2. Navigate to the project directory:
    cd student-performance-prediction
  3. Install the required packages:
    pip install -r requirements.txt

Usage

  1. Ensure you have the dataset files (student-mat.csv and student-por.csv) in the project directory.
  2. Run the script:
    python main.py

Project Overview

This project involves the following steps:

  1. Loading and combining the student performance datasets.
  2. Cleaning and preprocessing the data.
  3. Splitting the data into training and testing sets.
  4. Training a RandomForestRegressor model to predict student performance.
  5. Evaluating the model's performance using Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE).
  6. Analyzing feature importance to understand influential factors.
  7. Providing educational interventions based on model predictions.

Model Training

The model training involves data loading, preprocessing, feature and label separation, train-test split, and training the RandomForestRegressor model.

Evaluation

Evaluate the model using MAE and RMSE to measure its performance.

Feature Importance

Analyze feature importance to understand the influential factors that affect student performance.

Educational Interventions

Provide tailored recommendations based on influential features identified during the model training process.

Example Usage

Example of recommending interventions for a student, showcasing how the model's predictions can be used to provide personalized educational support.

Mathematical Concepts and Formulas

Mean Absolute Error (MAE)

The Mean Absolute Error is calculated as:

$$ MAE = rac{1}{n} \sum_{i=1}^{n} |y_i - \hat{y}_i| $$

where ( y_i ) is the actual value and ( \hat{y}_i ) is the predicted value.

Root Mean Squared Error (RMSE)

The Root Mean Squared Error is calculated as:

$$ RMSE = \sqrt{rac{1}{n} \sum_{i=1}^{n} (y_i - \hat{y}_i)^2} $$

where ( y_i ) is the actual value and ( \hat{y}_i ) is the predicted value.

Random Forest Regressor

Random Forest is an ensemble learning method that operates by constructing multiple decision trees during training and outputting the mean prediction of the individual trees. The model combines the predictions from many decision trees to improve the overall prediction accuracy and control overfitting.

License

This project is licensed under the MIT License. See the LICENSE file for details.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages