Skip to content

MiniBooNE particle identification signal or background classification

Notifications You must be signed in to change notification settings

zachary-cauchi/CCE5225-assignment-1

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Dataset breakdown:

  • 1st line:
    • 1st number: Signal events in file.
    • 2nd number: Background events in file.
  • Each line past line 1:
    • 1 event (either signal or background):
      • 50 variables known as Particle IDentification variables.
    • Events are 1 of 2 classes - signal, or background.
  • Aim is to classify events correctly into the two classes.

Setup

  1. Download MiniBooNE dataset.

Goals

Implement the below classifiers, ensuring each individual classifier meets the shared objectives below.

Classifier implementations

  • Vanilla neural network
  • Support Vector Machine
  • Random forest

Shared Objectives

  • Load and transform the dataset into a feature matrix. Expected format of (x: event, y: feature).
  • Scale the input features to a discreet value range.
  • Re-shuffle the dataset, dividing into a training and test set of 80% and 20% respectively.
  • nn: [x] svm: [x] rf: [x] Identify the hyperparameters which result in the best accuracy. Do so using a grid-search and test it using 5-fold cross-validation (Hence the 80% 20% partitioning above.)
    • nn: [ ] svm: [ ] rf: [ ] Comment/provide feedback on the performance influence of different hyperparameter values per model.
  • nn: [x] svm: [x] rf: [x] Report the following data:
    • nn: [x] svm: [x] rf: [x] Training time
    • nn: [x] svm: [x] rf: [x] Tested hyperparameters
    • nn: [x] svm: [x] rf: [x] Confusion matrix of the per-class accuracies achieved by running on the unseen test set.
  • nn: [ ] svm: [ ] rf: [ ] Evaluate each model's performance, commenting on the performance and providing an explanation with reasons on why - in your opinion - the highest-performing model gave the best results (see Information Theory).

About

MiniBooNE particle identification signal or background classification

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published