BugBang

This repository contains an R script to classify an issue report from JIRA as referring to a bug or not. It also contains associated datasets. They are used in the following paper:

Nitish Pandey, Debarshi Kumar Sanyal, Abir Hudait, and Amitava Sen, “Automated Classification of Software Issue Reports Using Machine Learning Techniques: An Empirical Study,” Innovations in Systems and Software Engineering, Springer. (Accepted, March 2017) (doi:10.1007/s1133)

Please cite the above paper if you use the scripts in this repository. I thank Prof. Hideaki Hata for the data (which were then augmented with records from JIRA).

Commandline in Linux

Rscript BugBang_rel_v4.0.R --infile="./datasets/exp1/http_client.csv" --outfile="./out/classification_http_client.out" --max_terms_in_dtm=0.25 --normalize=3 --cv_fold=10

Ensure that the input files and the output directory exist.

Script Modification for Experiment 2

The released script is for experiment 1 (data in folder "exp1"). For experiments in exp2, "CLASSIFIED" in the following lines should be replaced with "TYPE":

training.data.input <- training.data.input [ training.data.input$CLASSIFIED %in% c("BUG", "NUG"), ]

Train.Type <- as.factor(training.data.input$CLASSIFIED);

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
BugBang_rel_v4.0.R		BugBang_rel_v4.0.R
LogsForJournal.zip		LogsForJournal.zip
README.md		README.md
TablesAndGraphsForJournal.zip		TablesAndGraphsForJournal.zip
datasets.zip		datasets.zip

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BugBang

Contents

Commandline in Linux

Script Modification for Experiment 2

About

Releases

Packages

Languages

dksanyal/BugBang

Folders and files

Latest commit

History

Repository files navigation

BugBang

Contents

Commandline in Linux

Script Modification for Experiment 2

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages