new->enron1 dataset ready and ready2 are to get dataset clean proj is an implementation of multinomial bayes(still have some doubts with vectorization, will update it asap)
Naive Bayes classifier for multinomial models
The multinomial Naive Bayes classifier is suitable for classification with discrete features (e.g., word counts for text classification). The multinomial distribution normally requires integer feature counts. However, in practice, fractional counts such as tf-idf may also work.