Academic project implementing a supervised spam filter with MLlib, the machine learning library of Apache Spark.
It also includes a data processing step that has been left commented out because enabling it lowered the model's accuracy.
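
For readers unfamiliar with the idea, the following is a minimal, library-free sketch of what a supervised spam filter does: it learns word statistics from labeled messages and then scores new messages. It does not use Spark and is not the project's code. The choice of multinomial Naive Bayes, the function names (`tokenize`, `train`, `predict`) and the toy messages are all assumptions made for illustration; the description above does not say which MLlib algorithm or dataset the project uses.

```python
import math
from collections import Counter, defaultdict

# Hypothetical toy data: (message, label) pairs, invented for this sketch.
TRAINING_DATA = [
    ("win free money now", "spam"),
    ("free prize claim now", "spam"),
    ("meeting at noon tomorrow", "ham"),
    ("lunch tomorrow with the team", "ham"),
]


def tokenize(text):
    """Lowercase the text and split it on whitespace."""
    return text.lower().split()


def train(examples):
    """Count documents per label and word occurrences per label."""
    doc_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in examples:
        doc_counts[label] += 1
        for token in tokenize(text):
            word_counts[label][token] += 1
            vocab.add(token)
    return doc_counts, word_counts, vocab


def predict(model, text):
    """Return the label with the highest log-probability score."""
    doc_counts, word_counts, vocab = model
    total_docs = sum(doc_counts.values())
    scores = {}
    for label, counts in word_counts.items():
        # Log prior: fraction of training messages carrying this label.
        score = math.log(doc_counts[label] / total_docs)
        total_words = sum(counts.values())
        for token in tokenize(text):
            if token not in vocab:
                continue  # ignore words never seen during training
            # Add-one (Laplace) smoothing avoids log(0) for unseen pairs.
            score += math.log((counts[token] + 1) / (total_words + len(vocab)))
        scores[label] = score
    return max(scores, key=scores.get)


model = train(TRAINING_DATA)
print(predict(model, "claim your free money"))
print(predict(model, "team meeting tomorrow"))
```

In a Spark-based version the same two stages (turning text into features, then fitting a classifier on labeled examples) would be handled by MLlib components rather than hand-written counters; which ones the project uses is not stated here.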