Survey on Spam Filtering Techniques and Mapreduce
Prajakta S. Patil, Prof. Rashmi A. Rane, Prof. Madhuri A. Bhalekar"Survey on Spam Filtering Techniques and Mapreduce", International Journal of Engineering Trends and Technology (IJETT), V30(9),444-447 December 2015. ISSN:2231-5381. www.ijettjournal.org. published by seventh sense research group
Spam Email, also known as junk email , is a subset of electronic spam involving nearly identical messages sent to numerous recipients by email. The messages may contain disguised links that appear to be for familiar websites but in fact lead to phishing web sites or sites that are hosting malware. Spam email may also include malware as scripts or other executable file attachments. Spam is any unwanted and harmful mail. Separation of spam from normal mails is essential. This paper surveys different spam email filtering techniques. The different techniques are Machine learning based, list based, content based and hybrid or other. Machine learning based, is mostly used because of high accuracy and mathematical support.
 Amol G. Kakade1, Prashant K. Kharat2, Anil Kumar Gupta, Survey of Spam Filtering Techniques and Tools, and Map Reduce with SVM, International Journal of Computer Science and Mobile Computing Vol.2 Issue. 11,November- 2013, pg. 91-98.
 Puch-Tran Ho , HEE Su Kin, Application of Sim Hash Algorithm and Big Data Analysis in Spam Email Detection System, International Journal of Computer Applications (0975 8887) Volume 39 No.6, February 2014.
 Sahil Puri1, Dishant Gosain2, "COMPARISON AND ANALYSIS OF SPAM DETECTION ALGORITHMS,International Journal of Application or Innovation in Engineering and Management,Volume 2, Issue 4, April 2013.
 Godwin Caruana, Maozhen Li, Yang Liu, An ontology enhanced parallel SVM for scalable spamlter training, Neuro computing Elsevier, vol. 108, pp.45-57, 2013.
 L. Zhang, J. Zhu, T. Yao, An evaluation of statistical spamltering techniques , ACM Transaction on Asian Language Information Process, vol. 3, pp.243269,2004.
Spam filtering techniques, Machine learning based ,content based, word based.