Spam Detection Algorithm Analysis By: Joseph LaFata Alex Wade
Problem Spam is a problem on the Internet Wastes bandwidth Wastes time General Annoyance
Overview ISP and Email Servers are constantly needing new ways to fight spam Spammers are not sitting still Coming up with new ways to evade spam filters Development of new prevention techniques always needed
Proposal Study different spam prevention techniques Develop framework for training and testing new algorithms
Spam Detection Techniques Look into common algorithms to prevent spam Implement at least three Attempt to develop new or hybrid algorithms
Framework Provide Training emails Easily add new algorithms Spam Valid Email Easily add new algorithms Keep Metrics How accurate is it? How long did it take to run?
Questions?
Timeline Week 1: Collect Spam\Valid Email Week 2: Write Framework for training and testing Week 3: Implement Spam Filter #1 Week 4: Implement Spam Filter #2 and #3 Week 5: Test, Collect Results, Write Paper
Spam Filters Spam Bayes Lookup Sending Host Presence of only images Further techniques will be researched