Presentation is loading. Please wait.

Presentation is loading. Please wait.

Spam Filtering Techniques Arnold Perez Joseph Tilley.

Similar presentations


Presentation on theme: "Spam Filtering Techniques Arnold Perez Joseph Tilley."— Presentation transcript:

1 Spam Filtering Techniques Arnold Perez Joseph Tilley

2 Spam Filters CRM114 – The Controllable Regex Mutilator Bayesian Filter – improvements over Pantel and Lin Case Based Approach to Spam Filtering that Can Track Concept Drift

3 Spam Filter Traits Text Classification Use text classification to identify spam Concept Drift Leverage case based filtering to avoid concept drift Email Headers Investigate data in email headers

4 Other Spam Filtering Techniques Blacklists List of sender information that will identify an email as spam Greylists Hold messages from a sender that is not recognized

5 Goals Evaluate different Spam Filtering Techniques Create Spam Filter that borrows from different strengths from other Spam Filters Decrease number of false positives.

6 Testing Test our filter against a set of text files that represent emails. Compare our results with statistical data of existing spam filters. Provide record statistics at milestones Addition of text classification Addition of Bayesian improvements Addition of cased based filtering

7 Project Deliverables Create Spam Filter that combines text classification, case based filtering, and improved Bayesian filter Comparisons of our filter to existing statistical data. Conclusions, lessons learned and possible future work.


Download ppt "Spam Filtering Techniques Arnold Perez Joseph Tilley."

Similar presentations


Ads by Google