Skip to content

mwangcs/SpamDetection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Spam Email Detector

Approach

Stem using function stem from python package stemming, module porter2. Construct dictionary as set of all Unicode-converted, stemmed ham and spam email words. Convert documents to vectors using (stemmed) dictionary-word in-document frequencies. Augments vectors with 1 for bias. Train the model using logistic regression by gradient descent with regularization.

Evaluation

92.8% recall and 93.2% precison on unseen test data

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages