s-shikharcse/Email-Spam_Classifier

Email-Spam_Classifier

We all face the problem of spam in our inboxes. Let's build a spam classifier in Python that can tell whether a given message is spam or not. We can do this with a simple yet powerful result from probability theory called Bayes' theorem. It is mathematically expressed as

$$P(A \mid B) = \frac{P(B \mid A)\,P(A)}{P(B)}$$
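As a small numeric illustration of Bayes' theorem, the sketch below computes the probability that a message is spam given that it contains a particular word. The three input probabilities are made-up values chosen only for the example, and `posterior` is a hypothetical helper name, not something from this repository.

```python
def posterior(prior_a, p_b_given_a, p_b_given_not_a):
    """Return P(A | B) from P(A), P(B | A) and P(B | not A)."""
    # Law of total probability gives the denominator P(B).
    p_b = p_b_given_a * prior_a + p_b_given_not_a * (1.0 - prior_a)
    return p_b_given_a * prior_a / p_b


# Hypothetical numbers: 20% of mail is spam, the word appears in
# 50% of spam messages and in 5% of non-spam messages.
p_spam_given_word = posterior(0.2, 0.5, 0.05)
print(round(p_spam_given_word, 2))  # about 0.71
```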

We have a message m = (w1, w2, ..., wn), where (w1, w2, ..., wn) is the set of unique words contained in the message. We need to find

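$$P(\text{spam} \mid w_1 \cap w_2 \cap \dots \cap w_n) = \frac{P(w_1 \cap w_2 \cap \dots \cap w_n \mid \text{spam})\,P(\text{spam})}{P(w_1 \cap w_2 \cap \dots \cap w_n)}$$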

If we assume that the occurrence of each word is independent of all other words, we can simplify the above expression to

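$$P(\text{spam} \mid w_1 \cap w_2 \cap \dots \cap w_n) = \frac{P(w_1 \mid \text{spam})\,P(w_2 \mid \text{spam}) \cdots P(w_n \mid \text{spam})\,P(\text{spam})}{P(w_1)\,P(w_2) \cdots P(w_n)}$$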

To classify the message, we have to determine which of the following two quantities is greater:

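$$P(w_1 \mid \text{spam}) \cdots P(w_n \mid \text{spam})\,P(\text{spam}) \quad \text{versus} \quad P(w_1 \mid \text{ham}) \cdots P(w_n \mid \text{ham})\,P(\text{ham})$$

Here "ham" denotes a message that is not spam. The denominator is the same for both classes, so it can be ignored in the comparison.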
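The comparison described above can be sketched in a few lines of Python. This is a minimal illustration, not the repository's actual code: the function names, the tiny training list, the label "ham" for non-spam messages, and the add-one smoothing are all assumptions made for the example. Logarithms are summed instead of multiplying probabilities so that long messages do not underflow to zero.

```python
import math
from collections import Counter


def train(messages):
    """messages: list of (text, label) pairs, label is 'spam' or 'ham'.

    Assumes both labels occur at least once in the training data.
    """
    doc_counts = Counter()  # number of messages per class
    word_counts = {"spam": Counter(), "ham": Counter()}
    for text, label in messages:
        doc_counts[label] += 1
        # Each message is treated as a set of unique words.
        for word in set(text.lower().split()):
            word_counts[label][word] += 1
    return doc_counts, word_counts


def log_score(words, label, doc_counts, word_counts):
    """log of P(w1|label) * ... * P(wn|label) * P(label)."""
    total = sum(doc_counts.values())
    score = math.log(doc_counts[label] / total)
    for word in words:
        # Add-one smoothing (an assumption) avoids log(0) for unseen words.
        p = (word_counts[label][word] + 1) / (doc_counts[label] + 2)
        score += math.log(p)
    return score


def classify(text, doc_counts, word_counts):
    words = set(text.lower().split())
    spam = log_score(words, "spam", doc_counts, word_counts)
    ham = log_score(words, "ham", doc_counts, word_counts)
    return "spam" if spam > ham else "ham"


# Hypothetical toy data, only to show the calls.
training = [
    ("win free money now", "spam"),
    ("free prize claim now", "spam"),
    ("meeting at noon tomorrow", "ham"),
    ("lunch tomorrow with the team", "ham"),
]
doc_counts, word_counts = train(training)
print(classify("claim your free prize", doc_counts, word_counts))   # spam
print(classify("team meeting tomorrow", doc_counts, word_counts))   # ham
```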

This project classifies an email as spam or not spam by scanning the email body. The program compares the tokens in the body against a stored list of spam keywords; if it finds a match, it reports the mail as spam, and otherwise as not spam.

A message classified as spam is placed in the spam folder or given a spam label. A message that is not spam appears in the inbox as a normal message.
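The keyword scan and the routing step described above might look roughly like the following. This is a hedged sketch: `SPAM_KEYWORDS`, `tokenize`, `is_spam`, and `route` are hypothetical names, and the keyword set is a placeholder rather than the list used by this project.

```python
import re

# Placeholder keyword list; the project's stored keywords may differ.
SPAM_KEYWORDS = {"lottery", "winner", "prize", "free"}


def tokenize(body):
    """Split an email body into lowercase word tokens."""
    return re.findall(r"[a-z0-9']+", body.lower())


def is_spam(body, keywords=SPAM_KEYWORDS):
    """Return True if any token of the body is a stored spam keyword."""
    return any(token in keywords for token in tokenize(body))


def route(body):
    """Return the folder a message should be shown in."""
    return "spam" if is_spam(body) else "inbox"


print(route("You are a lottery WINNER!"))    # spam
print(route("Agenda for Monday's meeting"))  # inbox
```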

Download the dataset from the link below:

 https://drive.google.com/open?id=1bhx83hB8aS8EnQepSjzD_OoRnK-ArJiQ
