Spam Email Classifier

Overview

This repository contains code for a spam email classifier developed as part of my Machine Learning internship at Digital Empowerment Network. The goal of this project is to classify emails as spam or ham (not spam) using machine learning techniques.

Dataset

The dataset used for this project is the mail_data.csv file, which contains the following columns:

Category: The label of the email, either 'spam' or 'ham'.
Message: The content of the email.

Steps Involved

1. Data Preprocessing

Cleaning the data and preparing it for model training.

2.Feature Extraction

Transforming the text data into numerical features using TF-IDF vectorization.

3.Model Training

Training a Logistic Regression model to classify emails.

4.Model Evaluation

Evaluating the model's performance using accuracy, classification report, and confusion matrix.

Results

The Logistic Regression model achieved the following results:
Accuracy on training data: 0.967
Accuracy on test data: 0.965

Usage

To run the code, follow these steps:

Clone this repository to your local machine.
Navigate to the directory containing the code.
Ensure that the mail_data.csv file is in the same directory as the code.
Run the script: python spam_email_classifier.ipynb

Conclusion

This project demonstrates the process of building a spam email classifier using Logistic Regression. The model can accurately classify emails as spam or ham based on their content. Future improvements could include experimenting with different models and techniques to further enhance accuracy.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
Spam_Email_Classifier.ipynb		Spam_Email_Classifier.ipynb
mail_data.csv		mail_data.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Spam Email Classifier

Overview

Dataset

Steps Involved

1. Data Preprocessing

2.Feature Extraction

3.Model Training

4.Model Evaluation

Results

Usage

Conclusion

About

Releases

Packages

Languages

beenish-Ishtiaq/DEP-Task-2-Spam-Email-Classifier

Folders and files

Latest commit

History

Repository files navigation

Spam Email Classifier

Overview

Dataset

Steps Involved

1. Data Preprocessing

2.Feature Extraction

3.Model Training

4.Model Evaluation

Results

Usage

Conclusion

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages