Toxic-Content-Filtering

An NLP-based project for filtering toxic content.

Dataset: https://www.kaggle.com/c/jigsaw-toxic-comment-classification-challenge/data

Live Website: http://rashmio2410.pythonanywhere.com/

Libraries required: pandas, nltk (plus the tagger data, fetched with nltk.download('averaged_perceptron_tagger')), Flask, numpy, scikit-learn, matplotlib
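As a quick sanity check before running anything, the sketch below reports which of the libraries listed above cannot be imported in the current environment. The helper name `missing_modules` is hypothetical and not part of this repository. Note that scikit-learn is imported as `sklearn` and Flask as `flask`.

```python
import importlib.util

# Import names for the libraries listed above
# (scikit-learn is imported as "sklearn", Flask as "flask").
REQUIRED = ["pandas", "nltk", "flask", "numpy", "sklearn", "matplotlib"]


def missing_modules(names=REQUIRED):
    """Return the import names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("Missing libraries:", ", ".join(missing))
    else:
        print("All required libraries are importable.")
```

This only checks that the packages are installed; the NLTK tagger data still has to be downloaded separately.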

Steps to run: Before executing any of the following commands, the ‘train.csv’ and ‘test.csv’ files of the dataset must be present in the ‘data’ directory (download them from the dataset link above).
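The prerequisite above can be verified with a few lines of Python. This is a minimal sketch, assuming it is run from the repository root so that `data` resolves to the dataset directory; the helper name `missing_data_files` is hypothetical and not part of this repository.

```python
from pathlib import Path

# File names come from the step above; the directory is passed in
# because its location depends on where the script is run from.
EXPECTED_FILES = ("train.csv", "test.csv")


def missing_data_files(data_dir):
    """Return the expected dataset files that are absent from data_dir."""
    data_dir = Path(data_dir)
    return [name for name in EXPECTED_FILES if not (data_dir / name).is_file()]


if __name__ == "__main__":
    # Assumption: run from the repository root, where 'data' lives.
    missing = missing_data_files("data")
    if missing:
        print("Missing from 'data':", ", ".join(missing))
    else:
        print("Dataset files found.")
```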

Create and save model

$cd src

$python main.py create

This creates the model and saves it as pickle files in the ‘model’ directory.
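For readers unfamiliar with pickle files, the round trip looks roughly like the sketch below. It is not the repository's actual code: the helper names are hypothetical, and any picklable Python object (a plain dict in the usage example) stands in for the trained model.

```python
import pickle
from pathlib import Path


def save_object(obj, path):
    """Serialize obj to path with pickle, creating parent directories."""
    path = Path(path)
    path.parent.mkdir(parents=True, exist_ok=True)
    with path.open("wb") as f:
        pickle.dump(obj, f)


def load_object(path):
    """Load a previously pickled object from path."""
    with Path(path).open("rb") as f:
        return pickle.load(f)


if __name__ == "__main__":
    import tempfile

    # A plain dict stands in for a trained model (assumption for illustration).
    stand_in = {"labels": ["toxic", "insult"], "threshold": 0.5}
    with tempfile.TemporaryDirectory() as tmp:
        target = Path(tmp) / "model" / "example.pkl"  # hypothetical file name
        save_object(stand_in, target)
        print(load_object(target) == stand_in)
```

Only unpickle files you trust, since loading a pickle can execute arbitrary code.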

Generate the output file for the entire Kaggle test set

$cd src

$python main.py test

Test on a single comment using the developed application

$cd src

$python main.py

The application can be accessed from http://localhost:5000/
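To confirm from a script that the application has started, a simple reachability check against the address above is enough. This is a sketch using only the standard library; the helper name `app_is_up` is hypothetical, and it only requests the root page, since the application's other routes are not documented here.

```python
import urllib.error
import urllib.request


def app_is_up(url="http://localhost:5000/", timeout=3):
    """Return True if url answers with a successful HTTP response."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return 200 <= resp.status < 400
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or an HTTP error status.
        return False


if __name__ == "__main__":
    print("Application reachable:", app_is_up())
```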
