may1sharma/Toxic-Content-Classification

Toxic-Content-Filtering

An NLP-based project for filtering toxic content.

Dataset: https://www.kaggle.com/c/jigsaw-toxic-comment-classification-challenge/data

Live Website: http://rashmio2410.pythonanywhere.com/

Libraries required: pandas, nltk, Flask, numpy, scikit-learn, matplotlib. The NLTK tagger resource must also be downloaded once with nltk.download('averaged_perceptron_tagger').
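A quick way to confirm the environment is ready is to check that each library can be imported. The sketch below is an illustration rather than part of the repository; the helper name `missing_modules` is hypothetical, and the list uses import names (scikit-learn is imported as `sklearn`, Flask as `flask`).

```python
from importlib.util import find_spec

# Import names for the libraries listed above (not the pip package names).
REQUIRED_MODULES = ["pandas", "nltk", "flask", "numpy", "sklearn", "matplotlib"]


def missing_modules(modules=REQUIRED_MODULES):
    """Return the names in `modules` that cannot be found in this environment."""
    return [name for name in modules if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules are importable.")
```

This only checks the Python packages; the NLTK tagger resource still has to be downloaded separately.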

Steps to run: Before running any of the commands below, download ‘train.csv’ and ‘test.csv’ from the dataset link above and place them in the ‘data’ directory.
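The prerequisite above can be verified before running anything. This is a minimal sketch, not code from the repository: the helper name `missing_data_files` is hypothetical, and the default path `data` is an assumption that may need adjusting depending on the directory you run it from.

```python
from pathlib import Path

# Files the instructions above say must be present.
REQUIRED_FILES = ("train.csv", "test.csv")


def missing_data_files(data_dir="data"):
    """Return the required dataset files that are absent from `data_dir`."""
    data_path = Path(data_dir)
    return [name for name in REQUIRED_FILES if not (data_path / name).is_file()]


if __name__ == "__main__":
    missing = missing_data_files()
    if missing:
        print("Download these files into the data directory first:", missing)
```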

  1. Create and save model

    $ cd src

    $ python main.py create

    This creates the model and saves it as pickle files in the ‘model’ directory.

  2. Generate the output file for the entire Kaggle test set

    $ cd src

    $ python main.py test

  3. Test on a single comment using the developed application

    $ cd src

    $ python main.py

    The application can then be accessed at http://localhost:5000/
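Steps 1 and 2 follow a common pattern: persist a trained model with pickle, reload it later, and write one row of scores per test comment. The sketch below illustrates that pattern under stated assumptions and is not the repository's main.py. The file name `classifier.pkl` and the function names are hypothetical, and the column layout is the one used by the Kaggle challenge's submission format, which the repository's output may or may not follow.

```python
import csv
import pickle
from pathlib import Path

# Label columns of the Kaggle Jigsaw toxic comment challenge.
LABELS = ["toxic", "severe_toxic", "obscene", "threat", "insult", "identity_hate"]


def save_model(model, model_dir="model", name="classifier.pkl"):
    """Pickle `model` into `model_dir`; the file name is a hypothetical choice."""
    directory = Path(model_dir)
    directory.mkdir(parents=True, exist_ok=True)
    target = directory / name
    with target.open("wb") as fh:
        pickle.dump(model, fh)
    return target


def load_model(path):
    """Load a previously pickled model from `path`."""
    with Path(path).open("rb") as fh:
        return pickle.load(fh)


def write_predictions(rows, out_path):
    """Write (comment_id, scores) pairs as CSV, one score per label."""
    with Path(out_path).open("w", newline="") as fh:
        writer = csv.writer(fh)
        writer.writerow(["id"] + LABELS)
        for comment_id, scores in rows:
            if len(scores) != len(LABELS):
                raise ValueError("expected one score per label")
            writer.writerow([comment_id, *scores])
```

Pickle files should only be loaded when they come from a trusted source, since unpickling can execute arbitrary code.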

About

Multilabel classification of Wikipedia Talk comments to detect toxic behavior.
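In a multilabel setting a single comment can carry several labels at once, or none. A minimal sketch of that idea, assuming the model produces one score per label and that a fixed threshold of 0.5 is used (both are assumptions; the function name is hypothetical and the label names are those of the Kaggle dataset):

```python
# Label names from the Kaggle Jigsaw toxic comment dataset.
LABELS = ["toxic", "severe_toxic", "obscene", "threat", "insult", "identity_hate"]


def labels_above_threshold(scores, threshold=0.5):
    """Return every label whose score reaches `threshold`, in LABELS order.

    `scores` maps label names to values in [0, 1]; missing labels count as 0.
    """
    return [label for label in LABELS if scores.get(label, 0.0) >= threshold]


example = {"toxic": 0.9, "insult": 0.7, "threat": 0.1}
print(labels_above_threshold(example))  # ['toxic', 'insult']
```

Unlike multiclass classification, no label excludes another, so each label is decided independently.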
