Skip to content

Automated CV filtering system written in Python. The script extracts text from PDF, DOCX, and TXT files, checks for predefined keywords, and automatically sorts CVs into "match" and "do_not_match" folders.

Notifications You must be signed in to change notification settings

reemkhaleed/cv-filter-system

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

8 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

CV Filter System (Keyword-Based)

This Python script automatically filters CV files based on keywords. It extracts text from PDFs, DOCX documents, and TXT files, checks for target keywords, and then sorts the CVs into two folders:

  • match/ โ†’ CV contains at least one keyword
  • do_not_match/ โ†’ CV does not contain any keywords

It also supports importing a LinkedIn applicant ZIP file automatically.


๐Ÿ” Features

  • Reads PDF, DOCX, and TXT CVs.
  • Automatically extracts text.
  • Keyword-based filtering (Python, Excel, Machine Learning, etc.).
  • Creates output folders automatically.
  • Supports LinkedIn applicant export ZIP files.
  • Clean and fast filtering โ€” works with large batches of CVs.

Install dependencies:

pip install -r requirements.txt

โ–ถ๏ธ How To Use :

1.Place your LinkedIn exported ZIP file as:

linkedin_applicants.zip

2.Run the script:

python Cv_filter.py

3.Results:

  • Matched CVs โ†’ match/

  • Not matched โ†’ do_not_match/

  • Extracted files โ†’ cvs/

๐Ÿ›  Customize Keywords:

Edit in Cv_filter.py/ Add or remove keywords as needed.

KEYWORDS = ["python", "excel", "data analysis"]

About

Automated CV filtering system written in Python. The script extracts text from PDF, DOCX, and TXT files, checks for predefined keywords, and automatically sorts CVs into "match" and "do_not_match" folders.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages