Skip to content

Plagueowl/Pdf-text-extraction-using-python

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Pdf-text-extraction-using-python

A simple console based python code to extract texts from scanned pdfs using tesseract OCR of python.

Install the tesseract file, copy the tesseract.exe path in the python file, keep the temporary folder as it is because it stores the temporary images generated, it gets deleted afterwards. For word files, give the path with the file name example: D:/files/Texts.docx

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages