Document Scanner App

A simple computer vision project that scans documents from images and converts them into clean, readable PDF files.

Built using OpenCV and Streamlit.

Features

- Detects document edges automatically
- Corrects perspective (flattens the page)
- Removes shadows and background noise
- Enhances readability (black & white scan)
- Supports multiple images
- Exports scanned pages as a single PDF

Tech Stack

Python
OpenCV → image processing
NumPy → numerical operations
Streamlit → web app interface
Pillow → PDF generation

How It Works

The app follows this pipeline:

1. Image Upload
   - The user uploads one or more images.
2. Document Detection
   - Uses contour detection to find the document's edges.
   - Filters the detected contours to keep a 4-sided boundary.
3. Perspective Transform
   - Warps the tilted document into a flat, top-down view.
4. Preprocessing
   - Converts the image to grayscale
   - Removes shadows using background normalization
   - Applies denoising
5. Thresholding
   - Converts the image into clean black & white for readability
6. PDF Generation
   - Combines all processed images into a single PDF
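The final PDF Generation step can be sketched with Pillow's multi-page PDF writer. This is a minimal illustration, not the code in `src/`: the function name `pages_to_pdf` is hypothetical, and it assumes each processed page is available as a `uint8` NumPy array.

```python
from io import BytesIO

import numpy as np
from PIL import Image


def pages_to_pdf(pages):
    """Combine processed pages (uint8 arrays) into one PDF, returned as bytes."""
    if not pages:
        raise ValueError("at least one page is required")
    # Convert every page to RGB so all pages share one mode in the PDF.
    images = [Image.fromarray(page).convert("RGB") for page in pages]
    buffer = BytesIO()
    # save_all + append_images writes the remaining pages after the first one.
    images[0].save(buffer, format="PDF", save_all=True, append_images=images[1:])
    return buffer.getvalue()


# Two blank grayscale "scans" standing in for real processed pages.
demo_pages = [np.full((120, 90), 255, dtype=np.uint8) for _ in range(2)]
pdf_bytes = pages_to_pdf(demo_pages)
```

Returning bytes rather than writing to disk keeps the sketch easy to hand to a download button; writing to a file under `data/output/` would work just as well.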

Project Structure

doc_scanner/
├── app.py
├── src/
│   ├── scanner.py
│   └── utils.py
├── data/
│   ├── input/
│   └── output/
├── requirements.txt
└── README.md

Installation

1. Clone the repository

git clone https://github.com/YOUR_USERNAME/doc_scanner.git
cd doc_scanner

2. Create an environment (conda recommended)

conda create --prefix ./venv python=3.10
conda activate ./venv

3. Install dependencies

pip install -r requirements.txt

Run the App

streamlit run app.py

Then open the local URL that Streamlit prints in the terminal (http://localhost:8501 by default).

Usage

1. Upload one or more images.
2. The app will automatically:
   - detect the document
   - scan and enhance it
3. Click Download PDF to get the final output.

Key Concepts Used

1. Edge Detection

Detects boundaries of objects in the image.

2. Contour Detection

Finds shapes and identifies the document region.

3. Perspective Transform

Maps the document into a flat rectangular view.

4. Image Enhancement

Noise removal
Shadow removal
Contrast improvement

5. Thresholding

Converts image into high-contrast black & white.

Future Improvements

- Live camera scanning
- Automatic cropping
- OCR (text extraction)

Author

Ahan Mondal
