
Ferrules: Modern, fast, document parser written in πŸ¦€


🚧 Work in Progress: Check out our roadmap for upcoming features and development plans.

Ferrules is an opinionated, high-performance document parsing library designed to generate LLM-ready documents efficiently. Unlike alternatives such as unstructured, which are slow and Python-based, ferrules is written in Rust and aims to provide a seamless experience with robust deployment across various platforms.

| NOTE: A ferrule (a corruption of Latin viriola, "small bracelet") is any of a number of types of objects, generally used for fastening, joining, sealing, or reinforcement, such as the metal band that joins the eraser to a pencil.

Features

  • πŸ“„ PDF Parsing and Layout Extraction:

    • Utilizes pdfium2 to parse documents.
    • Supports OCR using Apple's Vision on macOS (using objc2 Rust bindings and VNRecognizeTextRequest functionality).
    • Extracts and analyzes page layouts with advanced preprocessing and postprocessing techniques.
    • Accelerates model inference on the Apple Neural Engine (ANE) or GPU (using the ort library).
    • Merges layout with PDF text lines for comprehensive document understanding.
  • πŸ”„ Document Transformation:

    • Groups captions, footers, and other elements intelligently.
    • Structures lists and merges blocks into cohesive sections.
    • Detects headings and titles using machine learning for logical document structuring.
  • πŸ–¨οΈ Rendering: Provides HTML, Markdown, and JSON rendering options for versatile use cases.

  • ⚑ High Performance & Easy Deployment:

    • Built with Rust for maximum speed and efficiency
    • Zero-dependency deployment (no Python runtime required!)
    • Hardware-accelerated ML inference (Apple Neural Engine, GPU)
    • Designed for production environments with minimal setup
  • βš™οΈ Advanced Functionalities: : Offers configurable inference parameters for optimized processing (COMING SOON)

  • πŸ› οΈ API and CLI:

    • Provides both a CLI and API interface
    • Supports tracing

Installation

Ferrules provides precompiled binaries for macOS, available for download from the GitHub Releases page.

macOS Installation

  1. Download the latest ferrules binary from the releases.

  2. Verify the installation:

    ferrules --version
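
A minimal sketch of the manual install steps, assuming the binary was downloaded to ~/Downloads (adjust the path to wherever you saved it):

chmod +x ~/Downloads/ferrules                  # make the downloaded binary executable
sudo mv ~/Downloads/ferrules /usr/local/bin/   # move it onto your PATH (may require elevated permissions)
ferrules --version                             # confirm the binary is found and runs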

Linux Installation

Linux support with NVIDIA GPU acceleration will be available soon. Keep an eye out for updates on the releases page.

⚠️ Note: Ensure that you have the necessary permissions to execute and move files to system directories.

Visit the GitHub Releases page to find the latest version suitable for your operating system.

Usage

Ferrules can be used in two ways:

1. Command Line Interface (CLI)

Basic Usage

ferrules path/to/your.pdf

This will parse the PDF and save the results in the current directory:

ferrules file.pdf
[00:00:02] [########################################] Parsed document in 108ms
βœ“ Results saved in: ./file-results.json
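
You can also restrict parsing to specific pages and choose where the results are written using the --page-range and --output-dir options documented under Available Options below; a quick sketch (the output path is illustrative):

ferrules file.pdf --page-range 1-5 --output-dir ./parsed   # parse pages 1-5 only, write results under ./parsed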

Debug Mode

To get detailed processing information and debug outputs:

ferrules path/to/your.pdf --debug
[00:00:02] [########################################] Parsed document in 257ms
β„Ή Debug output saved in: /var/folders/x1/1fktcq215tl73kk60bllw9rc0000gn/T/ferrules-XXXX
βœ“ Results saved in: ./megatrends-results.json

Debug mode generates visual output showing the parsing results for each page:

[Image: debug output for page 1 of a scanned "Wizard of Oz" document]

Each color represents different elements detected in the document:

  • 🟦 Layout detection
  • 🟩 OCR parsed lines
  • πŸŸ₯ Pdfium parsed lines

Available Options

Options:
  -r, --page-range <PAGE_RANGE>
          Specify pages to parse (e.g., '1-5' or '1' for single page)
      --output-dir <OUTPUT_DIR>
          Specify the directory to store parsing results [env: FERRULES_OUTPUT_DIR=]
      --save-images
          Save extracted images alongside the parsing results
      --layout-model-path <LAYOUT_MODEL_PATH>
          Specify the path to the layout model for document parsing [env: FERRULES_LAYOUT_MODEL_PATH=]
      --coreml
          Enable or disable the use of CoreML for layout inference
      --use-ane
          Enable or disable Apple Neural Engine acceleration (only applies when CoreML is enabled)
      --trt
          Enable or disable the use of TensorRT for layout inference
      --cuda
          Enable or disable the use of CUDA for layout inference
      --device-id <DEVICE_ID>
          CUDA device ID to use (0 for first GPU) [default: 0]
  -j, --intra-threads <INTRA_THREADS>
          Number of threads to use for parallel processing within operations [default: 2]
      --inter-threads <INTER_THREADS>
          Number of threads to use for executing operations in parallel [default: 1]
  -O, --graph-opt-level <GRAPH_OPT_LEVEL>
          Ort graph optimization level
      --debug
          Activate debug mode for detailed processing information [env: FERRULES_DEBUG=]
      --debug-dir <DEBUG_DIR>
          Specify the directory to store debug output files [env: FERRULES_DEBUG_PATH=]
  -h, --help
          Print help
  -V, --version
          Print version
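
As a sketch of how the acceleration flags above combine (actual speedups depend on your hardware, and the CUDA/TensorRT path is tied to the upcoming Linux builds):

ferrules file.pdf --coreml --use-ane          # macOS: CoreML layout inference on the Apple Neural Engine
ferrules file.pdf --cuda --device-id 0 -j 4   # NVIDIA: CUDA on the first GPU with 4 intra-op threads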

You can also configure some options through environment variables:

  • FERRULES_OUTPUT_DIR: Set the output directory
  • FERRULES_LAYOUT_MODEL_PATH: Set the layout model path
  • FERRULES_DEBUG: Enable debug mode
  • FERRULES_DEBUG_PATH: Set the debug output directory
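
For example, assuming the usual convention that any non-empty value enables a boolean environment flag:

FERRULES_OUTPUT_DIR=./parsed FERRULES_DEBUG=1 ferrules file.pdf   # set the output directory and debug mode via the environment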

2. HTTP API Server

Ferrules also provides an HTTP API server for integration into existing systems. To start the API server:

ferrules-api

By default, the server listens on 0.0.0.0:3002. For detailed API documentation, see API.md.
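
A hypothetical sketch of calling the server with curl; the /parse route and the file field below are placeholders, so check API.md for the actual endpoints and payloads:

ferrules-api &                                         # start the server (listens on 0.0.0.0:3002 by default)
curl -F "file=@file.pdf" http://localhost:3002/parse   # hypothetical endpoint; see API.md for the real route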


Credits

This project uses models from the yolo-doclaynet repository. We are grateful to the contributors of that project.