RagPull: Full-Stack RAG WebApp

An end-to-end Retrieval-Augmented Generation (RAG) platform built with production-ready document ingestion, semantic search, and AI-driven chat, and a scalable, async backend processing pipeline.

🚀 Key Features

Asynchronous Document Ingestion: A robust backend worker queue built on PostgreSQL orchestrates the extraction, chunking, and embedding of various document types (PDF, CSV, TXT, etc.).
Local AI Integration: Leverages local LLMs via Ollama (qwen2.5:14b-instruct for document structuring, mxbai-embed-large for embeddings) for entirely private, offline processing. (can be switched to online LLMS from env variables - gemini and opencode zen are available)
RAG Chat Interface: An intuitive chat UI where users can query their uploaded knowledge base. The AI responses include precise citations, revealing exactly which document chunks and vector match percentages informed the answer.
Production-Ready Architecture: Designed with a decoupled frontend (React/Vite) and backend (Hono API), complete with authentication (Firebase) and a relational database (PostgreSQL/Drizzle), ensuring a smooth path from local development to cloud deployment.

🛠️ Tech Stack

Frontend: React, TypeScript, Vite, Tailwind CSS, ShadCN UI
Backend: Node.js, Hono API, background worker processes
Database & ORM: PostgreSQL, Drizzle ORM (handling both application data and vector storage/queues)
AI & Embeddings: Ollama (Local LLMs), pluggable provider architecture, Gemini, Qwen, MinMax-2.5
Authentication: Firebase Auth (with local emulator support)
Deployment: Cloudflare Pages & Workers ready

⚡ Architecture & Processing Flow

The core of RagPull is its asynchronous ingestion pipeline and subsequent semantic search capabilities. Here is how data flows through the system:

flowchart TD
    %% Styling
    classDef user fill:#e2e8f0,stroke:#64748b,color:#0f172a
    classDef frontend fill:#bae6fd,stroke:#3b82f6,color:#0f172a
    classDef api fill:#fef08a,stroke:#0ea5e9,color:#0f172a
    classDef worker fill:#fed7aa,stroke:#eab308,color:#0f172a
    classDef db fill:#bbf7d0,stroke:#22c55e,color:#0f172a
    classDef ai fill:#fbcfe8,stroke:#ec4899,color:#0f172a

    %% Nodes
    User(("User")):::user
    
    subgraph Frontend [React UI]
        UploadUI["Upload Interface"]:::frontend
        ChatUI["RAG Chat Interface"]:::frontend
    end

    subgraph Backend [Hono API]
        UploadRoute["POST /uploads"]:::api
        ChatRoute["POST /chat"]:::api
    end

    subgraph QueueWorker [Async Processing]
        Worker["Background Worker"]:::worker
        Structurer["Document Structurer<br/>(Qwen2.5)"]:::ai
        Embedder["Embedding Model<br/>(mxbai-embed)"]:::ai
    end

    subgraph Database [PostgreSQL]
        AppDB[("App Data<br/>(Users, Jobs)")]:::db
        VectorDB[("Vector Store<br/>(pgvector)")]:::db
    end

    %% Upload Flow
    User -->|Uploads Document| UploadUI
    UploadUI -->|Multipart File| UploadRoute
    UploadRoute -->|Enqueues Job| AppDB
    
    %% Ingestion Flow
    AppDB -->|Claims Job| Worker
    Worker -->|Extracts Text| Structurer
    Structurer -->|Structured Chunks| Worker
    Worker -->|Generates Vectors| Embedder
    Embedder -->|Embeddings| Worker
    Worker -->|Persists Chunks & Vectors| VectorDB

    %% Chat Flow
    User -->|Asks Question| ChatUI
    ChatUI -->|Query| ChatRoute
    ChatRoute -->|Embeds Query| Embedder
    Embedder -->|Query Vector| ChatRoute
    ChatRoute -->|Semantic Search| VectorDB
    VectorDB -->|Top K Chunks| ChatRoute
    ChatRoute -->|Generates Answer| AIModel["LLM Generator"]:::ai
    AIModel -->|Response + Citations| ChatUI

The Dashboard

The central hub for navigating the application.

Data Ingestion

The interface for uploading documents, complete with real-time, granular progress tracking as the background worker processes the queue.

Semantic Search & Chat

The conversational interface for querying the knowledge base. Notice the "Sources" expansion, which provides transparency into the RAG process by showing the exact matched chunks.

💻 Local Development

Everything needed to run RagPull is containerized or embedded for a seamless local developer experience. The whole project can be run with one command: docker compose up --build

Install dependencies:
```
pnpm install
```
Start all services: This single command spins up the frontend, backend API, async worker, embedded PostgreSQL database, and local Firebase Auth emulator.
```
pnpm run dev
```
Local AI Setup: Ensure Ollama is installed and the required models are pulled:
```
ollama pull qwen2.5:14b-instruct
ollama pull mxbai-embed-large
```

For detailed setup, port management, and production deployment guides, refer to the docs/ directory.

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
database-server		database-server
docker		docker
docs		docs
scripts		scripts
server		server
ui		ui
.cursorrules		.cursorrules
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
firebase.json		firebase.json
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
pnpm-workspace.yaml		pnpm-workspace.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RagPull: Full-Stack RAG WebApp

🚀 Key Features

🛠️ Tech Stack

⚡ Architecture & Processing Flow

The Dashboard

Data Ingestion

Semantic Search & Chat

💻 Local Development

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

RagPull: Full-Stack RAG WebApp

🚀 Key Features

🛠️ Tech Stack

⚡ Architecture & Processing Flow

The Dashboard

Data Ingestion

Semantic Search & Chat

💻 Local Development

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages