A streamlined document chat application powered by DeepSeek-R1 that allows users to upload documents and engage in natural conversations about their content. This application supports PDF, DOCX, and image files, making it versatile for various document types.
- Document Processing: Upload and process PDF, DOCX, and image files
- Text Extraction: Automatic text extraction from various file formats
- Interactive Chat: Natural conversation interface for document queries
- Real-time Processing: Live document analysis and response generation
- Clean UI: Simple and intuitive Streamlit interface
- File Support: Handles multiple document formats with size limit checks
streamlit
together
pdfplumber
python-docx
pytesseract
Pillow
pdf2image
- Clone the repository:
git clone https://github.com/fahadyaseen001/DeepSeek-AI-Document-Chat-Assistant.git
cd DeepSeek-AI-Document-Chat-Assistant
- Install required dependencies:
pip install -r requirements.txt
- Install Tesseract OCR (required for image processing):
- For Ubuntu/Debian:
sudo apt-get install tesseract-ocr
- For macOS:
brew install tesseract
- For Windows: Download and install from: https://github.com/UB-Mannheim/tesseract/wiki
- Add required images:
- Place
deepseek-color.png
in the root directory
- Get your API key:
- Sign up at Together.ai
- Generate an API key from your dashboard
- Run the application:
streamlit run app.py
- In your web browser:
- Enter your Together API key in the sidebar
- Upload a document (PDF, DOCX, or image)
- Wait for processing completion
- Start asking questions about your document
- PDF documents (
.pdf
) - Word documents (
.docx
) - Images (
.png
,.jpg
,.jpeg
)
- Maximum file size: 200MB
- Maximum text processing: 6000 characters per query
The application is built with the following components:
- Frontend: Streamlit web interface
- Document Processing: pdfplumber, python-docx, pytesseract
- AI Model: DeepSeek-R1 via Together API
- Session Management: Streamlit session state
To contribute to the project:
- Fork the repository
- Create your feature branch (
git checkout -b feature/AmazingFeature
) - Commit your changes (
git commit -m 'Add some AmazingFeature'
) - Push to the branch (
git push origin feature/AmazingFeature
) - Open a Pull Request
DeepSeek AI Document Chat Assistant: An intelligent document analysis tool powered by DeepSeek-R1. Upload documents and chat naturally about their contents. Supports PDFs, Word documents, and images with an intuitive interface built using Streamlit.
Key features:
• Multi-format document support
• Natural language document queries
• Real-time processing
• Clean, user-friendly interface
• Powered by DeepSeek-R1 AI
This project is licensed under the MIT License - see the LICENSE file for details.
- DeepSeek-R1 model by DeepSeek
- Together.ai for API access
- Streamlit for the web framework
- Various open-source libraries used in the project