Feature/pdf ingestion jpdfium #6525

Open
EthanHealy01 wants to merge 8 commits into main from feature/pdf-ingestion-jpdfium

Conversation

@EthanHealy01
Contributor

PDF ingestion / convert-to-Markdown agent. This PR also replaces the current convert-to-Markdown API in Java.

TextLine-driven converter (tables: bordered/borderless, multi-table, uneven
rows, cross-page stitching, wrapped cells; multi-signal heading detection;
image metadata; two-column handling). Wires the orchestrator convert_markdown
path to the deterministic Java endpoint. Synthetic/owned test fixtures only.
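
To illustrate what "multi-signal heading detection" over text lines can look like, here is a minimal sketch. It is not the code from this PR: the `Line` record, the `HeadingSketch` class, and every threshold are hypothetical, chosen only to show the idea of requiring several independent signals (relative font size, bold weight, no trailing period) before a line is emitted as a Markdown heading.

```java
import java.util.List;

// Hypothetical sketch: names and thresholds are illustrative, not taken from the PR.
final class HeadingSketch {

    // Assumed minimal shape of a text line: content, font size in points, bold flag.
    record Line(String text, float fontSize, boolean bold) {}

    // Returns a Markdown heading level (1-3), or 0 when the line looks like body text.
    static int headingLevel(Line line, float bodyFontSize) {
        String text = line.text().trim();
        if (text.isEmpty() || text.length() > 80) {
            return 0; // headings are assumed to be short, non-empty lines
        }
        float ratio = line.fontSize() / bodyFontSize;
        int signals = 0;
        if (ratio >= 1.15f) signals++;       // noticeably larger than body text
        if (line.bold()) signals++;          // bold weight
        if (!text.endsWith(".")) signals++;  // no sentence-ending period
        if (signals < 2) {
            return 0; // one signal on its own is not treated as enough
        }
        if (ratio >= 1.6f) return 1;
        if (ratio >= 1.3f) return 2;
        return 3;
    }

    // Emits one Markdown line per input line, prefixing detected headings with '#'.
    static String toMarkdown(List<Line> lines, float bodyFontSize) {
        StringBuilder out = new StringBuilder();
        for (Line line : lines) {
            int level = headingLevel(line, bodyFontSize);
            if (level > 0) {
                out.append("#".repeat(level)).append(' ');
            }
            out.append(line.text().trim()).append('\n');
        }
        return out.toString();
    }
}
```

Requiring at least two signals is a design choice in this sketch: a bold word inside a paragraph or a single large-font line should not become a heading by itself. A real converter would likely derive the body font size from the document rather than take it as a parameter.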
dosubot (bot) added the size:XXL and enhancement labels on Jun 3, 2026.
stirlingbot (bot) added the Documentation, Java, API, Test, and engine labels and removed the enhancement label on Jun 3, 2026.
Comment thread app/common/src/main/java/stirling/software/common/pdf/PdfMarkdownConverter.java Outdated
@github-actions
Contributor

github-actions Bot commented Jun 3, 2026

Engine Check Failed

There are issues with your Python code that need to be fixed before it can be merged.

Run task engine:fix to auto-fix what can be fixed automatically, then run task engine:check to see what still needs fixing manually.

@stirlingbot
Contributor

stirlingbot Bot commented Jun 3, 2026

🚀 V2 Auto-Deployment Complete!

Your V2 PR with embedded architecture has been deployed!

🔗 Direct Test URL (non-SSL): http://54.175.155.236:6525

🔐 Secure HTTPS URL: https://6525.ssl.stirlingpdf.cloud

This deployment will be automatically cleaned up when the PR is closed.

🔄 Auto-deployed for approved V2 contributors.


Labels

API: API-related issues or pull requests
Documentation: Improvements or additions to documentation
engine
Java: Pull requests that update Java code
size:XXL: This PR changes 1000+ lines ignoring generated files.
Test: Testing-related issues or pull requests
