Skip to content

Core: Coalesce contiguous blob reads in PuffinReader#15649

Open
hemanthboyina wants to merge 2 commits intoapache:mainfrom
hemanthboyina:puffin_coalesce_reads
Open

Core: Coalesce contiguous blob reads in PuffinReader#15649
hemanthboyina wants to merge 2 commits intoapache:mainfrom
hemanthboyina:puffin_coalesce_reads

Conversation

@hemanthboyina
Copy link
Contributor

PuffinReader.readAll() currently issues a separate seek+read for each blob. Since blobs are written back-to-back in a Puffin file, this results in unnecessary I/O overhead. This PR coalesces contiguous blobs into a single read. Blobs are sorted by offset, grouped into ranges where adjacent blobs touch, and each range is read in one I/O call.

@github-actions github-actions bot added the core label Mar 16, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant