-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Handling large IFD data arrays #70
Comments
Fetching 16kb is just a default. The user can override how much to prefetch. And the user can add a "block cache middleware" or something like that to speed up requests that extend beyond the prefetch range. |
The main problem is that array size is a-priori unknown and a good estimate is hard to make, unless the application knows exactly which types of COGs are going to be handled I've thought about this a bit and think that what would allow the user to do such a thing is to be clear about where we are in the file (fetching metadata or data),
but that underestimates by a bit, resulting in another fetch for less bytes, which conveniently also clears the data structure :) - I've tested with some COGs, undershoot in bytes, percentage of first count:
|
Ok it sounds like your desire is "fetch multiple overviews for a tile concurrently"? Are you speaking of data or IFD metadata? I don't think this is anything that async-tiff needs to concern itself at the core level. The main things to consider when deciding whether to merge requests or not are:
None of those need to know whether the range we're currently fetching is a tile at which zoom level. |
Instead of doing this, just make separate fetches for each tile (pseudocode)
|
Ok, sorry, it's specifically about tag data, which is IFD metadata. like so:
I would like to add:
That these arrays can vary in size so much is my reason for #76 and #74 cast showing http example |
EDIT: Since deferring fetching of
tile_info=tile_offsets+tile_byte_counts
was quite an intrusive change, and the remainingtile_info
s can be reasonably estimated from the firsttile_offsets
, this issue changed a bit in my question and is now mainly about #76TileByteCounts and TileOffsets arrays are special-cased in the COG spec, and don't generally fit within the first 16kB.
Thus, using
coalesce_ranges
and waiting with fetching that data until all tags are read could be beneficial (reduce the number of requests). I've updated the amazing diagram to include COG file structure:The text was updated successfully, but these errors were encountered: