Skip to content

Problem when using the ‘hdr_info’ property of ‘to_markdown()’. #198

Answered by JorjMcKie
Fianax asked this question in Q&A
Discussion options

You must be logged in to vote

I suggest you make a list of the spans that actually are created to see what the situation is:

blocks = page.get_text("dict",flags=pymupdf.TEXTFLAGS_TEXT)["blocks"]
spans = [s for b in blocks for l in b["lines"] for s in l["spans"]]
for s in spans:
    print(f"{s['text']=}")

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@Fianax
Comment options

Answer selected by Fianax
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants
Converted from issue

This discussion was converted from issue #197 on November 29, 2024 09:14.