Skip to content

White space BBOX is Wrong #823

@mailsnathaniel

Description

@mailsnathaniel

Hi,

I White space bbox is wrong. I have even used ascender/decender to get the actual ymin and ymax.

I have attached the input and output (span chunks are marked in red outline).

FYI - This input pdf is created using ABBY OCR.

Configurations:

  • Ubuntu
  • Python3.6
  • PyMuPDF 1.18.6

Thanks
spaces_bbox
cheesecake-20191221_003.pdf

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions