Hi,
I White space bbox is wrong. I have even used ascender/decender to get the actual ymin and ymax.
I have attached the input and output (span chunks are marked in red outline).
FYI - This input pdf is created using ABBY OCR.
Configurations:
- Ubuntu
- Python3.6
- PyMuPDF 1.18.6
Thanks

cheesecake-20191221_003.pdf