page.get_text('blocks') output two piece of very similar text with different bbox

### Description of the bug

When I use `page.get_text('blocks')` ,  I get the very similar text with different bbox.
The output of Page 5 (start from 1) as follows:
![image](https://github.com/user-attachments/assets/73cf0062-4f4a-4ce8-b939-2cc86c6289c5)
And the associated page as follows:
![image](https://github.com/user-attachments/assets/8c561ba8-fbe7-4431-84ef-319fe346a9d6)

The raw pdf is
[00b3ad2ad0af97ec4a85274510343e04.pdf](https://github.com/user-attachments/files/17644118/00b3ad2ad0af97ec4a85274510343e04.pdf)
I think block 12 is the redundant one.

What's more, my python version is actually 3.8.19 but I select 3.9 because the available choice is start from 3.9

### How to reproduce the bug

```
import fitz

with open("./00b3ad2ad0af97ec4a85274510343e04.pdf", "rb") as f:
    pdf_bytes = f.read()
document = fitz.open(stream=pdf_bytes, filetype="pdf")

for i in range(document.page_count):
    if i==4:
        page = document.load_page(i)
        blocks = page.get_text("blocks")
        for i, block in enumerate(blocks):
            print(f"block {i}:", block)
            print('\n')
```

### PyMuPDF version

1.24.6

### Operating system

Linux

### Python version

3.9

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

page.get_text('blocks') output two piece of very similar text with different bbox #4026

Description of the bug

How to reproduce the bug

PyMuPDF version

Operating system

Python version

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

page.get_text('blocks') output two piece of very similar text with different bbox #4026

Description

Description of the bug

How to reproduce the bug

PyMuPDF version

Operating system

Python version

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions