Description of the bug
Vertical text in table columns is no longer handled correctly. It worked before the recent switch to the "word extractor" methods.
How to reproduce the bug
Call extract() on any table that has vertical text
Example file: the two cells that contain the vertical text "Text" and "Numbers" are returned with their characters reversed, as "txeT" and "srebmuN".
table_examples_vertical_text.pdf
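A minimal reproduction sketch, assuming the example file is in the working directory and PyMuPDF is importable as `fitz` (as in 1.23.x). The helper names `find_reversed_cells` and `check_pdf` are made up for this illustration; only `fitz.open`, `page.find_tables()`, `.tables` and `Table.extract()` are PyMuPDF calls.

```python
def find_reversed_cells(rows, expected_words):
    """Return (row, col, text) for cells holding an expected word spelled backwards."""
    # Skip palindromes: their reversed form is indistinguishable from the original.
    reversed_words = {w[::-1] for w in expected_words if w != w[::-1]}
    hits = []
    for r, row in enumerate(rows):
        for c, cell in enumerate(row):
            # extract() may return None for empty cells.
            if cell is not None and cell.strip() in reversed_words:
                hits.append((r, c, cell.strip()))
    return hits


def check_pdf(path, expected_words=("Text", "Numbers")):
    """Run find_tables() on every page and collect cells with reversed text."""
    import fitz  # PyMuPDF; imported lazily so the helper above works without it

    hits = []
    doc = fitz.open(path)
    for page in doc:
        for table in page.find_tables().tables:
            hits.extend(find_reversed_cells(table.extract(), expected_words))
    doc.close()
    return hits
```

Calling `check_pdf("table_examples_vertical_text.pdf")` should return a non-empty list on an affected version and an empty list once the cells come back as "Text" and "Numbers".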
PyMuPDF version
1.23.21
Operating system
Windows
Python version
3.11
The fix includes support for all rotations by multiples of 90°.
You may be aware that line breaks in a cell are preserved. This makes some sense as a last resort for identifying additional table rows, and in cases with complex cell content, but only for rotation 0 (horizontal) text. In the other cases (rotations by 90°, 180°, 270°) I am replacing line breaks with spaces.
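The line-break rule described above can be sketched roughly as follows. This is only an illustration of the stated behavior, not the code in the fix; the function name `normalize_cell_text` and its parameters are assumptions.

```python
def normalize_cell_text(text, rotation):
    """Keep line breaks for horizontal text, replace them with spaces otherwise.

    `rotation` is the text rotation in degrees and must be a multiple of 90.
    """
    if rotation % 90 != 0:
        raise ValueError("rotation must be a multiple of 90 degrees")
    if rotation % 360 == 0:
        # Horizontal text: line breaks may hint at extra rows, so keep them.
        return text
    # Rotated text (90, 180, 270 degrees): line breaks become spaces.
    return text.replace("\n", " ")
```

For example, `normalize_cell_text("Text\nNumbers", 90)` gives a single line, while the same call with rotation 0 leaves the line break in place.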
Forgot to mention that the rotation of all of the cell content is determined by its first character / word. In other words, there cannot be different rotations of content in one single cell. But the following works perfectly: