You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
On the second page, highlight the word "Mitarbeiterinnen" by double-clicking it
What is the expected behavior?
The word should be recognized as a single word and highlighted as such.
What went wrong?
Instead, it's recognized as 3 words: "Mit", "arbei", "terinnen"
This happens for many words in this PDF.
Is there anything that can be changed in how I use pdfjs-dist to make it recognize this as a single word? Just looking at the PDF, I can't tell what would cause this behavior.
The text was updated successfully, but these errors were encountered:
and there are 2 extra spaces (the ( )Tj).
Since there are no Td after the Tj then the spaces are behind the char following them.
Anyway those spaces are in some marked content sections (BDC/EMC) and I didn't find anything in specs on how to deal with that case.
https://boersengefluester.de/wp-content/uploads/assets/annuals/2019/578560.pdf
Configuration:
Steps to reproduce the problem:
What is the expected behavior?
The word should be recognized as a single word and highlighted as such.
What went wrong?

Instead, it's recognized as 3 words: "Mit", "arbei", "terinnen"
This happens for many words in this PDF.
Is there anything that can be changed in how I use pdfjs-dist to make it recognize this as a single word? Just looking at the PDF, I can't tell what would cause this behavior.
The text was updated successfully, but these errors were encountered: