Regarding some issues with reading mathematical symbols. #3681
-
Beta Was this translation helpful? Give feedback.
Replies: 3 comments 5 replies
-
This is no issue / bug but a Discussions item -> transferring. |
Beta Was this translation helpful? Give feedback.
-
I extracted the font file from the PDF document (see the attached package) and matched the corresponding font file to the traversed font attribute for text insertion. However, some symbols are still missing. I noticed through the log that the text contains content like text="\x10", which is missing when inserted. The font is "CMEX10". So, how should I handle this better? |
Beta Was this translation helpful? Give feedback.
-
Thank you for the suggestion. Currently, I'm resolving it by matching the existing text information to the corresponding Unicode or glyph name information in the font file. @JorjMcKie |
Beta Was this translation helpful? Give feedback.
I cannot give you more advice here. If text extraction delivers a
\x10
, then that is all that can ever be known. If the font in question has proper documentation, then maybe there is some mapping information as to which Unicode name\x10
is pointing to, which you could use subsequently.Otherwise ... bad luck!
I think I mentioned that fonts in general may not be error-free nor complete in terms of these back-references.