I am looking for a way to obtain the position of individual characters in the OCR output. I can get the position of each word, but not of the individual character. Is there a way to do this?
I considered looking at the strings and then dividing them, either into equal parts or into parts based on the “width-profile” of the character.
However, for Arabic that approach will not work as the language contains text “extensions” (tatwheel) which may be placed between characters to create a more aesthetically pleasing text.
Thanks,
Soeren