Won´t scan tickets correctly

Fer_Torres · May 17, 2025, 8:45pm

Hi, I´m currently testing the OCR with pictures of tickets but it doesn´t scan the products correctly, it scans other parts of the ticket, does anyone have advice on how to filter the products? thanks in advance!

ocr-api-team · May 20, 2025, 9:07am

Hi, can you please post a sample image for us to test?

Fer_Torres · May 21, 2025, 9:58pm

Fer_Torres · May 21, 2025, 9:58pm

I uploaded 3 tickets, if you need more I can send them, thanks!

ocr-api-team · May 22, 2025, 12:32pm

I did a test with the first image with OCR Engine2, and the product text looks good to me - but the quantity is missing. Single digit OCR is tricky.

→ Just to clarify: Is this (the missing quantity) the issue you are seeing?

Fer_Torres · May 22, 2025, 10:56pm

nope not exactly, i wanted to extract the data in a json format if possible to only show the products, the quantity and price but it shows everything contained in the ticket; also, some products appear as “empty”, is there something I can do to fix these two issues?

ocr-api-team · May 26, 2025, 8:36am

Our OCR API always returns all text in a document/image. If you need only specific data, you can then post-process the OCR result. You could, for example, use regular expressions for this, or feed it into one of the LLM like ChatGPT, Gemini or Mistral.

ocr-api-team · May 26, 2025, 8:38am

For this issue, can you please post an example (overlay) screenshot that shows the missing data? There might be a fix for this, once we see the issue.