Online OCR and compression?

I am attempting to have an image analyzed by the free OCR API endpoints, but it’s not extracting all of the data from the image. I tried to use the Online OCR Analyzer, and got the same result as my API. I want to make sure if I pay for the Pro analyzer, that it’s going to give me the result I’m hoping for. Any way to test that?

Adam

Hello, can you post an example image? Then we can hopefully help you to improve the OCR result.

Sure! This is a paint label that I am working with. I want to be able to extract out all of the information on the label for storing in a database. FWIW - I’m using Laravel, PHP, and a MySQL DB.

Hey there, any feedback for me? I realized I didn’t “reply” to you. Here is another attachment:

Just checking in. It’s been a minute. Hoping you can help me out.

Hello and Happy New Year :slight_smile:

I confirmed that OCR Engine 1 and even 2 miss for example the single digits like “-”, “1” and “3”.

Solution: In only about 1-2 weeks we will release a new OCR Engine that brings much improved OCR results. I tested your image with your internal beta version, and the result is as follows (see below).

For me, it seems the OCR result is error free, or? If you have a few other images that you want me to test, just post them here.

SHERWIN-WILLIAMS 701288 01/25/24
616-667-3310 Order# 0254351

INTERIOR ARCHITECTURAL
DURATION HOME LATEX
FLAT IFC 8112NP

6821 POTENTIALLY PURPLE SHER-COLOR FORMULA

CCE*COLORANT	OZ	32	64	128
B1-Black	-	1	1	-
L1-Blue	-	3	1	-
R3-Magenta	-	15	1	1
ONE GALLON EXTRA WHITE
A95W01351 651049579

Non Returnable Tinted Color

CAUTION: To assure consistent color, always order enough paint to complete the job and intermix all containers of the same color before application. Mixed colors may vary slightly from color strip or color chip.

0254351-001

The new OCR engine will also have the option to return the text in markdown format, so for example the table structure is preserved:

CCE*COLORANT OZ 32 64 128
B1-Black - 1 1 -
L1-Blue - 3 1 -
R3-Magenta - 15 1 1

ONE GALLON EXTRA WHITE
A95W01351 651049579

The new OCR Engine3 is now available for beta testing :slightly_smiling_face:

1 Like