I truly appreciate the excellent work you’ve done with your website, ocr.space. It’s a powerful and user-friendly platform. However, I noticed that Urdu support is not available. Considering that Urdu is spoken by millions of people worldwide and is the national language of Pakistan, I believe adding it would greatly improve accessibility and usability for a wider audience.
To support this, I have a large Urdu dataset (40GB+) that I’d be happy to share with you to help train your model and enable Urdu OCR support. Please send me a private message so I can share my TeraBox login and password, which would make it easy to download all the data. I will also be happy to test Urdu OCR once it is enabled on the website.
Thanks for the offer. Support for Urdu OCR is on our “to-do” list for future OCR language updates, along with quite a few other languages. I will update this forum post once it is available.
Here's a simple Urdu example sentence: "میرا نام علی ہے" (Mera naam Ali hai), which means "My name is Ali". It is a fundamental phrase for introductions and shows the verb-final, Subject-Object-Verb (SOV) word order common in Urdu.
Here are a few more examples:
• میں ٹھیک ہوں۔ (Main theek hoon.) - "I am fine."
• آپ کہاں سے ہیں؟ (Aap kahan se hain?) - "Where are you from?"
• یہ بہت اچھا ہے۔ (Yeh bohat acha hai.) - "This is very good."