Looking at the OCR API page, it says the PRO PDF plan allows uploads “100MB+”. But when I try to upload anything over 95MB (100000000 bytes), I get an error. How can we process 100MB+ files?
Hello, thanks for using our OCR API
. There is no hard file size limit on the PRO PDF plan. However, there is a 10 minute connection timeout limit for the OCR API call. So depending on the Internet speed, the connection might time out before the file is fully uploaded. Here are our recommendations:
- If possible, use the URL method instead of uploading the (large) file. This makes the file upload almost instant. In addition, it also makes the internal OCR conversion a bit faster. Because in this case, the URL is used internally, not the large PDF file.
- Consider splitting the large file into smaller chunks. There are some good and free PDF file splitting command line tools available. If you need a recommendation, let me know.
When I try to submit the URL of a large PDF, I get the following response after a few seconds:
{
"OCRExitCode": 3,
"IsErroredOnProcessing": true,
"ErrorMessage": [
"E214:File failed validation. File size exceeds the maximum size limit. Maximum size limit 100000 KB"
],
"ErrorDetails": "",
"ProcessingTimeInMilliseconds": "343"
}
Oh, I stand corrected. Development advised me that there is a 100MB file size limit. We added that for security reasons and to avoid API overload.
If you like, we can increase this limit for your account, at no extra cost. Just contact PRO tech support and mention this forum post. However, I still do not recommend to send files that are significantly larger than 100 MB, as you will run in the API timeout issue that I mentioned above.
Ok, thanks. I contacted support twice but haven’t heard back yet.