Looking for someone to build Python software will take old scanned pdf book with mix of multi languages text and images and run it through google vision OCR to digitalize all the text.
* Software must be able to perform OCR on a whole book or just few pages for testing
*Software need to be a multi-threaded , have the option to bulk OCR all files in a folder.
*Software must be able to handle possible interruptions and able to continue the work from where it stopped. *Software must be able to keep the structure of the book like paragraphs and images locations for example. *Software must be able to reduce the size of the book since it's digitizing all the text and not just adding a text layer.
* And Finally would prefer if there is a user Web GUI but would be ok to just execute from command line on windows machine.
Hi, I would create a proper Web GUI in Django with all of the Utilities needed for a proper GUI for your Google Cloud Vision OCR on PDF Files. The Files in PDF Format could be automatically fetched from an email folde Daha Fazla
Bu iş için 11 freelancer ortalamada $185 teklif veriyor
Hi - This job looks like a good fit with my skill set and experience. I hold Bachelor of Computer Science and Master of Data Science Please see my profile and reviews for references.
I have experience to convert PDFs files to text with Java OCR library... With my web developer skills I can even develop a good web GUI for a better user interation.