Hi, I was wondering if there is any way to change the size of a document's text box once the OCR is done? (See the pictures included.)
Before the OCR, when I draw the text zone, I make it wide around the text (see Before). When I apply OCR, the box resizes itself to fit very tightly around the text (see After). The problem this creates (see Result) is that the text no longer fits in the box, so I can lose up to 4 sentences at the bottom of the page. I then have to change the font type and size manually so that the text fits.
Is there any way to prevent that from happening?
Please note that I have many hundreds of documents (so thousands of pages to go through), and there is no uniformity between the documents.
Thanks, everybody, for any ideas.
I left my PC on overnight during a Daylight Saving change, and DocuDirect Folder Watch reprocessed all of the PDF image files, apparently because it thought their time stamps had changed. Needless to say, it got out of control: it created thousands of new OCR'd files over the weekend and overwrote some existing files in the output folder.
Is there an option to ignore time changes due to Daylight Saving, or a patch for OmniPage DocuDirect Folder Watch that fixes this time-change bug? (It happened on Sunday at 3 a.m., when the Microsoft system clock went back to 2 a.m.)
OmniPage Ultimate version 19.0, running on old but reliable XP (back-end OCR box).
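I don't know how Folder Watch decides that a file has changed. If it compares stored modification times, though, a watcher can in principle be made tolerant of a one-hour clock shift. The Python sketch below only illustrates that idea, outside OmniPage: `looks_modified`, `DST_SHIFT_SECONDS` and the 2-second tolerance (chosen because FAT volumes store write times at 2-second resolution) are hypothetical names and values, not OmniPage settings.

```python
# Hypothetical helper, not part of OmniPage or DocuDirect.
DST_SHIFT_SECONDS = 3600  # size of a typical Daylight Saving clock change


def looks_modified(prev_mtime, prev_size, cur_mtime, cur_size, tolerance=2.0):
    """Guess whether a file really changed between two scans of a folder.

    prev_* are the values recorded on the previous scan, cur_* the values
    seen now (for example from os.stat(path).st_mtime and .st_size).
    """
    # A different size means the content changed, whatever the clock says.
    if cur_size != prev_size:
        return True

    delta = abs(cur_mtime - prev_mtime)

    # Same timestamp, within file-system rounding: unchanged.
    if delta <= tolerance:
        return False

    # Timestamp moved by almost exactly one hour with the size unchanged:
    # treat it as a clock-change artifact rather than a real edit.
    if abs(delta - DST_SHIFT_SECONDS) <= tolerance:
        return False

    return True
```

The trade-off is that a genuine edit that keeps the file size identical and lands exactly one hour after the previous timestamp would be missed, which seems unlikely for scanned PDF images.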
I have OmniPage Ultimate running DocuDirect on a server. A single job runs on it that watches a folder and OCRs any PDF files placed in that folder. It outputs the newly OCR'd PDF into a new folder and deletes the original. We've had this running for a few years now. Someone just brought to my attention that the files it creates contain a bookmark for every single page, which is problematic for our users. I have been through the options and cannot find a way to turn that feature off or change it. Any ideas?
I just started using OmniPage 18, and I have been trying to figure out what the "PDF Overlay Matching" feature is and how to use it.
One explanation I found through a Google search said:
PDF Overlay Matching: Checks OCR output against underlying text for dramatically better conversion results - provides 100% character recognition accuracy for the majority of commonly used PDF formats.
This almost sounds like I should be able to define multiple zone templates, run Omnipage, and it could check the OCR output against each of these zone templates and match them to the correct zone template for a more accurate conversion.
But I think it must mean something else because I have not found a way to do that.
Could anyone explain what "PDF Overlay Matching" actually means, and in what context it is used?
Is there anywhere I can find a log of the files OmniPage DocuDirect has successfully processed?
Additionally, from a technical standpoint, how does DocuDirect know which files have been processed and which haven't?
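I can't say how DocuDirect tracks this internally. As a generic illustration of how a folder watcher could keep its own record, here is a minimal Python sketch that logs processed files by content hash rather than by time stamp. The manifest file and the function names (`file_digest`, `load_manifest`, `needs_processing`, `mark_processed`) are all hypothetical and have nothing to do with OmniPage itself.

```python
import hashlib
import json
from pathlib import Path


def file_digest(path):
    """Return the SHA-256 hex digest of a file's contents."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        # Read in chunks so large scanned PDFs are not loaded into memory at once.
        for chunk in iter(lambda: f.read(65536), b""):
            h.update(chunk)
    return h.hexdigest()


def load_manifest(manifest_path):
    """Load the digest -> file name log, or an empty one if none exists yet."""
    p = Path(manifest_path)
    if not p.exists():
        return {}
    return json.loads(p.read_text(encoding="utf-8"))


def needs_processing(path, manifest):
    """True if this exact file content has not been logged as processed."""
    return file_digest(path) not in manifest


def mark_processed(path, manifest, manifest_path):
    """Record the file in the log and write the log back to disk."""
    manifest[file_digest(path)] = str(path)
    Path(manifest_path).write_text(json.dumps(manifest, indent=2), encoding="utf-8")
```

Because the key is the content hash, a file whose time stamp shifts but whose bytes are unchanged is not picked up again, and the JSON manifest doubles as a readable log of what has been processed.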
I have OmniPage version 18, which I bought almost 2 years ago. Now that I have a new computer, I would like to transfer it.
Two questions:
1) Is it compatible with Windows 10?
2) How do I transfer it? I downloaded it online.
We are having problems with inverse text over images causing DocuDirect Folder Watch to get stuck (white letters on dark or coloured background images, as seen on many telephone bills).
OmniPage does not get stuck on these documents when they are loaded manually, so it appears to be a DocuDirect Folder Watch problem. However, I have checked the settings, and inverse text is ticked by default in the OCR accuracy settings.
We are on version 19.0, Prod ID OP-1303-020-13222.1111. I am not sure whether we need to apply a patch, upgrade, or change a setting on the watch folders/DocuDirect. The OmniPage patch link seems to be broken, so I cannot check for any bug fixes, and my support has run out.
Attached is a sample problem image that causes the OmniPage agent to get stuck.
I am running OmniPage Pro 17. Scanning a document with pictures works fine. I mark the pictures as type graphics and they appear in the text editor correctly. Clicking "Copy" correctly loads the picture into the clipboard. But if "Copy" is clicked a second time, nothing happens.
That is, the Paste function is disabled indicating that the image did not get copied to the clipboard.
I have some PDF files that require OCR. Once the OCR is done and I use "save as file", the resulting PDF is really poor quality: the text is blurry and the images are not clear. Has anyone else had this problem?
Is there any way to OCR the PDF without using "save as file", i.e. open the PDF as a PDF, OCR it, then simply save the file?