I'm converting PDFs to multipage TIFF files. I've created a workflow and have used the Batch manager, but I keep encountering issues where the file fails to convert. I can't figure out why this keeps happening and the error report gives me little to no information. The error reads: "Fax Correction: No. Rotation: No. Invalid Index." What's going on?
I'm using OmniPage Professional 18
We are using Omnipage SDK 19.20 now with OEM license(include Engine, RUNTIME, V19, OEM, Arabic, Formatted Output, Asian Engine, Thai and etc). When I tried to recognize Arabic and set output as .xml file, I found that unlike any other language, Arabic output doesn't separate each word, Basically, each line is a single word in my .xml file. I use exactly the same settings on English, Thai and Asian Languages, all can separated to correct words by space or any other ways . Is there any specific settings I should make when I run Arabic? Or is there any sample codes? I have attached my picture if someone is interested in trying it.
What are the possibilities to automatise Omnipage as much as possible. I know you can create processes but can you take it a step further and use "omnipage.exe 'commandlinearguments here' "?
what I am trying to achieve is the following:
We start for example with a newspaper article spread on two pages,
both pages are scanned separately so we have two source files. Is there an easy way to merge these two pages before or after OCR is performed? (for example: alinea starts on page 1 but lines continue on page 2 )
I'm using Docudirect as part of OmniPage Ultimate 19 for several different jobs. One specific job is to watch a scan destination folder and remove all blank pages from the .TIF files and save the output as a .TIF in the same folder with the same filename as the original.
The job works as intended, however the output .TIF files are exponentially larger in hard-drive size than the originals. One example is as folllows...
The original source .TIF file consists of 12 pages from my scanner (6 black and white text and 6 blank), totaling 730kb of hard-drive space. Once the document has been processed by Docudirect, and subsequently stripped of it's 6 blank pages, the output .TIF is 24MB in size. Be advised that the output file is only 6 pages and neither the resolution or DPI is adjusted.
Upon further analysis I've determined that this has to do with .TIF compression. My scanner creates .TIF's using the CCITT T.6 compression algorithm, while Docudirect on the other hand uses no compression algorithm when creating / saving .TIF's by default.
In conclusion, Docudirect outputs .TIF files more than 50x larger in storage space than it's source! This has been determined to be because Docudirect does not use any .TIF compression. I'd like to configure Docudirect so as it's output .TIF's are infact compressed.
Can someone please advise me on a way to configure Docudirect as to compress .TIF's upon saving?
Please find attached some screen captures of the Docudirect job I am reffering to. Thanks!
I'm getting ready to update my Windows 7 Pro to Windows 10.
1. Is Omnipage 19 compatible with Windows 10?
2. Do I need to uninstall Omnipage BEFORE upgrading to Windows 10. At present I am NOT planning on doing a clean install of Windows 10.. just the upgrade.
I have some pages which have been scanned in colour, but I think they will be much easier to read if turned into Black and White. In Omnipage, try to enhance a page. In the Image enhancement window , by using functions like "OCR Brightness", I can see a black and white image called "OCR image". I think this is what i need. Any way i can save such black and white "OCR image" into a PDF? Thanks.
Once I have installed Omnipage software, and had at least one in-program update, how do I do a backup?
There few software like Omnipage that can support MRC pdf which can significantly shrink pdf size without affecting the quality.
I often want to compress my pdfs downloaded from website. These pdfs are scanned documents and already been OCRed, that is they are already searchable. What I want to do is just re-save them using MRC technology to significant shrink their size. But, I don't want to OCR them again, it is a waste of time. However, it seems that in Omnipage, if I don't OCR them again, I can not save pdf as searchable, I can only save them as image pdf. So am I missing something? Can Omnipage directly use the already existed text layer to re-save as MRC pdf without redundant OCR?
This is part of a ebook pdf downloaded from SpringerLink
If you open it in Omnipage 19, in the document manager windows, it shows that the image resolution is 300DPI. See the screen capture http://pasteboard.co/1EGzt2OL.png
I am pretty sure this pdf is much more than 300DPI and probably 600 dpi.
Please look into this matter. I strongly doubt this is a bug. What is more, I don't understand why there is no 600DPI option in Omnipage？ I think 600DPI is so common in ebook!