I am using OmniPage Pro 16. I am trying to create a batch job which will watch a folder for PDF files. When files are placed in the directory, I want OmniPage to open the file, OCR it, and save the OCR'ed PDF file in my output directory, with the same name as the input PDF file.
In other words, let's say my input directory is c:\input\, and my output directory is c:\output\. If I copy 3 pdf files (file1.pdf, file2.pdf, file3.pdf) into c:\input at the same time, how do I get OmniPage to OCR these three files, and then save the results as c:\output\file1.pdf, c:\output\file2.pdf, etc?
I can't have each file broken up into separate pages, and I can't have the files all put into the same page. Is this possible?
Hi, I am trying to set up a DocuDirect job in OmniPage that will batch OCR a very large folder of documents (around 150k files) and I'm having some trouble.
I am able to get the job running, but it appears to attempt to import every single file into the GUI, which quickly consumes all of the resources on the machine. This does not seem to stop it from working however, and after leaving it running over the weekend, it does appear to have OCR'd many of the documents when I spot check them. However, the GUI had crashed with a windows error when I came back to it after the weekend. It says it was still processing on the "Load Files" step (see screenshot attached) though it had indeed clearly replaced several of the input PDFs with searchable PDFs.
So, unfortunately I have no way of knowing how many and which specific files were updated by this batch process. Is there possibly a log file somewhere generated for the job execution with that information? Also, is there a better way to process 150,000 files without importing them into the DocuDirect GUI and wasting the resources on the machine, preferably with some way of telling % of completion and which files were or were not yet converted?
Thanks in advance.
I'm using OP18. When I recognize a page and choose "copy to clipboard", it always wants to use flowing page. For my purposes, I prefer plain text. Is there a way to change the default format to plain text?
I want to configure Omnipage to read/scan the PDF files better, so it doesnt go crazy everytime theres a handwritten document it handles, or signatures in a computer-written document.
I have created a job within DirectDoc, and tried to configure it the best way possible, but still it creates weird symbols over normal text, and doesnt understand when something is handwritten, and isnt supposed to be converted into editable-text.
My need is this, a service running 24/7 without need for prompt-input, converting PDF's it continuesly scans from a network folder. As of now it works 80%, but some documents it really ruins.
Im guessing I need to fine-adjust how the intelligence of the engine works. But how?
This is my current configuration:
Any help would be greatly appreciated
I have had OmiPage 19.1 for at least 8 months. When I use it to OCR scanned documents or screenshots of old images it rarely ID's the font. Very frustrating!
I am not that proficient using it but might someone have a workflow other than in OmniPage's Index? It reall doesn't work for me.
I take sreenshots of very old militray files and try to load them into OCR so I can consolodate text into 1 document but OCR fails to recognize what the font is.
Any help would be appreciated!
I'm looking to OCR on images that use the HK supplementary character set, and Im not able to get it to work. Is the character set supported, or is something else wrong?
We used Omnipage to OCR and then edit some archival documents (to replace hyphens for example to aid in searches). We want to save the doc as a searchable PDF using the original document image, but have the search reflect our edits. The document has been saved in a number of formats and we cannot find a way to retain the original image with the underlying edits intact.
Any help is greatly appreciated. Thanks, Ron
Hello fellow Omnipage users
I've got Omnipage Ultimate 19 installed on a Windows 7 laptop.
After installation (some time back and I don't remember exactly when I did it) I used it maybe once or twice for testing it worked, and hardly ever thereafter.
Recently, I bought a new scanner with auto paper feed from Canon that had another version of OmniPage, Paperport, ePDFxyz whatever coming with it. I installed some of that software, but IIRC specifically NOT the bundled OmniPage Version, because it was no Ultimate version. I have hardly used that scanner either, but checked the installed software, which worked, too.
Now, today (no: yesterday!), I want to finally put these tools to some real use - and wanted to start by comparing the bundled Canon Software, PaperPort, ePDF..., and OmniPage Ultimate w/ respect to workflow and results.
But: When I start Omnipage Ultimate 19 now, only the splash screen appears, stays forever - and CPU load goes way up - and that's it.
It does not change when I start any of the programs from the OmniPage program directory directly. I've also tried to "repair" the software from the original CD, and I've uninstalled it and reinstalled it. After installation, my serial number was required (again, I had entered that when I installed it the first time), and that was accepted.
But re. startup: No change. Just the splash screen, and a lot of CPU usage.I have to kill the OmniPage19.exe in the task manager, to get CPU load back to normal. When I try to start it multiple times, I get multiple instances of that software in the task manager that consume CPU time, but nothing useful.
I've seen an older post where another user had a similar problem with Omnipage 12 and Windows XP, and there was a patch made available - but I have not found any patches or updates for Omnipage 19.
Now I'm wondering: does anybody else experience the same, or does anybody have any helpful ideas?
Thanks in advance and kind regards,
N.B.: To Nuance management:
In the meantime I've spent multiple hours on attempting to resolve that problem. From previous experience with Nuance products, I've the strong suspicion that it is actually caused by some voluntary-software-complication to-get-some-even-more-modern-looking-or-distinguished-new-GUI-or-advertising-or-license-management rather than the core functionality of the OCR program.
Actually, there are 5 OmniPage Ultimate 19 Boxes around here. These boxes were all bought at the recommended price, some are still unopened, and I was going to recommend their use to other people. I've actually used, supported, and resold OmniPage already some 20 years back, and used Recognita Plus before. In recent years, I've also seen Dragon from about version 10 or 11 thru 15, including the latest MPE over the years. And let me say: *Not one* program from the Nuance portfolio has ever appeared completely stable, or focused simply on the task it should complete for the customer, to me.
But quite constantly, there were (from a users viewpoint) perfectly unnecessary problems with installers, recognition of serial numbers, startup, etc. - where solutions had to come from third parties more than once. [...]
I'm scanning in hard copy text files, code, that has tabs and CR\LFs that seem to be ignored. What settings should I be enabling to retain the indented style of code. Also, how to retain '|' from being '1"?
I'm getting ready to update my Windows 7 Pro to Windows 10.
1. Is Omnipage 19 compatible with Windows 10?
2. Do I need to uninstall Omnipage BEFORE upgrading to Windows 10. At present I am NOT planning on doing a clean install of Windows 10.. just the upgrade.