Here you choose the language to be used for the optical character recognition.

The mode option lets you choose special processing for fax and dot-matrix printed documents. For regular office text the dictionary-based spelling correction usually improves results. For other alphanumeric data, such as scientific data or financial numbers, turning this option off can be an advantage.

OCRKit tries to re-use images from source files 1:1. However, PDF pages may contain more than one image. In this case the resolution option controls the resolution used to rasterize the page.

With the match physical size option for low-resolution images, OCRKit can try to detect the actual page size of photos without resolution information, such as those taken with a mobile phone.

The rotate option lets you adjust the page orientation before processing, for example when the creating application rotated landscape pages.

De-skew based on page content corrects small skew angles.

Color detection can automatically convert color images to gray or black and white to save space.

The de-screen option reduces the pattern of small dots that is at times visible when scanning offset-printed material. It can also be useful for removing other small dot noise from your source material.

With the Format option you select the output format. This can be Searchable or Highly Compressed PDF, or pure text.

The compression quality controls the resulting file size, and thus affects the image quality. Moving the option further to the left gives you smaller files at the cost of image quality, because of compression artifacts.

By default OCRKit uses the original filename. You can choose to add an -OCR extension or edit each filename.

When you select or drop multiple files in OCRKit for processing, they are processed one by one. Select the merge option to combine all files of one batch into a single output file, for example to convert a bunch of JPEG or TIFF files into one highly compressed and searchable PDF.

OCRKit uses the file order as received from the operating system. While this is usually the order in which you selected the files in the multi-selection, at times macOS's Finder unfortunately sorts them in an arbitrary order. In any case you can choose to have OCRKit sort the files to retain a reliable order while merging all files of one batch.

In the Finalization tab you can choose whether to move the original document to the system trash after processing, and whether to notify another application about the new file, for example Preview for visual control, or a database or cloud application to archive the final document.

With the digital imprinter of the Pro version you can add watermarks to your documents. Commonly used marks are CONFIDENTIAL, PRELIMINARY, COPY, or similar terms that suit your workflow, in any font, shape, or rotation.

You control the language used for the text recognition, as well as all other processing settings such as the output format (PDF, RTF, HTML or plain text).

Poor recognition is usually the result of poor image quality. If the text is not readable for a human, you can imagine it is even harder for a computer program to identify it. The resolution for scans of regular office paperwork should be between 200 and 300 dpi (dots / pixels per inch). Use 300 dpi for all regular daily office material. Using more than 300 dpi does not necessarily improve results, but mainly increases the size of the resulting PDF files. Unless you use the automatic rotation of the Pro version, the text must also be in the right, readable orientation.

You can also script OCRKit to integrate it into your specific workflow, for example to process incoming files via a shared folder, from an MFP copy machine, etc. One way to simply tell OCRKit to open, and thus process, a file is via AppleScript:

    tell application "OCRKit"
        set resolution to 240
        set rotation to 180
        set destination app to "/Application/Some.app"
        open POSIX path of "/Users/Admin/Desktop/orderform.pdf"
        open "Users:admin:Desktop:orderform.pdf"
    end tell

The two open lines show both path styles, the legacy of AppleScript POSIX path handling.

Command line

Since OCRKit version 2.5 direct command line scripting is supported. This greatly simplifies the use of OCRKit in batch processing, allows you to set more options, and is also more robust and cross-platform than AppleScript.

Since OCRKit version 16.9 additional command line options are supported:

    Scan a directory recursively for new files.
    Skip files from OCRKit, or with a text layer or vector graphics.
    Pattern used to match filenames during recursive scans.
    Write log information and statistics gathered during a recursive scan to a file.
    Use a secret password to decrypt PDF files during batch processing.
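The AppleScript commands described above can also be driven from another scripting language, which may help when files arrive in a shared folder and need to be handed to OCRKit one at a time. The following is a minimal Python sketch, not an official OCRKit interface. It assumes macOS's osascript tool is available, and it uses only the AppleScript commands quoted in this document (set resolution, set rotation, open POSIX path of). The helper names applescript_string, build_script and process_file are hypothetical and exist only for this illustration.

```python
import subprocess
from typing import Optional


def applescript_string(value: str) -> str:
    """Quote a Python string as an AppleScript string literal."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def build_script(pdf_path: str, resolution: int = 300,
                 rotation: Optional[int] = None) -> str:
    """Build the AppleScript text that asks OCRKit to open one file.

    Only commands shown in this document are emitted. The default of
    300 dpi follows the recommendation for regular office material;
    the rotation line is left out unless a value is given.
    """
    lines = ['tell application "OCRKit"',
             f"    set resolution to {int(resolution)}"]
    if rotation is not None:
        lines.append(f"    set rotation to {int(rotation)}")
    lines.append(f"    open POSIX path of {applescript_string(pdf_path)}")
    lines.append("end tell")
    return "\n".join(lines)


def process_file(pdf_path: str, resolution: int = 300,
                 rotation: Optional[int] = None) -> subprocess.CompletedProcess:
    """Run the generated script through osascript (macOS only)."""
    script = build_script(pdf_path, resolution, rotation)
    return subprocess.run(["osascript", "-e", script],
                          capture_output=True, text=True, check=False)


if __name__ == "__main__":
    # Print the script instead of running it, so the sketch is safe to try
    # on a machine without OCRKit installed.
    print(build_script("/Users/Admin/Desktop/orderform.pdf",
                       resolution=240, rotation=180))
```

Keeping build_script separate from process_file means the generated script can be inspected or logged before anything is sent to OCRKit.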