|
.
|
|
PDF
Conversion Details
Overview Prime
Recognition software includes the capability to convert scanned images into PDF
formatted files. Several products from Prime Recognition support PDF
output, including PrimeOCR, an award winning, high accuracy "Voting" OCR
engine, PrimeZone (image to PDF only), and PrimePost (PRO to PDF).
|
 |
PrimeOCR's PDF output provides the most accurate OCR
results available to the production imaging marketplace while minimizing PDF file size
with full compression and retaining original image and text layout.
Three styles of PDF documents can be produced:
PDF Image with Hidden Text includes information from both the PDF Image and the PDF Normal file types.
The original bitmap image is included in the document while the OCR results are hidden
behind the image. This type of document is useful when the original image needs to be
retained while OCR results can be indexed, searched, or copied into another application.
Advantages of using
PrimeOCR for PDF Creation
OCR Accuracy
Designed for
high volume unattended production environments
Memory management for robust operation.
Many of today's products that produce PDF files have limitations processing a large number
of documents in batch mode, or handling multi-page TIFFs. Prime Recognition products
manage memory effectively so thousands of images and multi-page TIFFs can be processed
quickly without complications.
Image enhancement may be controlled by the user and may be done in a
separate step from OCR or within OCR process including deskew, auto-rotation, despeckle,
etc.
Speed
Process Time
OCR Conversion of 21 TIFF Images to PDF Image Plus
Text
OCR Process |
Time (min) |
% faster with PrimeOCR |
Other product |
7:30 |
n/a |
PrimeOCR Level 1 |
1:05 |
85% |
PrimeOCR Level 3 |
3:00 |
55% |
PrimeOCR Level 5 |
5:50 |
15% |
Process Time
Conversion of 21 TIFF Images to PDF Image Only
Process |
Time
(sec) |
%
faster with PrimeOCR |
Other product |
80 |
n/a |
PrimeOCR |
6 |
92% |
File
Size
Prime Recognition's PDF output can save up to 80% disk space vs.
other alternatives depending on the PDF file type.
File Size
Conversion of 21 TIFF Images (876.6KB total size)
PDF
File Type |
PrimeOCR
(File Size KB) |
Other
Product
(File Size KB) |
%
saving with PrimeOCR |
Normal |
117.0 |
620.1 |
80% |
Image Only |
926.0 |
1263.0 |
25% |
Image plus hidden text |
988.0 |
1560.5 |
35% |
All fonts are mapped to the base fonts found in the PDF
reader reducing file size (however "look and feel" of document in PDF Normal
format may suffer when the base fonts do not closely match fonts in document).
Both text and images are compressed within the PDF file to minimize
file size.
To further minimize file size, desampling of the images within a PDF
file is available with PrimeOCR PDF output. Desampling is fully configurable by the user
from 50 dpi to 600 dpi.
PrimeOCR PDF I/O Specifications
Input File
Formats:
TIFF - including large multi-page (>1,000's of
pages) files
PCX
Bitonal images, color and grayscale
JPEG
PDF
Output File Formats:
PDF Image Only
PDF Normal
PDF Image with Hidden Text
Color and grayscale output supported
Optimized PDF output available with Acrobat
installed.
|