OCR Software Accuracy Comparison

/OCR Software Accuracy Comparison
OCR Software Accuracy Comparison2018-12-20T12:34:23+00:00

OCR Software Accuracy Comparison

Looking for increased performance from your OCR software?

The best conventional OCR software products achieve about 98% average accuracy recognizing text on typical quality document images.   This level of accuracy sounds pretty good, but still leaves 40 errors remaining on a page of 2000 characters.

Some users, including forms processing operations, chose to correct these errors mounting high OCR error correction labor costs.  Others rely on fuzzy search to filter through OCR data hoping to find relevant data only to find that reviewing 10’s, even 100’s of unrelated documents takes time and costs money.

On a typical text page of 2000 characters, conventional OCR, on average, would generate 40 errors.  By implementing PrimeOCR’s “Voting” technology, the total errors could be reduced to 8 errors.

PrimeOCR Accuracy vs. Industry OCR

Prime Recognition developed PrimeOCR for the production marketplace to reduce the error rate typically found with conventional OCR engines.  PrimeOCR licenses and includes engine technology from the best retail OCR vendors.  Your image is passed through each engine and using Voting technology, PrimeOCR reduces overall OCR error rate by 65-80%.

PrimeOCR can be configured with level 3 accuracy reducing error rates by 65% or configured to level 6 accuracy which reduces error rates by up to 80%.  Level 6 accuracy takes more time to process and is more costly, but depending on your application, may be more cost-effective when compared with the costs associated with error correction or sifting through fuzzy search results.

Real World Results

What does 65% – 85% fewer errors look like when viewing OCR accuracy results on scanned documents?  These averages are based on a large number of scanned documents from different document types, various image qualities and varying types of fonts.  Some imaging projects may have much cleaner documents so the reduction in error rates will naturally be less, while other projects may include older documents with poor quality characters which PrimeOCR will further decrease the error rate.

Following are the visual results from one page as an illustrative sample.  Certainly not statistically conclusive because of the limited number of pages, but it is a good graphical rendition of what errors look like when comparing various levels of accuracy.  We processed the same page through an industry OCR  engine and then Level 3 accuracy of PrimeOCR and then Level 6 accuracy of PrimeOCR.

Although newer documents may not include so many errors as this single page, what is important to consider is that voting technology can reduce the number of errors by 65% – 85% when compared with traditional OCR software.

Traditional OCR Software 142 Errors Marked

Traditional OCR Software

142 errors marked

PrimeOCR Level 3 Accuracy 21 Errors Marked

PrimeOCR L3 Accuracy

21 errors marked

PrimeOCR Level 6 Accuracy 12 Errors Marked

PrimeOCR L6 Accuracy

12 errors marked

See Your Own Results

Want to see a comparison with your own images?  Send us a sample and see the difference in the results.

See how PrimeOCR can save you operating costs, how PrimeOCR produces cleaner data.  Or see details about PrimeOCR or our other products.

PrimeOCR now supports PDF for high accuracy formatted output.  The PDF output generated from PrimeOCR contains fewer errors than conventional OCR engines producing PDF and takes full advantage of PDF’s compression options to produce the smallest PDF file size available from any OCR engine.

Contact Us

Information and Sales:
sales@primerecognition.com

Support:
support@primerecognition.com

Call Us
(425) 895-0550

 


Cleaner OCR Data with PrimeOCR

High Accuracy OCR Saves Operating Costs


 

What Our Customers Say

“InfoEdge has experienced up to 50% reduction in OCR errors through the use of the voting technique.   (PrimeOCR) … Editing and correction of OCR errors can be the single largest cost in some applications, and reducing that cost can significantly reduce the bottom line of the entire project.” ~ KMWorld

 

“The University of Michigan Digital Library Production Services is extraordinarily pleased with the increase in OCR quality made possible through the use of PrimeOCR. Scalability is a critical issue in digital libraries, and Prime Recognition has contributed to our creating a large and scalable digital library production service.”
John Price-Wilkin, University of Michigan

 

“PrimeOCR gives us a much cleaner document before verification than most OCR packages do after verification.”  ~ Doug Thompson, Scan Center of America

 

“PrimeOCR provides the highest OCR accuracy available to the production market.  This high accuracy combined with PrimeOCR’s flexible architecture prods us with a powerful OCR platform to offer our customers.” ~ Robert J. Perry, Webhire

 

“What we release to the public, by law, must be 100% correct.  PrimeOCR has significantly reduced errors, allowing us a faster turn-around time to publish a document.”  ~ Rick Essex Rotunda

Sliding Bar Widget 2

Sliding Bar Widget 2

This Sliding Bar can be switched on or off in theme options, and can take any widget you throw at it or even fill it with your custom HTML Code. Its perfect for grabbing the attention of your viewers. Choose between 1, 2, 3 or 4 columns, set the background color, widget divider color, activate transparency, a top border or fully disable it on desktop and mobile.