Wednesday, January 14, 2015

Recognize Only White Listed Characters & OCR Preprocessed Images in .NET Apps

What’s new in this release?

We are pleased to announce the release of Aspose.OCR for .NET 2.2.0. This release contains many useful improvements, including the long-awaited Character White Listing feature. Please refer to the release notes of v2.2.0 for a full list of bug fixes and improvements. If you are planning to upgrade from a previous version, we suggest checking the Public API Changes section first.

The Aspose.OCR for .NET API now exposes the Whitelist property on the OcrConfig class to support Character White Listing. Developers can supply a list of characters through this property, and the OcrEngine then recognizes only those characters during the OCR operation. By default, the OcrEngine restricts recognition to the white-listed characters whenever the OcrConfig.Whitelist property is neither null nor empty.

The API also exposes the PreprocessedImages class, which lets users see how the original input image changes during OCR pre-processing. Its properties retrieve the image at a particular stage of the pre-processing.

Aspose.OCR for .NET can extract text either as a whole or in parts, where each part has its own Style, Font, Text Size and Location in the image. All of this information can be extracted using IRecognizedPartInfo and IRecognizedTextPartInfo. The API can also report the hierarchy level of each recognized part on the image: TextBlock, Line, Word or Character. The level is expressed through the TextPartLevel type, which is exposed in the public API with this release. IRecognizedTextPartInfo has two useful properties, Level and Children, which provide access to the hierarchical level and the lower-level textual parts respectively.
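
To make the whitelist rule and the part hierarchy described above more concrete, here is a small conceptual model in Python. It is not Aspose.OCR code and does not call the library: the names Part, Level, is_allowed and parts_at are hypothetical stand-ins invented for this sketch, and the sample text is made up. It only illustrates two ideas from the post, namely that a null or empty whitelist imposes no restriction, and that recognized parts form a tree of TextBlock, Line, Word and Character nodes reachable through a Level and a list of Children.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List, Optional


class Level(Enum):
    # Mirrors the four levels named in the post; this enum is hypothetical.
    TEXT_BLOCK = 1
    LINE = 2
    WORD = 3
    CHARACTER = 4


@dataclass
class Part:
    # Hypothetical stand-in for a recognized part with a level and children.
    level: Level
    text: str
    children: List["Part"] = field(default_factory=list)


def is_allowed(ch: str, whitelist: Optional[str]) -> bool:
    # A null (None) or empty whitelist means "no restriction".
    if not whitelist:
        return True
    return ch in whitelist


def parts_at(part: Part, level: Level) -> List[str]:
    # Depth-first walk collecting the text of every part at the given level.
    if part.level == level:
        return [part.text]
    found: List[str] = []
    for child in part.children:
        found.extend(parts_at(child, level))
    return found


def build_word(text: str) -> Part:
    # Build a word node whose children are its individual characters.
    chars = [Part(Level.CHARACTER, c) for c in text]
    return Part(Level.WORD, text, chars)


# Made-up sample: one text block holding one line with two words.
line = Part(Level.LINE, "A1 B2", [build_word("A1"), build_word("B2")])
block = Part(Level.TEXT_BLOCK, "A1 B2", [line])

print(parts_at(block, Level.WORD))
print(is_allowed("A", None), is_allowed("A", ""), is_allowed("A", "0123456789"))
```

In the real API the whitelist is applied by the OcrEngine during recognition; the predicate here only captures the null-or-empty rule, not how the engine uses it internally.
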
This release includes plenty of new & improved features, as listed below:
  • Support text parts hierarchy
  • Character Whitelisting
  • Create API for viewing filtered images
  • Embed resources file into OCR dll
  • Support for French and Spanish languages
  • Improve working with languages through public API
  • Fix for incorrect number recognition
  • Improve transparent images processing
Other recent bug fixes are also included in this release.

Overview: Aspose.OCR for .NET

Aspose.OCR for .NET is a character recognition component that lets developers add OCR functionality to their ASP.NET web applications, web services and Windows applications. It provides a simple set of classes for controlling character recognition tasks and helps developers work with image files (BMP, TIFF) from within their own applications. Developers can extract text from images quickly & easily, saving the time & effort involved in developing an OCR solution from scratch.
