Monday, October 27, 2014

Detect Text Blocks on Image & Keep Track of all The Recognized Blocks

What’s new in this release?

We are pleased to announce the new release of Aspose.OCR for .NET 2.1.0. Aspose.OCR for .NET API has exposed IRecognizedPartInfo to that could be text or detected picture block. Textual blocks are represented by IRecognizedTextPartInfo holding useful information about the recognized text part. Each part has its own style, font, text size, color, language and other attributes. If the text consists of several parts that are written in different font styles or even in different languages then IRecognizedTextPartInfo will hold an item representing individual element. Recognized Text blocks can be retrieved using the OcrEngine.Text property only after performing the OCR operation. These blocks are stored as an array of IRecognizedPartInfo and contains information as discussed above along with the position and size of the block on the image. The IRecognizedPartInfo.Box property can be used to get the rectangular area where the detected block (with text or picture) is shown on the image. The Box property returns the coordinates starting from the upper left corner of the image.Last but not the least, we have made some improvements to the OcrEngine for performance considerations and accuracy. Moreover, we have fixed a few problem regarding the recognition of numbers, and cases that may cause OutOfMemory & IndexOutOfRange exceptions. As always we appreciate your feedback so if you ever have anything to tell us about this release or anything else, please head to the Aspose.OCR forum for a chat. This release includes plenty of new & improved features as listed below
  • Text blocks detection
  • Latest version cannot correctly perform OCR on the sample provided with Aspose Examples Dashboard is now corrected
  • Incorrect recognition of numbers is fixed
  • IRecognizedTextPartInfo issues are resolved and now return the found part type
  • Improve text and picture blocks processing algorithm
  • Incorrect results returned by OCR is now corrected
  • Improve time taken to extract the text from an image
  • Unable to perform OCR on Arial 32pt Text is now fixed
Other most recent bug fixes are also included in this release

Overview: Aspose.OCR for .NET

Aspose.OCR for .NET is a character recognition component built to allow developers to add OCR functionality in their ASP .NET web applications, web services and Windows applications. It provides a simple set of classes for controlling character recognition tasks. It helps developers to work with image (BMP, TIFF) files from within their own applications. It allows developers to extract text from images quickly & easily, saving time & effort involved in developing an OCR solution from scratch.


No comments:

Post a Comment