OCR for bank statements

I need to be able to OCR bank statements, including getting all the numbers and description in a form that can be processed.  How did you cope with the fact that every bank any a different layout? ANSWER (from http://stackoverflow.com/questions/7362926/what-is-the-state-of-the-art-in-ocr-of-bank-statements-in-net) We have first hand experience, and I have done it in two different ways in the past. Full Page OCR First, you can take the approach of “full-page OCR” and then parse the information into your desired data format. There is a variety of Engines with .NET support, such as ABBYY Engine SDK, or even a completely free-to-start cloud-based on-demand OCR API (OCR Cloud 2.0, http://www.ocr-it.com/ocr-cloud-2-0-api). This is more of a classic approach I used for over past 10 years and up to a few years ago. OCR provides you with a complete text-based result, and you use algorithms to extract informaiton. This approach is quite static and requires heavy programming usually, especially if there are multiple variations. There are two potentially troublesome areas to look for in this approach: A. making sure that OCR provides consistent layout and text structure so it could be parsed reliably. If there is a table without gridlines, or if there is just tabular data that could be detected as a table, then OCR may work unpredictably from document to document, which essentially breaks your parsing down the road. B. making sure that your parsing logic can accommodate various formatting differences and multiple variations of data structures. This is pure programming that requires code changes for adjustments or updates. Dynamic Data Capture Second, use a modern dynamic data capture system that automates template identification and...

OCR-IT Announces a Powerful Cloud-Based OCR API for Ad-Hoc Integrators and Mobile Developers

This press release has been published on August 29, 2011 here: http://www.i-newswire.com/ocr-it-delivers-a-powerful-cloud/126874 August 30, 2011 – OCR-IT LLC, a fast-growing document conversion and image processing company, today announced public availability of its newest cloud-based OCR Web API, code-named “OCR Cloud 2.0”. The platform ties together high-quality Optical Character Recognition (OCR) technologies from several leading manufacturers through a powerful HTTP POST API. The API allows developers with various backgrounds and working environments to access powerful image and picture cleanup tools and text recognition algorithms. Optical character recognition supports over 50 Latin and Cyrillic languages, provides enhanced control over text extraction and supports a wide selection of output formats. “As today’s mobile cameras, such as in iPhone 4 and soon to be released iPhone 5, as well as Android-based phones and tablets, continue to increase picture quality, smart developers are looking for new and creative ways to combine these data-gathering resources together with this Web OCR API to create applications that excite end users,” said principals at OCR-IT. “By offering this advanced OCR API, OCR Cloud 2.0 offers developers a chance to create applications that have low costs in development and usage but can lead in innovation.” Historically, high quality OCR has been developed for Windows platform, while other popular operating systems such as Mac OS and Linux, and mobile platforms such as iOS and Android, are left with inferior OCR technologies. OCR-IT OCR Cloud 2.0 erases these boundaries with high-quality Optical Character Recognition and accurate text extraction from any Web-enabled platform, making it truly OS-independent. Now, every developer can utilize this feature-rich powerful productivity technology to create a variety of never-seen-before applications....