<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>OCR-IT LLC</title>
	<atom:link href="http://www.ocr-it.com/feed" rel="self" type="application/rss+xml" />
	<link>http://www.ocr-it.com</link>
	<description>Document Conversion Experts: Document Processing Services &#38; OCR API</description>
	<lastBuildDate>Wed, 08 Feb 2012 19:41:08 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1.1</generator>
		<item>
		<title>OCR for bank statements</title>
		<link>http://www.ocr-it.com/343?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=343</link>
		<comments>http://www.ocr-it.com/343#comments</comments>
		<pubDate>Mon, 12 Sep 2011 07:23:09 +0000</pubDate>
		<dc:creator>OCR-IT LLC Team</dc:creator>
				<category><![CDATA[Best Practices]]></category>
		<category><![CDATA[Document Conversion]]></category>
		<category><![CDATA[OCR Cloud API]]></category>
		<category><![CDATA[abbyy]]></category>
		<category><![CDATA[data capture]]></category>
		<category><![CDATA[forms processing]]></category>
		<category><![CDATA[OCR Cloud 2.0]]></category>
		<category><![CDATA[Online OCR]]></category>
		<category><![CDATA[Web API]]></category>

		<guid isPermaLink="false">http://www.ocr-it.com/?p=343</guid>
		<description><![CDATA[I need to be able to OCR bank statements, including getting all the numbers and description in a]]></description>
			<content:encoded><![CDATA[<p>I need to be able to OCR bank statements, including getting all the numbers and description in a form that can be processed.  How did you cope with the fact that every bank any a different layout?</p>
<p><span style="font-size: 15px; font-weight: bold;"><strong>ANSWER (from <a href="http://stackoverflow.com/questions/7362926/what-is-the-state-of-the-art-in-ocr-of-bank-statements-in-net">http://stackoverflow.com/questions/7362926/what-is-the-state-of-the-art-in-ocr-of-bank-statements-in-net</a>)</strong></span></p>
<p>We have first hand experience, and I have done it in two different ways in the past.</p>
<p><strong>Full Page OCR</strong> First, you can take the approach of &#8220;full-page OCR&#8221; and then parse the information into your desired data format. There is a variety of Engines with .NET support, such as ABBYY Engine SDK, or even a completely free-to-start cloud-based on-demand OCR API (<a rel="nofollow" href="http://www.ocr-it.com/ocr-cloud-2-0-api">OCR Cloud 2.0</a>, <a rel="nofollow" href="http://www.ocr-it.com/ocr-cloud-2-0-api">http://www.ocr-it.com/ocr-cloud-2-0-api</a>). This is more of a classic approach I used for over past 10 years and up to a few years ago. OCR provides you with a complete text-based result, and you use algorithms to extract informaiton. This approach is quite static and requires heavy programming usually, especially if there are multiple variations. There are two potentially troublesome areas to look for in this approach:</p>
<p>A. making sure that OCR provides consistent layout and text structure so it could be parsed reliably. If there is a table without gridlines, or if there is just tabular data that could be detected as a table, then OCR may work unpredictably from document to document, which essentially breaks your parsing down the road.</p>
<p>B. making sure that your parsing logic can accommodate various formatting differences and multiple variations of data structures. This is pure programming that requires code changes for adjustments or updates.</p>
<p><strong>Dynamic Data Capture</strong> Second, use a modern dynamic data capture system that automates template identification and data extraction. This is the approach I have been using instead of parsing for a few years now, and it is several times faster and more convenient to create and operate. In this process you would use a specialized software, such as <a rel="nofollow" href="http://www.wisetrend.com/abbyy_flexicapture.shtml">ABBYY FlexiCapture</a>(<a rel="nofollow" href="http://www.wisetrend.com/abbyy_flexicapture.shtml">http://www.wisetrend.com/abbyy_flexicapture.shtml</a>), which will take care of two aforementioned issues with variable data formats and different templates. Before processing it needs to be setup and &#8220;trained&#8221; to identify different statement types and how the data is located on those different variations. It performs all setup through User Interface and not coding, and you can plugin custom scripts if desired. If it needs to be re-trained for a new template, or trained to capture some data more reliably, it takes a few minutes without coding or programming experience. I have trained accountants to maintain and adjust their invoice templates themselves.<br />
(see their feedback here:<a rel="nofollow" href="http://www.prweb.com/releases/2011/wisetrend_wiseinvoice/prweb4936544.htm">http://www.prweb.com/releases/2011/wisetrend_wiseinvoice/prweb4936544.htm</a>)</p>
<p>NOTE: FlexiCapture is not .NET SDK, but an application with complete automation. It can be used interactively by operators, or typically I use it for 100% server-based automation. Once setup, I feed images to it for Input, and get my properly formatted text as Output in either CSV, XML, or direct export into my ODBC databases. So you could use it as &#8216;black box&#8217; server based component.</p>
<p>I have a sample project for bank statements somewhere, so please let me know if you would like to see it yourself live.</p>
<p>SOURCE: OCR &amp; Data Capture consultants with 11 years experience.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.ocr-it.com/343/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>OCR-IT Announces a Powerful Cloud-Based OCR API for Ad-Hoc Integrators and Mobile Developers</title>
		<link>http://www.ocr-it.com/ocr-it-announces-a-powerful-cloud-based-ocr-api-for-ad-hoc-integrators-and-mobile-developers?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=ocr-it-announces-a-powerful-cloud-based-ocr-api-for-ad-hoc-integrators-and-mobile-developers</link>
		<comments>http://www.ocr-it.com/ocr-it-announces-a-powerful-cloud-based-ocr-api-for-ad-hoc-integrators-and-mobile-developers#comments</comments>
		<pubDate>Tue, 30 Aug 2011 05:55:07 +0000</pubDate>
		<dc:creator>OCR-IT LLC Team</dc:creator>
				<category><![CDATA[OCR Cloud API]]></category>
		<category><![CDATA[Free OCR SDK]]></category>
		<category><![CDATA[Mobile OCR]]></category>
		<category><![CDATA[OCR Cloud 2.0]]></category>
		<category><![CDATA[Online OCR]]></category>
		<category><![CDATA[Web API]]></category>

		<guid isPermaLink="false">http://www.ocr-it.com/?p=332</guid>
		<description><![CDATA[This press release has been published on August 29, 2011 here: http://www.i-newswire.com/ocr-it-delivers-a-powerful-cloud/126874 August 30, 2011 - OCR-IT LLC, a fast-growing]]></description>
			<content:encoded><![CDATA[<p>This press release has been published on August 29, 2011 here: <a href="http://www.i-newswire.com/ocr-it-delivers-a-powerful-cloud/126874">http://www.i-newswire.com/ocr-it-delivers-a-powerful-cloud/126874</a></p>
<p>August 30, 2011 - <img src="http://www.i-newswire.com/Uploads/thumb_126874_1_1.png" alt="" hspace="8" width="200" align="left" />OCR-IT LLC, a fast-growing document conversion and image processing company, today announced public availability of its newest cloud-based OCR Web API, code-named “OCR Cloud 2.0”. The platform ties together high-quality Optical Character Recognition (OCR) technologies from several leading manufacturers through a powerful HTTP POST API. The API allows developers with various backgrounds and working environments to access powerful image and picture cleanup tools and text recognition algorithms. Optical character recognition supports over 50 Latin and Cyrillic languages, provides enhanced control over text extraction and supports a wide selection of output formats.</p>
<p>“As today’s mobile cameras, such as in iPhone 4 and soon to be released iPhone 5, as well as Android-based phones and tablets, continue to increase picture quality, smart developers are looking for new and creative ways to combine these data-gathering resources together with this Web OCR API to create applications that excite end users,” said principals at OCR-IT. “By offering this advanced OCR API, OCR Cloud 2.0 offers developers a chance to create applications that have low costs in development and usage but can lead in innovation.”</p>
<p>Historically, high quality OCR has been developed for Windows platform, while other popular operating systems such as Mac OS and Linux, and mobile platforms such as iOS and Android, are left with inferior OCR technologies. OCR-IT OCR Cloud 2.0 erases these boundaries with high-quality Optical Character Recognition and accurate text extraction from any Web-enabled platform, making it truly OS-independent. Now, every developer can utilize this feature-rich powerful productivity technology to create a variety of never-seen-before applications.</p>
<p>The initial release of OCR Cloud was developed over two years ago by WiseTREND (<a href="http://www.wisetrend.com/" target="_blank">http://www.wisetrend.com</a>), a leading integrator and custom development firm for advanced OCR and data capture solutions. The API has been adopted by thousands of developers worldwide. Developers serving various industries have reported hundreds of successful implementations including internal corporate projects, the development of e-discovery processes and the creation of mobile applications, as well as for the addition of OCR support on operating systems that have no native support for OCR capabilities.</p>
<p>“OCR-IT LLC takes over OCR Cloud 2.0 with the creation of the second generation of capabilities,” said OCR-IT representative. “With the continued participation of the WiseTREND team, we will continue full-force development of this API so that it will fulfill its promise as one of the most powerful cloud-based OCR platforms in the marketplace today.”</p>
<p>Developers can sign up for a FREE development account and start integration into their applications immediately. With a few lines of code, programmers can start processing images form any Web-enabled computer or mobile device. Full documentation and comprehensive code samples in different programming languages are provided. For more information and additional documentation, visit: <a href="http://www.ocr-it.com/ocr-cloud-2-0-api" target="_blank">http://www.ocr-it.com/ocr-cloud-2-0-api</a></p>
<p>About WiseTREND (the developer and lead partner in OCR Cloud 2.0 API initiative)<br />
WiseTREND, a power-integrator of OCR, document recognition, form processing and data capture technologies and solutions, works with a variety of organizations to successfully utilize the most powerful technologies of today and create competitive technologies of tomorrow. The company helps to solve business challenges through state-of-the-art document processing solutions, techniques, efficiency, and innovation. For additional company and product information, please visit: http://www.wisetrend.com</p>
]]></content:encoded>
			<wfw:commentRss>http://www.ocr-it.com/ocr-it-announces-a-powerful-cloud-based-ocr-api-for-ad-hoc-integrators-and-mobile-developers/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

