Answer on StackOverflow: Detecting text in images

This question was answered on www.StackOverflow.com QUESTION Is there any good way of detecting whether an image contains text or not? I’m not looking for a way to retrieve the text, only to detect if there is one or more characters present in the image. I can understand that there is no foolproof way of detecting text, like when the font is a bit off standard; it might be hard to recognize. I’m after a “as good as can be” solution. See examples of text in images below: ANSWER Detecting if there is text is nearly the same as extracting the text, i.e. if you are able to extract text, it confirms that there is text. Detecting the text is roughly 90% same steps as extracting the text, the last 10% being some optimizations for specific languages and text types within OCR to produce better text recognition. Most of the heavy lifting happens at the beginning of the process, specifically image binarization and backgrounds removal, segmentation into objects, document analysis for layout, object type detection, and recognition of each object separately. For background information, take a look at the blog post I wrote many months ago about detecting and extracting various text via OCR from complex pictures and images: http://www.ocr-it.com/user-scenario-process-digital-camera-pictures-and-ocr-to-extract-specific-numbers For given images, take these steps one after another, and you will be able to decide if today’s technology can see text in these, and any other pictures. Binarization. Convert images to black & white. After this conversion can you see printed text characters. If no – end of of process – no text can be detected. If yes, proceed...

Analysis of Picture Taking Instructions in various mobile apps

NOTE: This report, with additional screenshots and illustrations, is available in PDF form here [su_document url=”http://www.ocr-it.com/wp-content/uploads/2014/08/OCR-IT-Analysis-of-Picture-Taking-Instructions-Screen-in-Mobile-Applications.pdf” height=”200″]   The purpose of this document is a) analyze various approaches mobile developers take to instruct and guide their users towards taking better pictures on mobile devices, and b) to establish suggested mechanism of instructing users how to take better pictures. Current Applications Analysis (Android Play Store)   CamScanner Includes ‘first run’ swipeable instructions. No separate picture taking instructions. Custom camera screen. CMC Image Scanner Includes ‘first run’ scrollable instructions, with “do not show again” option. No separate picture taking instructions. Uses default camera screen. Document scanner No ‘first run’ instructions. No separate picture taking instructions. Uses default camera screen. Document Scanner: Trial No ‘first run’ instructions. No separate picture taking instructions. Uses default camera screen. Droid Scan Lite Has ‘first run’ instructions and info. No separate picture taking instructions. Uses default camera screen. Handy Scanner Includes ‘first run’ swipeable instructions. No separate picture taking instructions. Custom camera screen. MDScan Lite No ‘first run’ instructions. No separate picture taking instructions. Custom camera screen. Mobile Scanner No ‘first run’ instructions. No separate picture taking instructions. Custom camera screen. Portable Scanner Lite Has ‘first run’ instructions that disappear after at least one scan is added, and reappear when app has no documents in it. No separate picture taking instructions. Custom camera screen. Scan to PDF Free No ‘first run’ instructions. No separate picture taking instructions. Uses default camera screen. YunMaiDoc – ScanDoc Document Reader US/EU No ‘first run’ instructions. Has a separate picture taking instructions screen. Custom camera screen. NOTE 1: This app had picture...