JOCR is a specialized optical character recognition (ocr) software that can recognize characters from images taken on the computer's desktop.
Before we delve into the functionality it should be noted that JOCR requires the Microsoft Office Document Imaging component which is a component of Microsoft Office 2003 and newer Office versions. The Office Tool has to be installed for JOCR to function at all.
JOCR itself is a portable software program. The interface looks like that of a screen capturing software programs. It offers to capture a region, the desktop or the active window. The image is then displayed in the program's interface with options to print or copy it, or to recognize it which starts the OCR process.
Recognition is obviously the main part as it will analyze the image at hand to discover all the characters that it contains. The document imaging software displays the identified character count, and displays them in the default text editor from where they can be edited or copied easily.
The document imaging software can be useful in situations where text has to be copied from interface elements. This can be error messages, text that is displayed in applications or images. The recognition rate depends largely on the type of image and text used. It ranges from brilliant (almost no editing required) to weak (failed to recognize certain characters, lots of editing required).
JOCR is compatible with the following languages: Chinese, Czech, Danish, Dutch, English, Finnish, French, German, Greek, Hungarian, Italian, Japanese, Korean, Norwegian, Polish, Portuguese, Russian, Spanish, Swedish, Turkish.
JOCR is a well designed program that you may use to turn text on any image or program that you see on the Windows machine to text.
Note: Jocr has not been updated since 2006 which makes it likely that it won't work on newer versions of Windows or Office. A comparable solution is GOCR, an open source character recognition software that you can download from the project's Sourceforge page.
The program is available for Linux and Windows, and as source code. You need to load images into the program so that the textual contents of the images can be turned to plain text that you can copy. While it does not support screen capture functionality, it is easy enough to create a screenshot yourself, save it as an image file and load it into the program to get the text recognized.Advertisement
Ghacks is a technology news blog that was founded in 2005 by Martin Brinkmann. It has since then become one of the most popular tech news sites on the Internet with five authors and regular contributions from freelance writers.