optical character recognition

Home > ... > Science and Technology > Computers and Electrical Engineering > Computers and Computing > ...

optical character recognition

The Columbia Encyclopedia, Sixth Edition | 2008 | The Columbia Encyclopedia, Sixth Edition. Copyright 2008 Columbia University Press. (Hide copyright information) Copyright

optical character recognition (OCR), method for the machine-reading of typeset, typed, and, in some cases, hand-printed letters, numbers, and symbols using optical sensing and a computer. The light reflected by a printed text, for example, is recorded as patterns of light and dark areas by an array of photoelectric cells in a optical scanner. A computer program analyzes the patterns and identifies the characters they represent, with some tolerance for less than perfect and uniform text. OCR is also used to produce text files from computer files that contain images of alphanumeric characters, such as those produced by fax transmissions. See also computer graphics ; pen-based computer ; personal digital assistant .

Hide all research tools
Print this article Print all entries for this topic Cite this article Link to this article
Link to this article

CloseClose

Create a link to this page

Copy and paste this link tag into your Web page or blog:

<a href="http://www.encyclopedia.com/topic/.aspx#1E1-opticalc" title="Facts and information about optical character recognition">optical character recognition</a>

Add this article to Del.icio.usBookmark this article on DiigoShare this article on FacebookSubmit this article to RedditGive this article a thumbs-up on StumbleUpon
Show all research tools

Cite this article
Pick a style below, and copy the text for your bibliography.

  • MLA
  • Chicago
  • APA

"optical character recognition." The Columbia Encyclopedia, Sixth Edition. 2008. Encyclopedia.com. 27 Dec. 2009 <http://www.encyclopedia.com>.

"optical character recognition." The Columbia Encyclopedia, Sixth Edition. 2008. Encyclopedia.com. (December 27, 2009). http://www.encyclopedia.com/doc/1E1-opticalc.html

"optical character recognition." The Columbia Encyclopedia, Sixth Edition. 2008. Retrieved December 27, 2009 from Encyclopedia.com: http://www.encyclopedia.com/doc/1E1-opticalc.html

Learn more about citation styles

optical character recognition

A Dictionary of Business and Management | 2006 | © A Dictionary of Business and Management 2006, originally published by Oxford University Press 2006. (Hide copyright information) Copyright

optical character recognition (OCR) The recognition of printed characters by a light-sensitive optical scanner. The scanner recognizes the shape of a letter by scanning it with a very fine point of light. It then uses a computer to compare the pattern of reflected light with the patterns of the letters of the alphabet stored in its memory. OCR is often used to read responses on questionnaires, thus reducing human error and increasing the speed of analysis.

Hide all research tools
Print this article Print all entries for this topic Cite this article Link to this article
Link to this article

CloseClose

Create a link to this page

Copy and paste this link tag into your Web page or blog:

<a href="http://www.encyclopedia.com/topic/.aspx#1O18-opticalcharacterrecognitn" title="Facts and information about optical character recognition">optical character recognition</a>

Add this article to Del.icio.usBookmark this article on DiigoShare this article on FacebookSubmit this article to RedditGive this article a thumbs-up on StumbleUpon
Show all research tools

Cite this article
Pick a style below, and copy the text for your bibliography.

  • MLA
  • Chicago
  • APA

"optical character recognition." A Dictionary of Business and Management. 2006. Encyclopedia.com. 27 Dec. 2009 <http://www.encyclopedia.com>.

"optical character recognition." A Dictionary of Business and Management. 2006. Encyclopedia.com. (December 27, 2009). http://www.encyclopedia.com/doc/1O18-opticalcharacterrecognitn.html

"optical character recognition." A Dictionary of Business and Management. 2006. Retrieved December 27, 2009 from Encyclopedia.com: http://www.encyclopedia.com/doc/1O18-opticalcharacterrecognitn.html

Learn more about citation styles

Optical Character Recognition

A Dictionary of Computing | 2004 | | © A Dictionary of Computing 2004, originally published by Oxford University Press 2004. (Hide copyright information) Copyright

Optical Character Recognition

Optical Character Recognition (OCR) uses a device that reads pencil marks and converts them into a computer-usable form. OCR technology recognizes characters on a source document using the optical properties of the equipment and media. OCR improves the accuracy of data collection and reduces the time required by human workers to enter the data.

Although OCR is used for high-speed data entry, it did not begin with the computer industry. The beginnings of OCR can be traced back to 1809 when the first patents for devices to aid the blind were awarded. In 1912 Emmanuel Goldberg patented a machine that read characters, converted them into standard telegraph code, and then transmitted telegraphic messages over wires without human intervention. In 1914 Fournier D'Albe invented an OCR device called the optophone that produced sounds. Each sound corresponded to a specific letter or character. After learning the character equivalent for various sounds, visually impaired people were able to "read" the printed material. Developments in OCR continued throughout the 1930s, becoming more important with the beginnings of the computer industry in the 1940s. OCR development in the 1950s attempted to address the needs of the business world.

Methods for Recording Data

OCR requires hardware, in the form of a scanning device, and software to convert the images and character data from the source document into a digital form. Three primary methods are used to record data on a source document to be read by an OCR device. These include optically readable marks, bar codes, and optically readable characters, including handwritten characters.

Optical mark recognition (OMR) uses OMR paper, sometimes called a "mark sense form." This paper has a series of rectangular shapes that are filled in using a pencil. The completed form is then fed through a scanning device that reads the filled-in rectangles. The software of the OMR scanning device can perform an elementary statistical analysis of the data. OMR technology is commonly used to score standardized tests, such as the Scholastic Aptitude Test (SAT) and Graduate Management Aptitude Test (GMAT), quickly and accurately.

Bar codes are zebra-striped marks of various widths that appear on, or are attached to, most manufactured retail products. The most common use of the bar code is the 10-digit Universal Product Code (UPC) . Other kinds of bar code systems are used in a variety of placesfrom overnight mail packages to airplane luggage tags. The width and combination of the stripes on the bar code represent data. A bar code reader consists of a scanner and decoder. The scanner emits a beam of light that is swept past the bar code and senses light reflections to distinguish between the bars and spaces. A photo detector converts the spaces into an electrical signal and the bars into the absence of an electrical signal. The decoder analyzes the signal patterns to validate and interpret the corresponding data.

Some OCR readers can convert typed and handwritten documents into digital data. These readers scan the shape of a character on a document, compare the scanned character with a pre-defined shape, and convert the character into its corresponding bit pattern for storage in main computer memory. This technology is still in development; handwritten documents do not scan with 100 percent accuracy.

A special type of OCR, magnetic ink character recognition (MICR), is used by several industries, including banks. The enormous amount of paper in the form of checks, loans, and bank statements, combined with the need for accurate and quick processing, prompted the banking industry to seek new ways to manage the flow of paper. In 1956 the American Bankers Association recommended adopting magnetic ink for high-speed automatic character recognition, resulting in MICR. With MICR, data are recorded using a magnetic ink that is readable by either a scanning device or a person. On bank checks, which represent the most common use of MICR, characters in magnetic ink detail the bank's identification number, the individual's account number, and the check number. Checks can be scanned and the data are quickly and accurately read into a computer for further processing.

Another use of OCR allows printed documentssuch as text, images, or photographsto be stored in a computer. Either hand-held scanners or page scanners are used to convert physical documents into computer-readable forms. Page scanners are stationary. The page is typically placed face down on the glass plate of the scanner and then scanned. Hand-held scanners are manually moved over the document. Both types of scanners can convert monochrome or color pictures, forms, text, and other images into machine-readable digital data. The data can then be modified, saved, and distributed over computer networks.

see also Artificial Intelligence; Input Devices; Pattern Recognition; Virtual Reality; Virtual Reality in Education.

Terri L. Lenox and Charles R. Woratschek

Bibliography

Schantz, Herbert F. The History of OCR, Optical Character Recognition. Manchester Center, VT: Recognition Technologies Users Association, 1982.

Shelly, Gary B., and Thomas J. Cashman. Introduction to Computers and Data Processing. Brea, CA: Anaheim Publishing Company, 1980.

Stair, Ralph M., and George W. Reynolds. Principles of Information Systems: A Managerial Approach, 5th ed. Boston: Course TechnologyITP, 2001.

Hide all research tools
Print this article Print all entries for this topic Cite this article Link to this article
Link to this article

CloseClose

Create a link to this page

Copy and paste this link tag into your Web page or blog:

<a href="http://www.encyclopedia.com/topic/.aspx#1G2-3401200249" title="Facts and information about optical character recognition">optical character recognition</a>

Add this article to Del.icio.usBookmark this article on DiigoShare this article on FacebookSubmit this article to RedditGive this article a thumbs-up on StumbleUpon
Show all research tools

Cite this article
Pick a style below, and copy the text for your bibliography.

  • MLA
  • Chicago
  • APA

Lenox, Terri L.; Charles R. Woratschek. "Optical Character Recognition." Computer Sciences. The Gale Group Inc. 2002. Encyclopedia.com. 27 Dec. 2009 <http://www.encyclopedia.com>.

Lenox, Terri L.; Charles R. Woratschek. "Optical Character Recognition." Computer Sciences. The Gale Group Inc. 2002. Encyclopedia.com. (December 27, 2009). http://www.encyclopedia.com/doc/1G2-3401200249.html

Lenox, Terri L.; Charles R. Woratschek. "Optical Character Recognition." Computer Sciences. The Gale Group Inc. 2002. Retrieved December 27, 2009 from Encyclopedia.com: http://www.encyclopedia.com/doc/1G2-3401200249.html

Learn more about citation styles

Related topics

  Edit this list

Related articles from newspapers, magazines, and more

Japanese Inventor Develops Optical Character Recognition System
News Wire article from: US Fed News Service, Including US State News; 5/16/2007; 500 words ; ...developed an image scanner and an optical character recognition system using said image scanner...providing an image scanner and an optical recognition system using said...intended region' to carry out character recognition and can carry out...
OmniPage is OCR that meets your needs. (Caere Corp.'s OmniPage Professional 2.0 scanning software)(optical character recognition) (Software Review) (Evaluation)
Magazine article from: Computer Shopper; 4/1/1992; ; 700+ words ; Optical character recognition (OCR...percent of the characters it processes...2,000-character page. A fast...dedicated to the optical recognition...1,980-character page in just...better than 22 characters per second...
OPTICAL CHARACTER RECOGNITION MOTIVATED BY PRIMATE VISUAL SYSTEM
Magazine article from: Neural Network World; 9/1/2007; ; 700+ words ; ...inspired approach to optical character recognition is proposed in this...1. Introduction Optical character recognition (OCR...tried to deal with character recognition from...handwritten alphanumeric characters even in the presence...
Optical character recognition: the OCR component of an identification or inspection system is an important element in boosting overall quality.(SOFTWARE)
Magazine article from: Quality; 5/1/2008; ; 700+ words ; Optical character recognition (OCR) is the...of defining a character font. Whether...like Asian characters), whether it...and touching characters. On the other...image) for each character present in the...
Read-It O.C.R. Pro 5.0. (optical character recognition software) (Software Review) (Evaluation)
Magazine article from: Macworld; 4/1/1995; ; 700+ words ; Optical Character Recognition Software PROS: Instant OCR with Read...General-purpose, accurate optical character recognition of printed text...While I don't necessarily expect any optical character recognition program to get...
LIGATURE SOFTWARE ANNOUNCES COMPETITIVE UPGRADE FOR USERS OF OPTICAL CHARACTER RECOGNITION SOFTWARE
PR Newswire; 9/11/1995; 700+ words ; ...for users of Optical Character Recognition (OCR) software...Western European character sets, sophisticated...up to 300 characters/second...read printed characters. "This upgrade...Western European character sets, and...
Caere Announces OmniPage for Arabic at CeBIT '96; Caere Provides First OmniFont Optical Character Recognition Software for the Arabic Language.
Business Wire; 3/13/1996; 700+ words ; ...recognized leader in optical character recognition (OCR) technology...engine to recognize each character in each font -- through...performed for each specific character in each font -- even...recognizes essentially any character in virtually any typeface...isolating individual ...
Cognex Expands Vision Sensor Capabilities With New Optical Character Verification and Character Recognition Tools.
Business Wire; 10/2/2001; 700+ words ; ...high-performance optical character verification (OCV) and optical character recognition (OCR) software tools...provide unmatched character verification and reading...legibility of printed characters in a number of industries...
Evaluating the new OCR desktop programs; today's optical character recognition software can read print better, and cheaper, than ever.
Magazine article from: Folio: the Magazine for Magazine Management; 6/1/1991; ; 700+ words ; Today's optical character recognition software can read print...cheaper, than ever Optical character recognition...analyze the shape of a character in terms of lines and...errors"-reading a character as the wrong letter...material, present their characters to tile ...
Israeli Inventors Develop Optical Character Recognition Errors Automatic Correction Method
News Wire article from: US Fed News Service, Including US State News; 8/5/2008; 526 words ; ...system for optical character recognition. According...for encoding characters includes identifying...sequences of the character codes that...respective extension character code with each...containing characters is divided...

Pictures from Google Image Search

Click to see an enlarged picture
Click to see an enlarged picture
Click to see an enlarged picture

For students and teachers!

Encyclopedia.com provides students and teachers facts, information, and biographies from verified, citable sources, including:

Encyclopedia.com provides students and teachers facts, information, and biographies from verified, citable sources, including:

Popular on Newser: