REDUNDANT ASPECT RATIO DECODING OF DEVANAGARI CHARACTERS
First Claim
1. A method to identify words of text in images, the method comprising:
- receiving a rectangular portion of an image of a scene of real world captured by a camera;
using the rectangular portion to determine whether a predetermined test is satisfied;
when the predetermined test is not satisfied, operating an OCR decoder on a block, and storing in a first hypothesis in memory, first candidates for recognition as a character in the block;
when the predetermined test is satisfied, operating one or more OCR decoders, and storing in the first hypothesis the first candidates to be recognized as the character in the block, and predetermined amount additionally storing second candidates to be recognized as the character in a second hypothesisusing a word decoder, to output a first word comprising at least one first candidate in the first hypothesis, and a first confidence level associated with the first word; and
using the word decoder a second time, to output a second word comprising at least one second candidate in the second hypothesis, and a second confidence level associated with the second word;
storing in memory, one of the first word and the second word identified as being recognized in the rectangular portion, based on at least comparison of the first confidence level and the second confidence level;
wherein at least the receiving, the checking, and the storing are performed by at least one processor.
1 Assignment
0 Petitions
Accused Products
Abstract
An electronic device and method receive a block sliced from a rectangular portion of an image of a scene of real world captured by a camera and use a property of the block to operate one of multiple optical character recognition (OCR) decoders. In an illustrative aspect, a first OCR decoder is configured to recognize characters whose property satisfies the test based on a first limit, the first limit being obtained by reducing a predetermined limit by an overlap amount. In this illustrative aspect, a second OCR decoder is configured to recognize characters whose property does not satisfy the test based on a second limit, the second limit being obtained by increasing the predetermined limit by the overlap amount. When the property of the block satisfies the test, the first OCR decoder is operated and alternatively the second OCR decoder is operated, resulting in candidates for a character being identified.
20 Citations
19 Claims
-
1. A method to identify words of text in images, the method comprising:
-
receiving a rectangular portion of an image of a scene of real world captured by a camera; using the rectangular portion to determine whether a predetermined test is satisfied; when the predetermined test is not satisfied, operating an OCR decoder on a block, and storing in a first hypothesis in memory, first candidates for recognition as a character in the block; when the predetermined test is satisfied, operating one or more OCR decoders, and storing in the first hypothesis the first candidates to be recognized as the character in the block, and predetermined amount additionally storing second candidates to be recognized as the character in a second hypothesis using a word decoder, to output a first word comprising at least one first candidate in the first hypothesis, and a first confidence level associated with the first word; and using the word decoder a second time, to output a second word comprising at least one second candidate in the second hypothesis, and a second confidence level associated with the second word; storing in memory, one of the first word and the second word identified as being recognized in the rectangular portion, based on at least comparison of the first confidence level and the second confidence level; wherein at least the receiving, the checking, and the storing are performed by at least one processor. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. At least one non-transitory computer readable storage media comprising a plurality of instructions to be executed by at least one processor to identify words of text in an image of a scene of real world, the plurality of instructions comprising:
-
instructions to receive a rectangular portion of an image of a scene of real world captured by a camera; instructions to use the rectangular portion to determine whether a predetermined test is satisfied; when the predetermined test is not satisfied, instructions to operate an OCR decoder on a block, and instructions to store in a first hypothesis in memory, first candidates for recognition as a character in the block; when the predetermined test is satisfied, instructions to operate one or more OCR decoders, and storing in the first hypothesis the first candidates to be recognized as the character in the block, and additionally storing second candidates to be recognized as the character in a second hypothesis instructions to use a word decoder, to output a first word comprising at least one first candidate in the first hypothesis, and a first confidence level associated with the first word; and instructions to use the word decoder a second time, to output a second word comprising at least one second candidate in the second hypothesis, and a second confidence level associated with the second word; instructions to store in memory, one of the first word and the second word identified as being recognized in the rectangular portion, based on at least comparison of the first confidence level and the second confidence level. - View Dependent Claims (10, 11, 12, 13, 14)
-
-
15. A mobile device to decode text in real world images, the mobile device comprising:
-
a camera; a memory operatively connected to the camera to receive at least an image therefrom, the image comprising one or more text regions; at least one processor operatively connected to the memory to execute a plurality of instructions stored in the memory; wherein the plurality of instructions cause the at least one processor to; receive a rectangular portion of an image of a scene of real world captured by a camera; use the rectangular portion to determine whether a predetermined test is satisfied; when the predetermined test is not satisfied, operate an OCR decoder on a block, and storing in a first hypothesis in memory, first candidates for recognition as a character in the block; when the predetermined test is satisfied, operate one or more OCR decoders, and storing in the first hypothesis the first candidates to be recognized as the character in the block, and additionally storing second candidates to be recognized as the character in a second hypothesis use a word decoder, to output a first word comprising at least one first candidate in the first hypothesis, and a first confidence level associated with the first word; and use the word decoder a second time, to output a second word comprising at least one second candidate in the second hypothesis, and a second confidence level associated with the second word; store in memory, one of the first word and the second word identified as being recognized in the rectangular portion, based on at least comparison of the first confidence level and the second confidence level. - View Dependent Claims (16, 17, 18)
-
-
19. A mobile device comprising:
-
a camera to capture an image of an environment outside the mobile device; a memory coupled to the camera for storing the image; means for receiving a rectangular portion of an image of a scene of real world captured by a camera; means for using the rectangular portion to determine whether a predetermined test is satisfied; responsive to the predetermined test being not satisfied, means for operating an OCR decoder on a block, and storing in a first hypothesis in memory, first candidates for recognition as a character in the block; responsive to the predetermined test is satisfied, means for operating one or more OCR decoders, and storing in the first hypothesis the first candidates to be recognized as the character in the block, and additionally storing second candidates to be recognized as the character in a second hypothesis means for using a word decoder, to output a first word comprising at least one first candidate in the first hypothesis, and a first confidence level associated with the first word; and means for using the word decoder a second time, to output a second word comprising at least one second candidate in the second hypothesis, and a second confidence level associated with the second word; and means for storing in memory, one of the first word and the second word identified as being recognized in the rectangular portion, based on at least comparison of the first confidence level and the second confidence level.
-
Specification