Method of handling complex variants of words through prefix-tree based decoding for Devanagiri OCR
First Claim
1. A method to decode text in real world images, the method comprising:
- receiving a natural image of a scene of real world captured by a camera of a mobile device, the natural image comprising text pixels in one or more text regions and further comprising non-text pixels;
detecting in the natural image, a text region comprising a first sequence of characters of a predetermined script in which a predetermined language is expressed;
selecting a second sequence of characters from a predetermined set of sequences, based on the second sequence of characters matching the first sequence of characters, wherein at least one of the sequences in the predetermined set is associated with information on location of one or more modifiers;
after the selecting and responsive to the second sequence being associated with the information, analyzing the natural image to determine whether at least one pixel satisfies a predetermined test associated with said information;
adding a modifier to a specific character in a copy of the second sequence of characters, when the predetermined test is satisfied; and
identifying said first sequence of characters detected in said text region, as a word in the predetermined language comprising the copy of said second sequence of characters with the modifier;
wherein one or more of the receiving, the detecting, the selecting, the analyzing, the adding, and the identifying are performed by at least one processor coupled to a memory.
1 Assignment
0 Petitions
Accused Products
Abstract
An electronic device and method identify a block of text in a portion of an image of real world captured by a camera of a mobile device, slice sub-blocks from the block and identify characters in the sub-blocks that form a first sequence to a predetermined set of sequences to identify a second sequence therein. The second sequence may be identified as recognized (as a modifier-absent word) when not associated with additional information. When the second sequence is associated with additional information, a check is made on pixels in the image, based on a test specified in the additional information. When the test is satisfied, a copy of the second sequence in combination with the modifier is identified as recognized (as a modifier-present word). Storage and use of modifier information in addition to a set of sequences of characters enables recognition of words with or without modifiers.
-
Citations
29 Claims
-
1. A method to decode text in real world images, the method comprising:
-
receiving a natural image of a scene of real world captured by a camera of a mobile device, the natural image comprising text pixels in one or more text regions and further comprising non-text pixels; detecting in the natural image, a text region comprising a first sequence of characters of a predetermined script in which a predetermined language is expressed; selecting a second sequence of characters from a predetermined set of sequences, based on the second sequence of characters matching the first sequence of characters, wherein at least one of the sequences in the predetermined set is associated with information on location of one or more modifiers; after the selecting and responsive to the second sequence being associated with the information, analyzing the natural image to determine whether at least one pixel satisfies a predetermined test associated with said information; adding a modifier to a specific character in a copy of the second sequence of characters, when the predetermined test is satisfied; and identifying said first sequence of characters detected in said text region, as a word in the predetermined language comprising the copy of said second sequence of characters with the modifier; wherein one or more of the receiving, the detecting, the selecting, the analyzing, the adding, and the identifying are performed by at least one processor coupled to a memory. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. At least one non-transitory computer readable storage media comprising a plurality of instructions to be executed by at least one processor to decode text in real world images, the plurality of instructions comprising:
-
first instructions to receive a natural image of a scene of real world captured by a camera of a mobile device, the natural image comprising text pixels in one or more text regions and further comprising non-text pixels; second instructions to detect in the natural image, a text region comprising a first sequence of characters of a predetermined script in which a predetermined language is expressed; third instructions to select a second sequence of characters from a predetermined set of sequences, based on the second sequence of characters matching the first sequence of characters, wherein at least one of the sequences in the predetermined set is associated with information on location of one or more modifiers; fourth instructions, to be executed after the third instructions and responsive to the second sequence being associated with the information, to analyze the natural image to determine whether at least one pixel satisfies a predetermined test associated with said information; fifth instructions to add a modifier to a specific character in a copy of the second sequence of characters, when the predetermined test is satisfied; and sixth instructions to identify said first sequence of characters detected in said text region, as a word in the predetermined language comprising the copy of said second sequence of characters with the modifier. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A mobile device to decode text in real world images, the mobile device comprising:
-
a camera; a memory operatively connected to the camera to receive at least a natural image of a scene therefrom, the natural image comprising text pixels in one or more text regions and further comprising non-text pixels; at least one processor operatively connected to the memory to execute a plurality of instructions stored in the memory; wherein the plurality of instructions cause the at least one processor to; detect in the natural image, a text region comprising a first sequence of characters of a predetermined script in which a predetermined language is expressed; select a second sequence of characters from a predetermined set of sequences, based on the second sequence of characters matching the first sequence of characters, wherein at least one of the sequences in the predetermined set is associated with information on location of one or more modifiers; after selection of the second sequence of characters and responsive to the second sequence being associated with the information, analyzing the natural image to determine whether at least one pixel satisfies a predetermined test associated with said information; add a modifier to a specific character in a copy of the second sequence of characters, when the predetermined test is satisfied; and identify said first sequence of characters detected in said text region, as a word in the predetermined language comprising the copy of said second sequence of characters with the modifier. - View Dependent Claims (18, 19, 20, 21, 22)
-
-
23. An apparatus to decode text in real world images, the apparatus comprising:
-
a memory storing a natural image of a scene outside the apparatus, the natural image comprising text pixels in one or more text regions and further comprising non-text pixels; means for detecting in the natural image, a text region comprising a first sequence of characters of a predetermined script in which a predetermined language is expressed; means for selecting a second sequence of characters from a predetermined set of sequences, based on the second sequence of characters matching the first sequence of characters, wherein at least one of the sequences in the predetermined set is associated with information on location of one or more modifiers; means, operable after the means for selecting and responsive to the second sequence being associated with the information, for analyzing the natural image to determine whether at least one pixel satisfies a predetermined test associated with said information; means for adding a modifier to a specific character in a copy of the second sequence of characters, when the predetermined test is satisfied; and means for identifying said first sequence of characters detected in said text region, as a word in the predetermined language comprising the copy of said second sequence of characters with the modifier. - View Dependent Claims (24, 25, 26, 27, 28, 29)
-
Specification