Method and apparatus for recognizing text in an image sequence of scene imagery
First Claim
Patent Images
1. Method for recognizing text in a captured imagery having a plurality of frames, said method using a processor to perform steps comprising of:
- (a) detecting a text region in a first frame of the plurality of frames;
(b) applying, using the processor, optical character recognition processing (OCR) to said detected text region to identify potential text for said first frame; and
(c) agglomerating the potential text with potential text for at least a second frame of the plurality of frames, in a manner that takes an OCR result from each of the first frame and the at least said second frame, to produce a single recognition result for text in the detected text region.
2 Assignments
0 Petitions
Accused Products
Abstract
An apparatus and a concomitant method for detecting and recognizing text information in a captured imagery. The present method transforms the image of the text to a normalized coordinate system before performing OCR, thereby yielding more robust recognition performance. The present invention also combines OCR results from multiple frames, in a manner that takes the best recognition results from each frame and forms a single result that can be more accurate than the results from any of the individual frames.
-
Citations
28 Claims
-
1. Method for recognizing text in a captured imagery having a plurality of frames, said method using a processor to perform steps comprising of:
-
(a) detecting a text region in a first frame of the plurality of frames; (b) applying, using the processor, optical character recognition processing (OCR) to said detected text region to identify potential text for said first frame; and (c) agglomerating the potential text with potential text for at least a second frame of the plurality of frames, in a manner that takes an OCR result from each of the first frame and the at least said second frame, to produce a single recognition result for text in the detected text region. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. Apparatus for recognizing text in a captured imagery having a plurality of frames, said apparatus comprising:
-
means for detecting a text region in a first frame of the plurality of frames; means for applying optical character recognition processing (OCR) to said detected text region to identify potential text for said first frame; and means for agglomerating the potential text with potential text for at least a second frame of the plurality of frames, in a manner that takes an OCR result from each of the first frame and the at least said second frame, to produce a single recognition result for text in the detected text region. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. Method for recognizing text in a captured imagery having a plurality of frames, said method using a processor to perform steps comprising of:
-
(a) detecting a text region in a frame of the captured imagery; (b) applying, using the processor, optical character recognition processing (OCR) to said detected text region to identify potential text for said frame; and (c) agglomerating the OCR identified potential text over a plurality of frames in the captured imagery to recognize the text in the detected text region, wherein said agglomerating step (c) comprises a step of; updating an agglomeration structure with said OCR identified potential text of a current frame, and wherein said updating step comprises steps of; (c1) finding correspondence between a text region of said agglomeration structure with a text region of said current frame; and (c2) finding character-to-character correspondence for each pair of overlapping lines between said text region of said agglomeration structure with said text region of said current frame to find one or more character group pairs. - View Dependent Claims (22, 23, 24)
-
-
25. Apparatus for recognizing text in a captured imagery having a plurality of frames, said apparatus comprising:
-
means for detecting a text region in a frame of the captured imagery; means for applying optical character recognition processing (OCR) to said detected text region to identify potential text for said frame; and means for agglomerating the OCR identified potential text over a plurality of frames in the captured imagery to extract the text in the detected text region, wherein said agglomerating means updates an agglomeration structure with said OCR identified potential text of a current frame and finds correspondence between a text region of said agglomeration structure with a text region of said current frame, and wherein said agglomerating means further finds character-to-character correspondence for each pair of overlapping lines between said text region of said agglomeration structure with said text region of said current frame to find one or more character group pairs. - View Dependent Claims (26, 27, 28)
-
Specification