Methods and systems for identifying text orientation in a digital image
First Claim
Patent Images
1. A method for determining a text orientation in a digital image, said method comprising:
- in a first text line comprising a first plurality of text characters in a digital image, determining a first text-line orientation of said first text line, wherein said determining said first text-line orientation comprises;
determining a text-line bounding box for said first text line;
calculating an aspect ratio for said text-line bounding box; and
calculating said first text-line orientation based on said aspect ratio;
determining, for each of said text characters in said first plurality of text characters, a first-edge position measurement corresponding to a bounding edge associated with a first side of said first text line, thereby producing a plurality of first-edge position measurements;
determining, for each of said text characters in said first plurality of text characters, a second-edge position measurement corresponding to a bounding edge associated with a second side of said first text line, thereby producing a plurality of second-edge position measurements;
computing a first first-alignment feature value relative to said first text-line orientation for said first text line in said digital image, wherein said computing a first first-alignment feature comprises;
calculating a sample mean for said plurality of first-edge position measurements, thereby producing a ceiling measurement; and
calculating an error measure between said ceiling measurement and said plurality of first-edge position measurements, thereby producing said first first-alignment feature value;
computing a first second-alignment feature value relative to said first text-line orientation for said first text line in said digital image, wherein said computing a first second-alignment feature comprises;
calculating a sample mean for said plurality of second-edge position measurements, thereby producing a floor measurement; and
calculating an error measure between said floor measurement and said plurality of second-edge position measurements, thereby producing said first second-alignment feature value; and
determining a first text orientation of said first plurality of text characters in said digital image based on said first first-alignment feature value and said first second-alignment feature value, wherein said determining said first text orientation comprises determining a baseline-side of said first text line, wherein said determining said baseline-side of said first text line is based on the relative values of said first first-alignment feature value and said first second-alignment feature value and a relative frequency of occurrence of text characters with ascenders and text characters with descenders in a written language.
2 Assignments
0 Petitions
Accused Products
Abstract
Aspects of the present invention relate to systems and methods for determining text orientation in a digital image.
70 Citations
21 Claims
-
1. A method for determining a text orientation in a digital image, said method comprising:
-
in a first text line comprising a first plurality of text characters in a digital image, determining a first text-line orientation of said first text line, wherein said determining said first text-line orientation comprises; determining a text-line bounding box for said first text line; calculating an aspect ratio for said text-line bounding box; and calculating said first text-line orientation based on said aspect ratio; determining, for each of said text characters in said first plurality of text characters, a first-edge position measurement corresponding to a bounding edge associated with a first side of said first text line, thereby producing a plurality of first-edge position measurements; determining, for each of said text characters in said first plurality of text characters, a second-edge position measurement corresponding to a bounding edge associated with a second side of said first text line, thereby producing a plurality of second-edge position measurements; computing a first first-alignment feature value relative to said first text-line orientation for said first text line in said digital image, wherein said computing a first first-alignment feature comprises; calculating a sample mean for said plurality of first-edge position measurements, thereby producing a ceiling measurement; and calculating an error measure between said ceiling measurement and said plurality of first-edge position measurements, thereby producing said first first-alignment feature value; computing a first second-alignment feature value relative to said first text-line orientation for said first text line in said digital image, wherein said computing a first second-alignment feature comprises; calculating a sample mean for said plurality of second-edge position measurements, thereby producing a floor measurement; and calculating an error measure between said floor measurement and said plurality of second-edge position measurements, thereby producing said first second-alignment feature value; and determining a first text orientation of said first plurality of text characters in said digital image based on said first first-alignment feature value and said first second-alignment feature value, wherein said determining said first text orientation comprises determining a baseline-side of said first text line, wherein said determining said baseline-side of said first text line is based on the relative values of said first first-alignment feature value and said first second-alignment feature value and a relative frequency of occurrence of text characters with ascenders and text characters with descenders in a written language. - View Dependent Claims (7, 9, 11, 12, 13, 14, 16)
-
-
2. A method for determining a text orientation in a digital image, said method comprising:
-
in a first text line comprising a first plurality of text characters in a digital image, determining a first text-line orientation of said first text line; determining, for each of said text characters in said first plurality of text characters, a first-edge position measurement corresponding to a bounding edge associated with a first side of said first text line, thereby producing a plurality of first-edge position measurements; determining, for each of said text characters in said first plurality of text characters, a second-edge position measurement corresponding to a bounding edge associated with a second side of said first text line, thereby producing a plurality of second-edge position measurements;
orientation for said first text line in said digital image, wherein said computing first-alignment feature comprises;calculating a sample mean for said plurality of first-edge position measurements, thereby producing a ceiling measurement; and calculating an error measure between said ceiling measurement and said plurality of first-edge position measurements, thereby producing said first first-alignment feature value; computing a first second-alignment feature value relative to said first text-line orientation for said first text line in said digital image, wherein said computing a first second-alignment feature comprises; calculating a sample mean for said plurality of second-edge position measurements, thereby producing a floor measurement; and calculating an error measure between said floor measurement and said plurality of second-edge position measurements, thereby producing said first second-alignment feature value; determining a first text orientation of said first plurality of text characters in said digital image based on said first first-alignment feature value and said first second-alignment feature value; and wherein said determining a first text-line orientation comprises; determining a text-line bounding box for said first text line; calculating an aspect ratio for said text-line bounding box; and determining said first text-line orientation based on said aspect ratio. - View Dependent Claims (6, 8, 10)
-
-
3. A method for determining a text orientation in a digital image, said method comprising:
-
in a first text line comprising a first plurality of text characters in a digital image, determining a first text-line orientation of said first text line, wherein said determining said first text-line orientation comprises; determining a text-line bounding box for said first text line; calculating an aspect ratio for said text-line bounding box; and calculating said first text-line orientation based on said aspect ratio; determining a first-side reference line for a first side of said first text line, said first-side reference line characterized by a first-side-reference-line position measurement; determining a second-side reference line for a second side of said first text line, said second-side reference line characterized by a second-side-reference-line position measurement; determining, for each of said first plurality of text characters, a first-edge position measurement corresponding to a bounding edge associated with a first side of said first text line, thereby producing a plurality of first-edge position measurements; determining, for each of said first plurality of text characters, a second-edge position measurement corresponding to a bounding edge associated with a second side of said first text line, thereby producing a plurality of second-edge position measurements; computing a first first-alignment feature value relative to said first text-line orientation for said first text line in said digital image, wherein said computing a first first-alignment feature comprises; calculating a difference between each of said plurality of first-edge position measurements and said first-side-reference-line position measurement, thereby producing a first plurality of difference measurements; calculating a first maximum, said first maximum corresponding to the maximum value of said first plurality of difference measurements; calculating the absolute value of the difference between each of said first plurality of difference measurements and said first maximum, thereby producing a first plurality of difference-from-maximum values; and summing said first plurality of difference-from-maximum values, thereby producing said first first-alignment feature value; computing a first second-alignment feature value relative to said first text-line orientation for said first text line in said digital image, wherein said computing a first second-alignment feature comprises; calculating a difference between each of said plurality of second-edge position measurements and said second-side-reference-line position measurement, thereby producing a second plurality of difference measurements; calculating a second maximum, said second maximum corresponding to the maximum value of said second plurality of difference measurements; calculating the absolute value of the difference between each of said second plurality of difference measurements and said second maximum, thereby producing a second plurality of difference-from-maximum values; and summing said second plurality of difference-from-maximum values, thereby producing said first second-alignment feature value; and determining a first text orientation of said first plurality of text characters in said digital image based on said first first-alignment feature value and said first second-alignment feature value, wherein said determining said first text orientation comprises determining a baseline-side of said first text line, wherein said determining said baseline-side of said first text line is based on the relative values of said first first-alignment feature value and said first second-alignment feature value and a relative frequency of occurrence of text characters with ascenders and text characters with descenders in a written language. - View Dependent Claims (15, 17, 18, 19)
-
-
4. A system for determining a text orientation in a digital image, said system comprising a non-transitory computer-readable medium comprising:
-
a text-line orientation determiner for determining a first text-line orientation of a first text line in a digital image, wherein said first text line comprises a first plurality of text characters; a bounding-box determiner for determining a bounding box for each of said first plurality of text characters, thereby producing a plurality of bounding boxes, wherein each of said bounding boxes comprises; a first edge, said first edge characterized by a first-edge position measurement, thereby producing a plurality of first-edge position measurements, and said first edge associated with a first side of said first text line; and a second edge, said second edge characterized by a second-edge position measurement, thereby producing a plurality of second-edge position measurements, and said second edge associated with a second side of said first text line; a first alignment feature calculator for computing a first-alignment feature value relative to said first text-line orientation for said first text line in said digital image, wherein first-alignment feature calculator comprises; a first sample-mean calculator for calculating a sample mean for said plurality of first-edge position measurements, thereby producing a ceiling measurement; and a first error-measure calculator for calculating an error measure between said ceiling measurement and said plurality of first-edge position measurements, thereby producing said first first-alignment feature value; a second alignment feature calculator for computing a second-alignment feature value relative to said first text-line orientation for said first text line in said digital image, wherein said second-alignment feature calculator comprises; a second sample-mean calculator for calculating a sample mean for said plurality of second-edge position measurements, thereby producing a floor measurement; and a second error-measure calculator for calculating an error measure between said floor measurement and said plurality of second-edge position measurements, thereby producing said first second-alignment feature value; a text orientation determiner for determining a text orientation of said first plurality of text characters in said digital image based on said first first-alignment feature value and said first second-alignment feature value, wherein said determining said text orientation comprises determining a baseline-side of said first text line, wherein said determining said baseline-side of said first text line is based on the relative values of said first-alignment feature value and said second-alignment feature value and a relative frequency of occurrence of text characters with ascenders and text characters with descenders in a written language; and wherein said text-line orientation determiner comprises; a text-line bounding box determiner for determining a text-line bounding box for said first text line; an aspect-ratio calculator for calculating an aspect ratio for said text-line bounding box; and wherein said text-line orientation determiner determines said text-line orientation based on said aspect ratio. - View Dependent Claims (20)
-
-
5. A system for determining a text orientation in a digital image, said system comprising a non-transitory computer-readable medium comprising:
-
a text-line orientation determiner for determining a first text-line orientation of a first text line in a digital image, wherein said first text line comprises a first plurality of text characters; a text-line bounding box determiner for determining a first-text-line bounding box for said first text line, wherein said first-text-line bound box comprises; a first-text-line first edge, said first-text-line first edge characterized by a first-text-line-first-edge position measurement and said first-text-line first edge associated with a first text-line-side of said first text line; and a first-text-line second edge, said first-text-line second edge characterized by a first-text-line-second-edge position measurement and associated with a second text-line-side of said first text line; a character-bounding-box determiner for determining a bounding box for each of said first plurality of text characters, thereby producing a plurality of bounding boxes, wherein each of said bounding boxes comprises; a first edge, said first edge characterized by a first-edge position measurement, thereby producing a plurality of first-edge position measurements, and said first edge associated with a first side of said first text line; and a second edge, said second edge characterized by a second-edge position measurement, thereby producing a plurality of second-edge position measurements, and said second edge associated with a second side of said first text line; a first alignment feature calculator for computing a first alignment feature value relative to said first text-line orientation for said first text line in said digital image, wherein said first alignment feature calculator comprises; a first difference calculator for calculating a difference between each of said plurality of first-edge position measurements and said first-text-line first-edge position measurement, thereby producing a first plurality of difference measurements; a first maximum calculator for calculating a first maximum, said first maximum corresponding to the maximum value of said first plurality of difference measurements; a first absolute-value calculator for calculating the absolute value of the difference between each of said first plurality of difference measurements and said first maximum, thereby producing a first plurality of difference-from-maximum values; and a first accumulator for summing said first plurality of difference-from-maximum values, thereby producing said first first-alignment feature value; a second alignment feature calculator for computing a second alignment feature value relative to said first text-line orientation for said first text line in said digital image, wherein said second alignment feature calculator comprises; a second difference calculator for calculating a difference between each of said plurality of second-edge position measurements and said first-text-line second-edge position measurement, thereby producing a second plurality of difference measurements; a second maximum calculator for calculating a second maximum, said second maximum corresponding to the maximum value of said second plurality of difference measurements; a second absolute-value calculator for calculating the absolute value of the difference between each of said second plurality of difference measurements and said second maximum, thereby producing a second plurality of difference-from-maximum values; and a second accumulator for summing said second plurality of difference-from-maximum values, thereby producing said first second-alignment feature value; and a text orientation determiner for determining a text orientation of said first plurality of text characters in said digital image based on said first first-alignment feature value and said first second-alignment feature value, wherein said determining said text orientation comprises determining a baseline-side of said first text line, wherein said determining said baseline-side of said first text line is based on the relative values of said first alignment feature value and said second alignment feature value and a relative frequency of occurrence of text characters with ascenders and text characters with descenders in a written language; and wherein said text-line orientation determiner comprises; a text-line bounding box determiner for determining a text-line bounding box for said first text line; an aspect-ratio calculator for calculating an aspect ratio for said text-line bounding box; and wherein said text-line orientation determiner determines said text-line orientation based on said aspect ratio. - View Dependent Claims (21)
-
Specification