Layout analysis
First Claim
1. A computer-implemented layout analyzer that is recorded on computer storage media and capable of execution by a computer, comprising the following computer-executable components:
- a connected component organizer that organizes connected components by color, vertical position and horizontal position;
a connected component joiner that joins connected components into words, based on statistics from groups comprising connected components and at least one of color, vertical position, horizontal position, height of connected components, or width of connected components;
a word organizer that organizes words by at least one of color, vertical position, horizontal position, or length, or combinations thereof; and
a word joiner that joins words into lines, based on statistics from groups comprising words and at least one of color, vertical position, horizontal position, height, width, angle, or length of words;
wherein after each joining operation, the statistics from groups comprising the connected components and from groups comprising words are re-computed and output as revised statistics, each joining operation comprises joining components into words and joining words into lines and producing a respective list of potential connected components and words to be revisited for further joining by the component joiner and word joiner utilizing the revised statistics;
wherein the word joiner provides the output to the connected component joiner to initiate further attempted joining of connected components based in part upon the list of potential connected components and upon the revised statistics from previously joined connected components and words;
wherein the layout analyzer outputs layout information to be used by an encoder to enable data compression and to enable reflow of connected components on an output device; and
wherein the output from the encoder is then stored for use by a user.
2 Assignments
0 Petitions
Accused Products
Abstract
A system and method facilitating layout analysis is provided. The invention includes a layout analyzer having a connected component organizer, a connected joiner, a word organizer and a word joiner. The invention provides for the connected component organizer to organize connected components based upon color, horizontal position and/or vertical position. The invention provides for the connected component joiner to join connected components based, at least in part, upon color, vertical position, horizontal position, a distance between the connected components, height of the connected components and/or width of the connected components. The word organizer organizes words and the word joiner joins words into lines. The joining of words into lines can cause the connected component joining to attempt to further join connected components into words.
-
Citations
26 Claims
-
1. A computer-implemented layout analyzer that is recorded on computer storage media and capable of execution by a computer, comprising the following computer-executable components:
-
a connected component organizer that organizes connected components by color, vertical position and horizontal position; a connected component joiner that joins connected components into words, based on statistics from groups comprising connected components and at least one of color, vertical position, horizontal position, height of connected components, or width of connected components; a word organizer that organizes words by at least one of color, vertical position, horizontal position, or length, or combinations thereof; and a word joiner that joins words into lines, based on statistics from groups comprising words and at least one of color, vertical position, horizontal position, height, width, angle, or length of words; wherein after each joining operation, the statistics from groups comprising the connected components and from groups comprising words are re-computed and output as revised statistics, each joining operation comprises joining components into words and joining words into lines and producing a respective list of potential connected components and words to be revisited for further joining by the component joiner and word joiner utilizing the revised statistics; wherein the word joiner provides the output to the connected component joiner to initiate further attempted joining of connected components based in part upon the list of potential connected components and upon the revised statistics from previously joined connected components and words; wherein the layout analyzer outputs layout information to be used by an encoder to enable data compression and to enable reflow of connected components on an output device; and wherein the output from the encoder is then stored for use by a user. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A method for layout analysis that is recorded on a computer storage media and capable of execution by a computer, comprising the following computer-executed acts:
-
organizing connected components by color, vertical position and horizontal position; joining connected components into at least one word based on at least one of color, vertical position and horizontal position, a distance between the connected components, height of the connected components or width of the connected components producing a list of potential connected components to be revisited for further joining into words; re-computing statistics after each joining operation; utilizing the statistics and the list of potential connected components to initiate further joining of the connected components; outputting layout information to be used by an encoder to enable data compression and to enable reflow of connected components on an output device; and storing the output information from the encoder for use by a user. - View Dependent Claims (18)
-
-
19. A layout analysis system that is recorded on a computer storage media and capable of execution by a computer, comprising the following computer-executable components:
-
a connected component store storing information associated with a plurality of connected components; and a layout analyzer comprising a connected component organizer adapted to organize connected components by at least one of color, vertical position or horizontal position, or combinations thereof, the layout analyzer further comprising a connected component joiner adapted to join connected components into words, based on statistics from groups comprising connected components and at least one of color, vertical position, horizontal position, height of connected components, or width of connected components the layout analyzer further comprising a word organizer adapted to organize words by at least one of color, vertical position, horizontal position, or length, or combinations thereof, the layout analyzer further comprising a word joiner adapted to join words into lines, based on statistics from groups comprising words and at least one of color, vertical position, horizontal position, height, width, angle, or length of words, wherein after each joining operation, the statistics are re-computed and output as revised statistics from groups comprising the connected components and from groups comprising words, each joining operation comprises joining components into words and joining words into lines and producing a respective list of potential connected components and words to be revisited for further joining by the component joiner and word joiner utilizing the revised statistics, wherein the word joiner provides the output to the connected component joiner to initiate further attempted joining of connected components based in part upon the list of potential connected components and upon the revised statistics from previously joined connected components and words, wherein the layout analyzer outputs layout information to be used by an encoder to enable data compression and to enable reflow of connected components on an output device, and wherein the output from the encoder is then stored for use by a user. - View Dependent Claims (20)
-
-
21. A method for layout analysis that is recorded on a computer storage media and capable of execution by a computer, comprising the following computer-executed acts:
-
organizing connected component information; joining connected components into words; producing a list of potential connected components to be revisited for further joining into words; joining words into a line; producing a list of potential words to be revisited for further joining into lines; re-computing statistics after joining each word into a line; utilizing the statistics to attempt further joining of connected components into words based, at least in part, upon revised statistics generated by the joining of words into lines and upon the list of potential connected components; joining lines into a block; re-computing statistics after joining each line into a block; utilizing the statistics to attempt further joining of connected components into words based, at least in part, upon revised statistics generated by the joining of lines into the block and upon the list of potential connected components; re-computing statistics after joining each connected component into a word; utilizing the statistics to attempt further joining of words into lines based, at least in part, upon revised statistics generated by the joining of lines into the block and upon the list of potential words; outputting layout information to be used by an encoder to enable data compression and to enable reflow of connected components on an output device; and storing the output information from the encoder for use by a user.
-
-
22. A segmented layered image system that is recorded on a computer storage media and capable of execution by a computer, comprising the following computer-executable components:
-
a mask separator adapted to assign pixels of a document image into at least one of a foreground and a background, the assignment information stored in a mask; a layout analyzer comprising a connected component organizer adapted to organize connected components by at least one of color, vertical position or horizontal position, or combinations thereof the layout analyzer further comprising a connected component joiner adapted to join connected components into words, based on statistics from groups comprising connected components and at least one of color, vertical position, horizontal position, height of connected components, or width of connected components, the layout analyzer further comprising a word organizer adapted to organize words by at least one of color, vertical position, horizontal position, or length, or combinations thereof, the layout analyzer further comprising a word joiner adapted to join words into lines, based on statistics from groups comprising words and at least one of color, vertical position, horizontal position, height, width, angle, or length of words, wherein after each joining operation, the statistics from groups comprising the connected components and from groups comprising words are re-computed and output as revised statistics, each joining operation comprises joining components into words and joining words into lines and producing a respective list of potential connected components and words to be revisited for further joining by the component joiner and word joiner utilizing the revised statistics, wherein the word joiner provides the output to the connected component joiner to initiate further attempted joining of connected components based in part upon the list of potential connected components and upon the revised statistics from previously joined connected components and words, wherein the layout analyzer outputs layout information to be used by an encoder to enable data compression and to enable reflow of connected components on an output device, and wherein the output from the encoder is then stored for use by a user; and a mask encoder operative to encode the mask into a mask bit stream.
-
-
23. A system for layout analysis that is recorded on a computer storage media and capable of execution by a computer, comprising:
a data packet adapted to be transmitted between two or more computer components that facilitates layout analysis, the data packet comprising; layout information comprising data identifying at least two connected components as joined to each other, joining being based, at least in part, upon at least one of color, vertical position, horizontal position, a distance between the connected components, height of the connected components or width of the connected components; wherein after each joining operation, the layout information is re-computed and output as revised statistics from groups comprising the connected components and from groups comprising words, each joining operation comprises joining components into words and joining words into lines and producing a respective list of potential connected components and words to be revisited for further joining utilizing the revised statistics; wherein providing the output to the connected component to initiate further attempted joining of connected components based in part upon the list of potential connected components and upon the revised statistics from previously joined connected components and words; wherein the layout analyzer outputs layout information to be used by an encoder to enable data compression and to enable reflow of connected components on an output device; and wherein the output from the encoder is then stored for use by a user.
-
24. A computer storage media storing computer executable components of a system for layout analysis, comprising:
-
a connected component organizer that organizes connected components by at least one of color, vertical position or horizontal position, or combinations thereof; a connected component joiner that joins connected components into words, based on statistics from groups comprising connected components and at least one of color, vertical position, horizontal position, height of connected components, or width of connected components; a word organizer that organizes words by at least one of color, vertical position, horizontal position, or length, or combinations thereof; and a word joiner that joins words into lines, based on statistics from groups comprising words and at least one of color, vertical position, horizontal position, height, width, angle, or length of words; wherein after each joining operation, the statistics from groups comprising the connected components and from groups comprising words are re-computed and output as revised statistics, each joining operation comprises joining components into words and joining words into lines and producing a respective list of potential connected components and words to be revisited for further joining by the component joiner and word joiner utilizing the revised statistics; wherein the word joiner provides the output to the connected component joiner to initiate further attempted joining of connected components based in part upon the list of potential connected components and upon the revised statistics from previously joined connected components and words; wherein the layout analyzer outputs layout information to be used by an encoder to enable data compression and to enable reflow of connected components on an output device; and wherein the output from the encoder is then stored for use by a user.
-
-
25. A method for layout analysis that is recorded on a computer storage media and capable of execution by a computer, comprising:
-
organizing connected component information; joining connected components into words; producing a list of potential connected components to be revisited for further joining into words; joining words into a line; producing a list of potential words to be revisited for further joining into lines; re-computing statistics after joining each word into a line; utilizing the statistics to attempt further joining of connected components into words based, at least in part, upon revised statistics generated by the joining of words into lines and upon the list of potential connected components; joining lines into a block; re-computing statistics after joining each line into a block; utilizing the statistics to attempt further joining of connected components into words based, at least in part, upon revised statistics generated by the joining of lines into the block and upon the list of potential connected components; re-computing statistics after joining each connected component into a word; utilizing the statistics to attempt further joining of words into lines based, at least in part, upon revised statistics generated by the joining of lines into the block and upon the list of potential words; outputting layout information to be used by an encoder to enable data compression and to enable reflow of connected components on an output device; and storing the output information from the encoder for use by a user.
-
-
26. A computer-implemented layout analyzer that is recorded on a computer storage media and capable of execution by a computer, comprising:
-
computer-implemented means for organizing connected components by at least one of color, vertical position or horizontal position, or combinations thereof; computer-implemented means for joining connected components into words, based on statistics from groups comprising connected components and at least one of color, vertical position, horizontal position, height of connected components, or width of connected components; computer-implemented means for organizing words by at least one of color, vertical position, horizontal position, or length, or combinations thereof; computer-implemented means for joining words into lines, based on statistics from groups comprising words and at least one of color, vertical position, horizontal position, height, width, angle, or length of words; wherein after each joining operation, the statistics from groups comprising the connected components and from groups comprising words change and are re-computed and output as revised statistics, each joining operation comprises joining components into words and joining words into lines and producing a respective list of potential connected components and words to be revisited for further joining by the component joiner and word joiner utilizing the revised statistics; and wherein the computer-implemented means for joining words provides the output to the computer-implemented means for joining connected components to initiate further attempted joining of connected components based in part upon the list of potential connected components and upon the revised statistics from previously joined connected components and words; computer-implemented means for outputting layout information to be used by an encoder to enable data compression and to enable reflow of connected components on an output device; and computer-implemented means for storing the output information from the encoder for use by a user.
-
Specification