Selection of text in an unstructured document
First Claim
1. A non-transitory computer readable medium storing a computer program which when executed by at least one processor defines a selection of text in a document, the computer program comprising sets of instructions for:
receiving an unstructured document comprising a plurality of unassociated glyphs;
associating sets of glyphs into a plurality of columns;
creating a structured document from the unstructured document comprising a plurality of layouts and a flow of reading through the plurality of layouts, each particular layout comprising at least one of the plurality of columns and a reading order for the columns within the particular layout;
displaying the structured document;
receiving a start point in a first column in a first layout and an end point in a second column in a second, different layout, the second layout having an order value after the first layout; and
defining a selection of text from the start point to the end point by using the structured document, the selection including (i) all glyphs after the start point in the first column in the first layout according to the reading order for the columns in the first layout, (ii) all glyphs of the columns of the intervening layouts between the first and second layouts according to the flow of reading through the layouts, and (iii) all glyphs that precede the end point in the second layout according to the reading order for the columns in the second layout.
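The selection rule in the last step can be illustrated with a minimal Python sketch. It is not the patented implementation. The `Layout` class, the `(layout, column, glyph)` index triple used as a position, and the function `select_text` are all hypothetical names chosen for illustration, and each column is modeled simply as a string of glyphs already in reading order. The sketch also reads part (i) as covering the rest of the start column plus the later columns of the first layout, which is an interpretation of the claim language.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Layout:
    # Columns in reading order; each string holds one column's glyphs in order.
    # (Hypothetical model: a real structured document would carry geometry too.)
    columns: List[str]


# Hypothetical addressing: (layout index, column index, glyph index).
Position = Tuple[int, int, int]


def select_text(layouts: List[Layout], start: Position, end: Position) -> str:
    """Collect glyphs from start to end across layouts, following reading order."""
    (sl, sc, sg), (el, ec, eg) = start, end
    if sl >= el:
        raise ValueError("sketch only covers an end layout ordered after the start layout")

    first, last = layouts[sl], layouts[el]

    # (i) glyphs after the start point: rest of the start column,
    #     then the later columns of the first layout.
    parts = [first.columns[sc][sg:]]
    parts.extend(first.columns[sc + 1:])

    # (ii) every column of each intervening layout, in the flow of reading.
    for layout in layouts[sl + 1:el]:
        parts.extend(layout.columns)

    # (iii) glyphs preceding the end point: earlier columns of the last layout,
    #       then the end column up to the end point.
    parts.extend(last.columns[:ec])
    parts.append(last.columns[ec][:eg])

    return "".join(parts)
```

For example, with three layouts `[Layout(["abc", "def"]), Layout(["ghi"]), Layout(["jkl", "mno"])]`, a start at `(0, 0, 1)` and an end at `(2, 1, 2)` yield `"bcdefghijklmn"`.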
Abstract
Some embodiments provide a method for defining a selection of text in an unstructured document that includes a number of glyphs. The method identifies associated sets of glyphs and a reading order that specifies a flow of reading through the glyphs. The method displays the document. The method receives a start point and end point for a selection of text within the displayed document. The method defines a selection of text from the start point to the end point by using the identified sets of glyphs and intended flow of reading.
24 Claims
1. A non-transitory computer readable medium storing a computer program which when executed by at least one processor defines a selection of text in a document, the computer program comprising sets of instructions for:
receiving an unstructured document comprising a plurality of unassociated glyphs;
associating sets of glyphs into a plurality of columns;
creating a structured document from the unstructured document comprising a plurality of layouts and a flow of reading through the plurality of layouts, each particular layout comprising at least one of the plurality of columns and a reading order for the columns within the particular layout;
displaying the structured document;
receiving a start point in a first column in a first layout and an end point in a second column in a second, different layout, the second layout having an order value after the first layout; and
defining a selection of text from the start point to the end point by using the structured document, the selection including (i) all glyphs after the start point in the first column in the first layout according to the reading order for the columns in the first layout, (ii) all glyphs of the columns of the intervening layouts between the first and second layouts according to the flow of reading through the layouts, and (iii) all glyphs that precede the end point in the second layout according to the reading order for the columns in the second layout.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13
14. A non-transitory computer readable medium storing a computer program for execution by at least one processor, the computer program comprising sets of instructions for:
analyzing an unstructured document that comprises a plurality of primitive elements to create a structured document from the unstructured document as a hierarchical model of the unstructured document, the structured document comprising a plurality of layouts and a reading order through the layouts, the layouts comprising separate ordered sequences of text columns;
displaying the structured document;
receiving a start point in a first layout and an initial end point in a second, different layout for a selection of text in the displayed structured document;
automatically moving the initial end point to a new end point within the first layout based on positions of the start point and the initial end point in order for the start point and the end point to be in the same layout; and
selecting text in the structured document from the start point to the new end point by using the reading order in the first layout.
Dependent claims: 15, 16, 17, 18
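The claim does not say how the new end point is computed from the two positions, so the following Python sketch shows only one plausible rule, under stated assumptions: each layout has a rectangular bounding box, y coordinates grow downward, and an end point outside the first layout snaps to that layout's last corner when it lies at or below the start point and to its first corner otherwise. `Point`, `Rect`, and `move_end_into_layout` are hypothetical names, not taken from the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Point:
    x: float
    y: float


@dataclass(frozen=True)
class Rect:
    left: float
    top: float
    right: float
    bottom: float

    def contains(self, p: Point) -> bool:
        return self.left <= p.x <= self.right and self.top <= p.y <= self.bottom


def move_end_into_layout(start: Point, initial_end: Point, first_layout: Rect) -> Point:
    """Return an end point inside first_layout (assumed rule, y grows downward)."""
    if first_layout.contains(initial_end):
        # Already in the same layout as the start point: nothing to move.
        return initial_end
    if initial_end.y >= start.y:
        # Dragging downward or level: extend the selection to the layout's end.
        return Point(first_layout.right, first_layout.bottom)
    # Dragging upward: extend the selection back to the layout's beginning.
    return Point(first_layout.left, first_layout.top)
```

Once both points sit in the same layout, the text between them can be selected using that layout's own reading order.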
19. A method for defining a selection of text in a document, the method comprising:
receiving an unstructured document comprising a plurality of unassociated glyphs;
associating sets of glyphs in a plurality of layouts, the layouts comprising separate ordered sequences of text columns;
identifying a reading order that specifies a flow of reading through the glyphs;
creating a structured document from the unstructured document, the structured document comprising the plurality of layouts and the reading order through the glyphs;
displaying the structured document;
receiving a start point in a first layout and an initial end point in the structured document for a selection of text within the displayed structured document;
calculating a line from the start point to the initial end point;
automatically selecting a new end point along the line; and
defining a selection of text from the start point to the new end point by using the identified sets of glyphs and the intended flow of reading.
Dependent claims: 20, 21, 22, 23, 24
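The claim leaves open which point along the line becomes the new end point. As a hedged illustration, the Python sketch below assumes the new end point is where the line from the start point first leaves the bounding rectangle of the start layout, or the initial end point itself if the line never leaves it. The function name `new_end_along_line`, the tuple-based points, and the `(left, top, right, bottom)` rectangle are hypothetical, and the start point is assumed to lie inside the rectangle.

```python
from typing import Tuple

Point = Tuple[float, float]
Rect = Tuple[float, float, float, float]  # (left, top, right, bottom)


def new_end_along_line(start: Point, initial_end: Point, layout: Rect) -> Point:
    """Walk the line start -> initial_end and stop at the layout boundary.

    Assumes start lies inside layout; the exit rule itself is an assumption.
    """
    (sx, sy), (ex, ey) = start, initial_end
    left, top, right, bottom = layout
    dx, dy = ex - sx, ey - sy

    # t is the fraction of the line we may travel before crossing an edge;
    # t = 1.0 means the initial end point is reachable without leaving.
    t = 1.0
    if dx > 0:
        t = min(t, (right - sx) / dx)
    elif dx < 0:
        t = min(t, (left - sx) / dx)
    if dy > 0:
        t = min(t, (bottom - sy) / dy)
    elif dy < 0:
        t = min(t, (top - sy) / dy)

    return (sx + t * dx, sy + t * dy)
```

With a layout of `(0, 0, 100, 100)` and a start point at `(50, 50)`, an initial end point at `(50, 150)` is pulled back to `(50, 100)`, the point where the line crosses the bottom edge.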
Specification