Method and Apparatus for Extraction
Abstract
The present invention pertains to the field of computer software. More specifically, the present invention relates to one or more of the definition, extraction, delivery, and hyper-linking of clips, for example web clips.
79 Claims
1. A method of extraction, comprising:
accessing at least a first set of data of a first document, the first document including markup language, wherein the first set of data includes selected data, the selected data at least partly specifying document data;
accessing at least a second set of data of a second document, the second document including markup language;
finding one or more sets of corresponding data of the second set of data, each of one or more sets of corresponding data having a strength of correspondence to the selected data of the first set of data;
if two or more sets of corresponding data are found, then
1) if one of the corresponding sets of data has a substantially higher strength of correspondence than strengths of correspondence of the other corresponding sets of data, assigning a high measure of quality to the selection of the selected data, and
2) assigning a low measure of quality to the selection of the selected data, if at least one of:
2a) none of the corresponding sets of data has a substantially higher strength of correspondence than strengths of correspondence of the other corresponding sets of data, and
2b) if strengths of correspondence of all corresponding sets of data are low.
Dependent claims: 2-12
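As an illustration only, the quality assignment recited in claim 1 can be sketched in Python. The claim does not define "substantially higher" or "low" numerically, so the `margin` and `low_cutoff` parameters, the function name `selection_quality`, and the assumption that strengths are numbers where larger means a closer match are all hypothetical. The claim also does not say which branch governs when one candidate stands out but every strength is low; this sketch checks condition 2b first, which is an arbitrary choice.

```python
from typing import Optional, Sequence


def selection_quality(strengths: Sequence[float],
                      margin: float = 0.2,
                      low_cutoff: float = 0.3) -> Optional[str]:
    """Return "high" or "low" for two or more candidates, else None.

    `margin` and `low_cutoff` are hypothetical stand-ins for the claim's
    "substantially higher" and "low"; the claim gives no numeric values.
    """
    if len(strengths) < 2:
        return None  # the claim language only addresses two or more candidates

    ranked = sorted(strengths, reverse=True)
    best, runner_up = ranked[0], ranked[1]

    # Condition 2b: every candidate corresponds only weakly.
    if best < low_cutoff:
        return "low"
    # Condition 1: one candidate stands clearly above all the others.
    if best - runner_up >= margin:
        return "high"
    # Condition 2a: no candidate is substantially stronger than the rest.
    return "low"
```

Under these assumed thresholds, `selection_quality([0.9, 0.3, 0.2])` yields `"high"` because the best candidate leads by more than the margin, while `selection_quality([0.8, 0.75])` yields `"low"` because the two candidates are nearly tied.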
13. A method of extraction, comprising:
accessing at least a first set of data of a first document, the first document including markup language, wherein the first set of data includes selected data, the selected data at least partly specifying document data;
accessing at least a second set of data of a second document, the second document including markup language;
finding one or more sets of corresponding data of the second set of data, each of one or more sets of corresponding data having a strength of correspondence to the selected data of the first set of data, the strength of correspondence at least partly determined by an edit sequence between at least part of the second set of data and at least part of the first set of data, the edit sequence including any of insertions, deletions, substitutions, matches, and repetitions, including:
considering at least repetitions for inclusion in the edit sequence between at least part of the second set of data and at least part of the first set of data;
if two or more sets of corresponding data are found, then
1) if one of the corresponding sets of data has a substantially higher strength of correspondence than strengths of correspondence of the other corresponding sets of data, assigning a high measure of quality to the selection of the selected data, and
2) if none of the corresponding sets of data has a substantially higher strength of correspondence than strengths of correspondence of the other corresponding sets of data, assigning a low measure of quality to the selection of the selected data.
Dependent claims: 14-46
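To make the edit sequence of claim 13 concrete, the following Python sketch aligns two flat token sequences using the five operations the claim names. It is an assumption-laden illustration, not the patented method: the function name `edit_sequence`, the unit costs, the discounted `rep_cost`, and the rule that a "repetition" is a token in the second sequence equal to the token immediately before it are all hypothetical choices made for this example.

```python
from typing import List, Sequence, Tuple


def edit_sequence(first: Sequence[str], second: Sequence[str],
                  rep_cost: float = 0.25) -> Tuple[float, List[str]]:
    """Cheapest edit sequence turning `first` into `second` (illustrative costs)."""
    n, m = len(first), len(second)
    # cost[i][j]: cheapest way to turn first[:i] into second[:j]
    cost = [[0.0] * (m + 1) for _ in range(n + 1)]
    step = [[""] * (m + 1) for _ in range(n + 1)]

    for i in range(n + 1):
        for j in range(m + 1):
            if i == 0 and j == 0:
                continue
            options = []
            if i > 0 and j > 0:
                same = first[i - 1] == second[j - 1]
                options.append((cost[i - 1][j - 1] + (0.0 if same else 1.0),
                                "match" if same else "substitute"))
            if i > 0:
                options.append((cost[i - 1][j] + 1.0, "delete"))
            if j > 0:
                options.append((cost[i][j - 1] + 1.0, "insert"))
            # Assumed repetition rule: the new token repeats its predecessor
            # in `second`, so it is charged less than a plain insertion.
            if j > 1 and second[j - 1] == second[j - 2]:
                options.append((cost[i][j - 1] + rep_cost, "repeat"))
            cost[i][j], step[i][j] = min(options)

    # Walk back from the end to recover the operations in order.
    ops: List[str] = []
    i, j = n, m
    while i > 0 or j > 0:
        op = step[i][j]
        ops.append(op)
        if op in ("match", "substitute"):
            i, j = i - 1, j - 1
        elif op == "delete":
            i -= 1
        else:  # "insert" or "repeat" consumes one token of `second`
            j -= 1
    ops.reverse()
    return cost[n][m], ops
```

For example, aligning `["h1", "li", "p"]` with `["h1", "li", "li", "li", "p"]` treats the two extra `li` tokens as repetitions rather than unrelated insertions, so a list that merely grew longer still scores as a close correspondence. A strength of correspondence could then be derived from the cost, for instance by normalizing against sequence length, though the claim does not specify how.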
47. A method of extraction, comprising:
accessing at least a first set of data of a first document, the first document including markup language, wherein the first set of data includes selected data, the selected data at least partly specifying document data;
accessing at least a second set of data of a second document, the second document including markup language;
finding one or more sets of corresponding data of the second set of data, each of one or more sets of corresponding data having a strength of correspondence to the selected data of the first set of data, the strength of correspondence at least partly determined by a tree-based edit sequence between at least part of the second set of data and at least part of the first set of data, the tree-based edit sequence including any of insertions, deletions, substitutions, matches, and repetitions, including:
considering at least repetitions for inclusion in the tree-based edit sequence between at least part of the second set of data and at least part of the first set of data;
if two or more sets of corresponding data are found, then
1) if one of the corresponding sets of data has a substantially higher strength of correspondence than strengths of correspondence of the other corresponding sets of data, assigning a high measure of quality to the selection of the selected data, and
2) if none of the corresponding sets of data has a substantially higher strength of correspondence than strengths of correspondence of the other corresponding sets of data, assigning a low measure of quality to the selection of the selected data.
Dependent claims: 48-79
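Claim 47 moves the comparison from flat sequences to trees, as in a parsed markup document. The Python sketch below is a simplified, restricted top-down tree edit cost, offered only as an illustration: it returns the cost of the cheapest edit sequence rather than the sequence itself, compares element tags only, and charges whole-subtree insertions and deletions by node count. The `(tag, children)` tuple representation, the names `size` and `tree_cost`, the `rep_cost` discount, and the rule that a "repetition" is a child subtree identical to its preceding sibling are all hypothetical.

```python
from typing import List, Tuple

Node = Tuple[str, list]  # hypothetical representation: (tag, list of child Nodes)


def size(node: Node) -> int:
    """Number of nodes in the subtree rooted at `node`."""
    return 1 + sum(size(child) for child in node[1])


def tree_cost(a: Node, b: Node, rep_cost: float = 0.25) -> float:
    """Cost of the cheapest top-down edit sequence turning tree `a` into `b`."""
    root = 0.0 if a[0] == b[0] else 1.0  # match or substitute the root tag
    xs, ys = a[1], b[1]
    n, m = len(xs), len(ys)

    def repeats(j: int) -> bool:
        # Assumed repetition rule: child j of `b` equals its previous sibling.
        return j > 1 and ys[j - 1] == ys[j - 2]

    # d[i][j]: cheapest way to turn children xs[:i] into children ys[:j]
    d: List[List[float]] = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        d[i][0] = d[i - 1][0] + size(xs[i - 1])          # delete whole subtree
    for j in range(1, m + 1):
        d[0][j] = d[0][j - 1] + size(ys[j - 1])          # insert whole subtree
        if repeats(j):
            d[0][j] = min(d[0][j], d[0][j - 1] + rep_cost)

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            best = min(
                d[i - 1][j - 1] + tree_cost(xs[i - 1], ys[j - 1], rep_cost),
                d[i - 1][j] + size(xs[i - 1]),
                d[i][j - 1] + size(ys[j - 1]),
            )
            if repeats(j):
                best = min(best, d[i][j - 1] + rep_cost)  # discounted repetition
            d[i][j] = best
    return root + d[n][m]
```

With this sketch, a `ul` holding one `li` subtree compared against a `ul` holding three identical `li` subtrees costs two discounted repetitions instead of two full subtree insertions. Full tree edit distance algorithms handle more general mappings; this restricted form is used here only to keep the example short.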
Specification