Method and apparatus for extraction
First Claim
Patent Images
1. A method of extracting a clip of a web page on a remote computer system, the method comprising:
- prior to the extraction, receiving, in a computer system, selection of a view, wherein the view comprises definition of the clip of the web page including markup language;
storing data from a portion of the web page defining the view in a view repository database;
automatically accessing from the view repository database, at least the data from the portion of the web page defining the view;
automatically accessing, over a computer network, new data from the web page, which has changed, on the remote computer system from which the clip of the web page is to be extracted;
in an extraction engine in a computer system, finding one or more sets of corresponding data of the new data from the web page which has changed, each of one or more sets of corresponding data of the new data from the web page which has changed having a strength of correspondence to the data, accessed from the view repository database, from the portion of the web page defining the view;
determining whether (a) two or more sets of corresponding data of the new data from the web page which has changed are found and (b) whether one of the corresponding sets of data of the new data from the web which has changed has a substantially higher strength of correspondence to the data accessed from the view repository database, from the portion of the web page defining the view, than strengths of correspondence of the other corresponding sets of data of the new data from the web page from which has changed;
if two or more sets of corresponding data of the new data from the web page which has changed are found, then
1) if one of the corresponding sets of data of the new data from the web which has changed has a substantially higher strength of correspondence to the data accessed from the view repository database, from the portion of the web page defining the view, than strengths of correspondence of the other corresponding sets of data of the new data from the web page from which has changed, then assigning a high measure of quality to the selection of the corresponding data having a higher strength of correspondence to the data accessed from the view repository database, from the portion of the web page defining the view, and
2) assigning a low measure of quality to the selection of the corresponding data of the new data from the web page which has changed, if at least one of;
2a) none of the corresponding sets of data of the new data from the web page which has changed has a substantially higher strength of correspondence than strengths of correspondence of the other corresponding sets of data of the new data from the web page which has changed, and 2b) if strengths of correspondence of all corresponding sets of data of the new data from the web page which has changed are low;
if the high measure of quality is assigned to one of the corresponding sets of data of the new data from the web page which has changed, then extracting the corresponding set of data from the web page which has changed, as the clip of the web page having the defined clip of the web page;
transmitting, over a computer network to an electronic display device, the extracted corresponding set of new data from the web page which has changed, as the clip of the web page having the defined clip of the web page, whereby a graphical display of the electronic display device renders the extracted corresponding set of new data, as the clip of the web page having the defined clip of the web page.
7 Assignments
0 Petitions
Accused Products
Abstract
The present invention pertains to the field of computer software. More specifically, the present invention relates to one or more of the definition, extraction, delivery, and hyper-linking of clips, for example web clips.
92 Citations
42 Claims
-
1. A method of extracting a clip of a web page on a remote computer system, the method comprising:
-
prior to the extraction, receiving, in a computer system, selection of a view, wherein the view comprises definition of the clip of the web page including markup language; storing data from a portion of the web page defining the view in a view repository database; automatically accessing from the view repository database, at least the data from the portion of the web page defining the view; automatically accessing, over a computer network, new data from the web page, which has changed, on the remote computer system from which the clip of the web page is to be extracted; in an extraction engine in a computer system, finding one or more sets of corresponding data of the new data from the web page which has changed, each of one or more sets of corresponding data of the new data from the web page which has changed having a strength of correspondence to the data, accessed from the view repository database, from the portion of the web page defining the view; determining whether (a) two or more sets of corresponding data of the new data from the web page which has changed are found and (b) whether one of the corresponding sets of data of the new data from the web which has changed has a substantially higher strength of correspondence to the data accessed from the view repository database, from the portion of the web page defining the view, than strengths of correspondence of the other corresponding sets of data of the new data from the web page from which has changed; if two or more sets of corresponding data of the new data from the web page which has changed are found, then
1) if one of the corresponding sets of data of the new data from the web which has changed has a substantially higher strength of correspondence to the data accessed from the view repository database, from the portion of the web page defining the view, than strengths of correspondence of the other corresponding sets of data of the new data from the web page from which has changed, then assigning a high measure of quality to the selection of the corresponding data having a higher strength of correspondence to the data accessed from the view repository database, from the portion of the web page defining the view, and
2) assigning a low measure of quality to the selection of the corresponding data of the new data from the web page which has changed, if at least one of;
2a) none of the corresponding sets of data of the new data from the web page which has changed has a substantially higher strength of correspondence than strengths of correspondence of the other corresponding sets of data of the new data from the web page which has changed, and 2b) if strengths of correspondence of all corresponding sets of data of the new data from the web page which has changed are low;if the high measure of quality is assigned to one of the corresponding sets of data of the new data from the web page which has changed, then extracting the corresponding set of data from the web page which has changed, as the clip of the web page having the defined clip of the web page; transmitting, over a computer network to an electronic display device, the extracted corresponding set of new data from the web page which has changed, as the clip of the web page having the defined clip of the web page, whereby a graphical display of the electronic display device renders the extracted corresponding set of new data, as the clip of the web page having the defined clip of the web page.
-
-
2. A method of extracting a clip of a web page on a remote computer system, the method comprising:
-
prior to the extraction, receiving, in a computer system, selection of a view, wherein the view comprises definition of the clip of the web page including markup language; storing data from a portion of the web page defining the view in a view repository database; automatically accessing from the view repository database, at least the data from the portion of the web page defining the view; automatically accessing, over a computer network, new data from the web page, which has changed, on the remote computer system from which the clip of the web page is to be extracted; in an extraction engine in a computer system, finding one or more sets of corresponding data of the new data from the web page which has changed, each of one or more sets of corresponding data of the new data from the web page which has changed having a strength of correspondence to the data, accessed from the view repository database from the portion of the web page defining the view, the strength of correspondence at least partly determined by an edit sequence between at least part of the new data from the web page which has changed and at least part of the data, accessed from the view repository database, from the portion of the web page defining the view, the edit sequence including any of insertions, deletions, substitutions, matches, and repetitions, including; considering at least repetitions for inclusion in the edit sequence between at least part of the new data from the web page which has changed and at least part of the data accessed from the view repository database, from the portion of the web page defining the view; and if two or more sets of corresponding data of the new data from the web page which has changed are found, then
1) if one of the corresponding sets of data of the new data from the web which has changed has a substantially higher strength of correspondence to the data accessed from the view repository database, from the portion of the web page defining the view than strengths of correspondence of the other corresponding sets of data of the new data from the web page from which has changed, then assigning a high measure of quality to the selection of the corresponding data having a higher strength of correspondence to the data accessed from the view repository database, from the portion of the web page defining the view, and
2) if none of the corresponding sets of data has a substantially higher strength of correspondence than strengths of correspondence of the other corresponding sets of data of the new data from the web page which has changed, assigning a low measure of quality to the selection of the selected data of the new data from the web page which has changed;if the high measure of quality is assigned to one of the corresponding sets of data of the new data from the web page which has changed, then extracting the corresponding set of data from the web page which has changed, as the clip of the web page having the defined clip of the web page; and transmitting, over a computer network to an electronic display device, the extracted corresponding set of new data from the web page which has changed, as the clip of the web page having the defined clip of the web page, whereby a graphical display of the electronic display device renders the extracted corresponding set of new data, as the clip of the web page having the defined clip of the web page. - View Dependent Claims (3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21)
-
-
22. A method of extracting a clip of a web page on a remote computer system, the method comprising:
-
prior to the extraction, receiving, in a computer system, selection of a view, wherein the view comprises definition of the clip of the web page including markup language; storing data from a portion of the web page defining the view in a view repository database; automatically accessing from the view repository database, at least the data from the portion of the web page defining the view; automatically accessing, over a computer network, new data from the web page, which has changed, on the remote computer system from which the clip of the web page is to be extracted; in an extraction engine in a computer system, finding one or more sets of corresponding data of the new data from the web page which has changed, each of one or more sets of corresponding data of the new data from the web page which has changed having a strength of correspondence to the data, accessed from the view repository database, from the portion of the web page defining the view, the strength of correspondence at least partly determined by a tree-based edit sequence between at least part of the new data from the web page which has changed and at least part of the data, accessed from the view repository database, from the portion of the web page defining the view, the tree-based edit sequence including any of insertions, deletions, substitutions, matches, and repetitions, including; considering at least repetitions for inclusion in the tree-based edit sequence between at least part of the new data from the web page which has changed and at least part of the data accessed from the view repository database, from the portion of the web page defining the view; if two or more sets of corresponding data of the new data from the web page which has changed are found, then
1) if one of the corresponding sets of data of the new data from the web which has changed has a substantially higher strength of correspondence than strengths of correspondence of the other corresponding sets of data of the new data from the web page from which has changed, then assigning a high measure of quality to the selection of the corresponding data having a higher strength of correspondence to the data accessed from the view repository database, from the portion of the web page defining the view, and
2) if none of the corresponding sets of data of the new data from the web page which has changed has a substantially higher strength of correspondence than strengths of correspondence of the other corresponding sets of data of the new data from the web page which has changed, assigning a low measure of quality to the selection of the selected data of the new data from the web page which has changed. - View Dependent Claims (23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40)
-
-
41. A computer system comprising at least a computer, the computer system for extracting a clip of a web page on a remote computer system, the computer system comprising:
-
a view repository database for storing data from a portion of the web page defining a view, wherein the view comprises definition of the clip of the web page including markup language; an engine, in a server coupled to the view repository database, that prior to the extraction, receives a selection of the view and stores data from a portion of the web page defining the view in a view repository database; an engine, in a server coupled to the view repository database, that automatically accesses from the view repository database, at least the data from the portion of the web page defining the view; an engine, in a server coupled to the view repository database, that automatically accesses, over a computer network, new data from the web page, which has changed, on the remote computer system from which the clip of the web page is to be extracted; and an extraction engine, in a server coupled to the view repository database, that finds one or more sets of corresponding data of the new data from the web page which has changed, each of one or more sets of corresponding data of the new data from the web page which has changed having a strength of correspondence to the data, accessed from the view repository database, from the portion of the web page defining the view; if two or more sets of corresponding data of the new data from the web page which has changed are found, then
1) if one of the corresponding sets of data of the new data from the web which has changed has a substantially higher strength of correspondence to the data accessed from the view repository database, from the portion of the web page defining the view, than strengths of correspondence of the other corresponding sets of data of the new data from the web page from which has changed, then assigns a high measure of quality to the selection of the corresponding data having a higher strength of correspondence to the data accessed from the view repository database, from the portion of the web page defining the view, and
2) assigns a low measure of quality to the selection of the corresponding data of the new data from the web page which has changed, if at least one of;
2a) none of the corresponding sets of data of the new data from the web page which has changed has a substantially higher strength of correspondence than strengths of correspondence of the other corresponding sets of data of the new data from the web page which has changed, and 2b) if strengths of correspondence of all corresponding sets of data of the new data from the web page which has changed are low;if the high measure of quality is assigned to one of the corresponding sets of data of the new data from the web page which has changed, then extracts the corresponding set of data from the web page which has changed, as the clip of the web page having the defined clip of the web page; transmits, over a computer network to an electronic display device, the extracted corresponding set of new data from the web page which has changed, as the clip of the web page having the defined clip of the web page, whereby a graphical display of the electronic display device renders the extracted corresponding set of new data, as the clip of the web page having the defined clip of the web page.
-
-
42. A computer system comprising at least a computer, the computer system for extracting a clip of a web page on a remote computer system, the computer system comprising:
-
a view repository database for storing data from a portion of the web page defining a view, wherein the view comprises definition of the clip of the web page including markup language; an engine, in a server coupled to the view repository database, that prior to the extraction, receives a selection of the view and stores data from a portion of the web page defining the view in a view repository database; an engine, in a server coupled to the view repository database, that automatically accesses from the view repository database, at least the data from the portion of the web page defining the view; an engine, in a server coupled to the view repository database, that automatically accesses, over a computer network, new data from the web page, which has changed, on the remote computer system from which the clip of the web page is to be extracted; and an extraction engine, in a server coupled to the view repository database, that finds one or more sets of corresponding data of the new data from the web page which has changed, each of one or more sets of corresponding data of the new data from the web page which has changed having a strength of correspondence to the data, accessed from the view repository database, from the portion of the web page defining the view, the strength of correspondence at least partly determined by an edit sequence between at least part of the new data from the web page which has changed and at least part of the data, accessed from the view repository database, from the portion of the web page defining the view, the edit sequence including any of insertions, deletions, substitutions, matches, and repetitions, including; considering at least repetitions for inclusion in the edit sequence between at least part of the new data from the web page which has changed and at least part of the data accessed from the view repository database, from the portion of the web page defining the view; and if two or more sets of corresponding data of the new data from the web page which has changed are found, then
1) if one of the corresponding sets of data of the new data from the web which has changed has a substantially higher strength of correspondence to the data accessed from the view repository database, from the portion of the web page defining the view than strengths of correspondence of the other corresponding sets of data of the new data from the web page from which has changed, then assigning a high measure of quality to the selection of the corresponding data having a higher strength of correspondence to the data accessed from the view repository database, from the portion of the web page defining the view, and
2) if none of the corresponding sets of data has a substantially higher strength of correspondence than strengths of correspondence of the other corresponding sets of data of the new data from the web page which has changed, assigning a low measure of quality to the selection of the selected data of the new data from the web page which has changed;if the high measure of quality is assigned to one of the corresponding sets of data of the new data from the web page which has changed, then extracts the corresponding set of data from the web page which has changed, as the clip of the web page having the defined clip of the web page; and transmits, over a computer network to an electronic display device, the extracted corresponding set of new data from the web page which has changed, as the clip of the web page having the defined clip of the web page, whereby a graphical display of the electronic display device renders the extracted corresponding set of new data, as the clip of the web page having the defined clip of the web page.
-
Specification