Method and apparatus for extraction

  • US 7,934,152 B2
  • Filed: 06/28/2006
  • Issued: 04/26/2011
  • Est. Priority Date: 08/18/2000
  • Status: Active Grant
First Claim

1. A method of extracting a clip of a web page on a remote computer system, the method comprising:

  • prior to the extraction, receiving, in a computer system, selection of a view, wherein the view comprises definition of the clip of the web page including markup language;

    storing data from a portion of the web page defining the view in a view repository database;

    automatically accessing from the view repository database, at least the data from the portion of the web page defining the view;

    automatically accessing, over a computer network, new data from the web page, which has changed, on the remote computer system from which the clip of the web page is to be extracted;

    in an extraction engine in a computer system, finding one or more sets of corresponding data of the new data from the web page which has changed, each of one or more sets of corresponding data of the new data from the web page which has changed having a strength of correspondence to the data, accessed from the view repository database, from the portion of the web page defining the view;

    determining whether (a) two or more sets of corresponding data of the new data from the web page which has changed are found and (b) one of the corresponding sets of data of the new data from the web page which has changed has a substantially higher strength of correspondence to the data accessed from the view repository database, from the portion of the web page defining the view, than strengths of correspondence of the other corresponding sets of data of the new data from the web page which has changed;

    if two or more sets of corresponding data of the new data from the web page which has changed are found, then

         1) if one of the corresponding sets of data of the new data from the web page which has changed has a substantially higher strength of correspondence to the data accessed from the view repository database, from the portion of the web page defining the view, than strengths of correspondence of the other corresponding sets of data of the new data from the web page which has changed, then assigning a high measure of quality to the selection of the corresponding data having a higher strength of correspondence to the data accessed from the view repository database, from the portion of the web page defining the view, and

         2) assigning a low measure of quality to the selection of the corresponding data of the new data from the web page which has changed, if at least one of:

              2a) none of the corresponding sets of data of the new data from the web page which has changed has a substantially higher strength of correspondence than strengths of correspondence of the other corresponding sets of data of the new data from the web page which has changed, and

              2b) if strengths of correspondence of all corresponding sets of data of the new data from the web page which has changed are low;

    if the high measure of quality is assigned to one of the corresponding sets of data of the new data from the web page which has changed, then extracting the corresponding set of data from the web page which has changed, as the clip of the web page having the defined clip of the web page;

    transmitting, over a computer network to an electronic display device, the extracted corresponding set of new data from the web page which has changed, as the clip of the web page having the defined clip of the web page, whereby a graphical display of the electronic display device renders the extracted corresponding set of new data, as the clip of the web page having the defined clip of the web page.
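
The quality-assignment steps of the claim (finding corresponding sets, comparing their strengths of correspondence, and extracting only when a high measure of quality is assigned) can be illustrated with a minimal Python sketch. It is not the patented implementation. The token-overlap metric, the names `strength` and `select_clip`, and the thresholds `LOW_STRENGTH` and `MARGIN` are all hypothetical, since the claim gives no metric or numeric values. The sketch also assumes that condition 2b overrides condition 1 when both hold, which the claim leaves open.

```python
from __future__ import annotations

# Hypothetical thresholds: the claim gives no numeric values.
LOW_STRENGTH = 0.5   # a best match below this counts as "low"
MARGIN = 0.2         # lead needed to count as "substantially higher"


def strength(stored: str, candidate: str) -> float:
    """Jaccard overlap of whitespace tokens, in [0, 1] (an assumed metric)."""
    a, b = set(stored.split()), set(candidate.split())
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def select_clip(stored: str, candidates: list[str]) -> tuple[str | None, str]:
    """Return (clip, quality); clip is None when quality is 'low'."""
    if len(candidates) < 2:
        # The claimed test is stated only for two or more corresponding sets.
        raise ValueError("expected two or more candidate sets")
    # Rank candidates by strength of correspondence, strongest first.
    scored = sorted(((strength(stored, c), c) for c in candidates),
                    key=lambda pair: pair[0], reverse=True)
    best_strength, best = scored[0]
    runner_up_strength = scored[1][0]
    clear_winner = best_strength - runner_up_strength >= MARGIN  # step 1
    all_low = best_strength < LOW_STRENGTH                       # step 2b
    if clear_winner and not all_low:
        return best, "high"   # extract this set as the clip
    return None, "low"        # steps 2a / 2b: do not extract


stored_view = "Weather Seattle Sunny 72F"
changed_page_sets = ["Weather Seattle Rainy 72F", "Sports Scores Seattle 3-1"]
clip, quality = select_clip(stored_view, changed_page_sets)
print(clip, quality)  # Weather Seattle Rainy 72F high
```

In this example the first candidate shares three of five distinct tokens with the stored view (strength 0.6), while the second shares one of seven, so the first is a clear winner and receives the high measure of quality.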
