×

METHOD AND SYSTEM FOR COLLECTING AND RETRIEVING INFORMATION FROM WEB SITES

  • US 20080147631A1
  • Filed: 12/14/2006
  • Published: 06/19/2008
  • Est. Priority Date: 12/14/2006
  • Status: Abandoned Application
First Claim
Patent Images

1. A method for collecting and retrieving information from Web sites, the method comprising:

  • acquiring a set of Web pages;

    for each Web page in the set of Web pages;

    analyzing the Web page for data artifacts;

    classifying each data artifact on the Web page as one of a predetermined set of types; and

    indexing and organizing in at least one data structure each classified data artifact, each indexed and organized data artifact in the at least one data structure being associated with a subject, all indexed and organized data artifacts that are associated with a non-unique subject being associated with a single subject entry;

    receiving a query indicating a particular subject to be searched;

    retrieving search results from the at least one data structure, the search results including a set of data artifacts associated with the particular subject; and

    displaying at least some of the search results, the displayed data artifacts in the search results being grouped in accordance with their respective types, the displayed data artifacts in the search results within each type being listed in descending order of relevance to the particular subject.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×