×

Self-improving system and method for classifying pages on the world wide web

  • US 20030225763A1
  • Filed: 04/14/2003
  • Published: 12/04/2003
  • Est. Priority Date: 04/15/2002
  • Status: Abandoned Application
First Claim
Patent Images

1. A method of categorizing documents comprising:

  • locating a plurality of documents to be categorized;

    extracting textual and contextual features from within each of the documents;

    identifying untrustworthy documents from the extracted features, said untrustworthy documents being eliminated from the plurality of documents to be categorized;

    evaluating each of the documents according to one or more of the extracted textual and contextual features;

    identifying lists of documents from the evaluated documents relating to a topic in response to a user query relating to the topic; and

    identifying documents within the identified lists relating to the topic.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×