×

Systems and methods for authoritativeness grading, estimation and sorting of documents in large heterogeneous document collections

  • US 20030221166A1
  • Filed: 09/03/2002
  • Published: 11/27/2003
  • Est. Priority Date: 05/17/2002
  • Status: Abandoned Application
First Claim
Patent Images

1. A method for creating a document textual authority model used to determine an authority of a document having a plurality of document content features, the method comprising:

  • determining, for each document in a set of documents, a set of document classification attributes;

    applying a document attribute evaluation framework to each document in the set of documents to determine a textual authoritativeness value or a textual authority class for the document;

    selecting a subset of document content features from the plurality of document content features; and

    encoding the subset of document content features into a feature vector x; and

    determining a predictive model used to assign the feature vector x to an authority rank or class.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×