×

Automatic expert identification, ranking and literature search based on authorship in large document collections

  • US 8,280,882 B2
  • Filed: 04/20/2006
  • Issued: 10/02/2012
  • Est. Priority Date: 04/21/2005
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method for an author-centric search, comprising:

  • initializing a first data structure and a second data structure for each of a plurality of documented communications wherein each of the plurality of documented communications has at least one author to which the respective documented communication is attributed;

    utilizing the first data structure and the second data structure to compute a relevancy score for each of the plurality of documented communications;

    determining a score for an author of at least one of the plurality of documented communications based in part on the relevancy score for each of the plurality of documented communications authored by the author;

    prompting a user to enter a search string;

    parsing the search string into one or more words;

    populating at least one memory space of the first data structure for each documented communication with data based on the occurrence of the one or more words in the documented communication;

    populating at least one memory space of the second data structure for each documented communication with a weighted value for an author of a given documented communication that signifies a statistical preference for the data in the corresponding memory space of the first data structure;

    executing a mathematical function based on an aggregate of the data and the weighted value of the first and second data structures for each documented communication in order to compute the relevancy score for the documented communication; and

    displaying search results based at least in part upon a ranked listing of one or more author scores, wherein the weighted value for the author comprises a predefined value utilized to create the statistical preference for data in the corresponding memory space of the first data structure, the weighted value for the author being determined based on at least two of;

    a time of publication for the documented communication, a number of documented communications having the author, a prestige of the documented communication, and a number of authors for the documented communication.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×