AUTOMATIC EXPERT IDENTIFICATION, RANKING AND LITERATURE SEARCH BASED ON AUTHORSHIP IN LARGE DOCUMENT COLLECTIONS
First Claim
1. A computer implemented search system, comprising:
- a component that calculates a relevancy score for at least one information source associated with an author; and
a module that generates an author score based in part on an expert weight and the relevancy score.
1 Assignment
0 Petitions
Accused Products
Abstract
Disclosed is an author-centric search that facilitates identifying a source commonly associated with a topic by, for example, providing a ranked listing of experts in a field of knowledge related to a search phrase. The search phrase can be captured and parsed into the individual words (e.g., substrings) of the search phrase. Based on occurrences of the words in one or more documented communications, statistics can be generated to determine the relevancy of each documented communication in relation to the search phrase. Further, additional statistics can be generated describing the occurrence of multiple words in a documented communication and/or a distance of words between the search phrase words in a documented communication. The statistics can be utilized to generate expert scores. The expert scores can be sorted for and/or displayed to the user.
-
Citations
20 Claims
-
1. A computer implemented search system, comprising:
-
a component that calculates a relevancy score for at least one information source associated with an author; and
a module that generates an author score based in part on an expert weight and the relevancy score. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method for an author-centric search, comprising:
-
initializing a first data structure and a second data structure for each of a plurality of documented communications;
utilizing the first data structure and the second data structure to compute a relevancy score for each of the plurality of documented communications; and
determining a score for an author based in part on the relevancy score for each of the plurality of documented communications associated with the author. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A system for an author-centric search, comprising:
-
means for initializing a first data structure and a second data structure for each of a plurality of documented communications;
means for utilizing the first data structure and the second data structure to compute a relevancy score for each of the plurality of documented communications; and
means for determining a score for an author based in part on the relevancy score for each of the plurality of documented communications associated with the author.
-
Specification