Ranking expertise
Abstract
Methods, systems, and apparatus, including computer program products, for ranking expertise. In some implementations, a method is provided that includes identifying a plurality of identities and identifying a plurality of topics using one or more documents in a repository. For a document in a corpus of documents, the method includes identifying one or more occurrences of any identity in the plurality of identities and one or more occurrences of any topic in the plurality of topics; determining an association between the identities occurring in the document and the document, including deriving an identity score for each unique identity occurring in the document; determining an association between the topics occurring in the document and the document, including deriving a topic score for each unique topic occurring in the document; and using the determined associations to derive a score of the document with respect to the identities and topics occurring in the document.
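The abstract leaves the scoring functions open. As a minimal sketch, assuming that relative occurrence frequency stands in for the "degree of relevance" (an assumption, not something the abstract states), the per-document identity and topic scores might be derived as follows. The function name `occurrence_scores` and the sample data are hypothetical.

```python
def occurrence_scores(text, terms):
    """Score each term that occurs in `text` by its share of all matched occurrences.

    Assumption: relative frequency approximates relevance; the source
    does not specify how identity or topic scores are computed.
    """
    lowered = text.lower()
    counts = {term: lowered.count(term.lower()) for term in terms}
    total = sum(counts.values())
    # Terms that never occur are dropped, so an empty match yields {}.
    return {term: n / total for term, n in counts.items() if n > 0}


identities = ["Ada Lovelace", "Alan Turing"]      # hypothetical identity repository
topics = ["computability", "analytical engine"]   # hypothetical topic repository
doc = ("Alan Turing wrote on computability. Alan Turing also cited "
       "Ada Lovelace and the analytical engine.")

identity_scores = occurrence_scores(doc, identities)
topic_scores = occurrence_scores(doc, topics)
```

The same function serves both identities and topics here only because the sketch treats each as a plain string; a real system would likely need name disambiguation and a topic model.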
42 Claims
1. A computer-implemented method comprising:
identifying a plurality of identities stored in a repository of identities, each identity corresponding to an expert of one or more topics;
identifying a plurality of topics stored in a repository of information, each topic i) describing information included by a document of a corpus of documents and ii) distinguishing the document from the remaining documents of the corpus of documents, the plurality of topics including the one or more topics; and
processing each document in the corpus of documents, the processing including;
  identifying one or more identities of the plurality of identities that occur within the document,
  identify one or more topics of the plurality of topics that occur within the document,
  for each identity that occurs within the document, determining, using one or more processors, an identity score for the identity with respect to the document, the identity score indicating a degree of relevance between the associated identity and the document,
  for each topic that occurs within the document, determining a topic score for the topic with respect to the document, the topic score indicating a degree of relevance between the associated topic and the document,
  identifying one or more combinations of i) the one or more identities that occur within the document, and ii) the one or more topics that occur within the document,
  for each identified combination, determining an aggregate score for the document based on the identity score associated with the combination for the document and the topic score associated with the combination for the document; and
aggregating, for each identified combination, the aggregate score of each document of the corpus of documents for the identified combination to define a composite score of the identified combination across the corpus of documents.
Dependent claims: 2, 3, 4, 5, 6, 7, 40

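The combination and aggregation steps of claim 1 can be sketched as below. This is an illustration under stated assumptions, not the claimed implementation: the aggregate score is taken to be the product of the identity score and the topic score, and the composite score is taken to be the sum of aggregate scores over documents. The claim only says "based on" and "aggregating", so both choices are hypothetical, as are the function name `composite_scores` and the sample data.

```python
from collections import defaultdict
from itertools import product


def composite_scores(corpus_scores):
    """Combine per-document scores into corpus-wide (identity, topic) scores.

    corpus_scores: iterable of (identity_scores, topic_scores) pairs,
    one pair of dicts per document.
    """
    composite = defaultdict(float)
    for identity_scores, topic_scores in corpus_scores:
        # Every identity/topic pair co-occurring in the document is a combination.
        for identity, topic in product(identity_scores, topic_scores):
            # Assumption: aggregate score = identity score * topic score.
            aggregate = identity_scores[identity] * topic_scores[topic]
            # Assumption: composite score = sum of aggregates across documents.
            composite[(identity, topic)] += aggregate
    return dict(composite)


# Hypothetical per-document scores for a two-document corpus.
corpus = [
    ({"alice": 0.5, "bob": 0.5}, {"search": 1.0}),
    ({"alice": 1.0}, {"search": 0.25, "ranking": 0.75}),
]
scores = composite_scores(corpus)
```

With these inputs, the pair ("alice", "search") accumulates contributions from both documents, while ("bob", "search") and ("alice", "ranking") each come from a single document.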
8. A method comprising:
receiving a search query;
determining, using one or more processors, that the search query matches an identity among a plurality of identities, each identity of the plurality of identities corresponding to an expert of one or more experts;
identifying multiple topics associated with the identity based on multiple documents in which the identity is referred, each topic of the multiple topics i) describing information included by a document of the multiple documents and ii) distinguishing the document from the remaining documents of the multiple documents;
for each document of the multiple documents, obtaining a composite score of each topic of the document based on an aggregation of aggregate scores of the topic for the document, the aggregate score of each topic based on an identity score associated with the document and a topic score of each topic associated with the document, the topic score for each topic indicating a degree of relevance between the topic and the document, and the identity score for the identity indicating a degree of relevance between the identity and the document;
ordering the multiple topics based on a respective composite score associated with each document of the multiple documents; and
presenting the multiple topics based on the ordering.
Dependent claims: 9, 10, 41

11. A method comprising:
receiving a search query;
determining, using one or more processors, that the search query matches a topic among a plurality of topics, each topic of the plurality of topics i) describing information included by a document of multiple documents and ii) distinguishing the document from the remaining documents of the multiple documents;
identifying multiple identities associated with the topic based on multiple documents in which the topic is referred, each identity of the multiple identities corresponding to an expert of one or more experts;
for each document of the multiple documents, obtaining a composite score of each identity of the document based on an aggregation of aggregate scores of the identity for the document, the aggregate score of each identity based on a topic score associated with the document and an identity score of each identity associated with the document, the identity score for each identity indicating a degree of relevance between the identity and the document, and the topic score for the topic indicating a degree of relevance between the topic and the document;
ordering the multiple identities based on a respective composite score associated with each document of the multiple documents; and
presenting the multiple identities based on the ordering.
Dependent claims: 12, 13, 42

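Claims 8 and 11 describe mirror-image lookups: a query that matches an identity returns ordered topics, and a query that matches a topic returns ordered identities. The sketch below illustrates both directions over a precomputed table of composite scores. It assumes exact string matching between the query and a known identity or topic, descending order by composite score, and alphabetical tie-breaking; none of these details come from the claims. The function name `rank_for_query` and the sample table are hypothetical.

```python
def rank_for_query(query, composite, identities, topics):
    """Return topics for a matched identity, or identities for a matched topic.

    composite: dict mapping (identity, topic) pairs to composite scores.
    Assumption: the query matches by exact string equality.
    """
    if query in identities:
        # Claim 8 direction: collect the topics paired with this identity.
        matches = {t: s for (i, t), s in composite.items() if i == query}
    elif query in topics:
        # Claim 11 direction: collect the identities paired with this topic.
        matches = {i: s for (i, t), s in composite.items() if t == query}
    else:
        return []
    # Highest composite score first; ties broken alphabetically (assumption).
    return sorted(matches, key=lambda name: (-matches[name], name))


# Hypothetical composite scores for illustration.
table = {
    ("alice", "search"): 0.75,
    ("bob", "search"): 0.5,
    ("alice", "ranking"): 0.9,
}
known_identities = {"alice", "bob"}
known_topics = {"search", "ranking"}

topics_for_alice = rank_for_query("alice", table, known_identities, known_topics)
experts_on_search = rank_for_query("search", table, known_identities, known_topics)
```

Checking identities before topics is an arbitrary choice for the sketch; a query that matched both would need a policy the claims do not describe.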
14. A computer program product, encoded on a non-transitory computer-readable medium, operable to cause data processing apparatus to perform operations comprising:
identifying a plurality of identities stored in a repository of identities, each identity corresponding to an expert of one or more topics;
identifying a plurality of topics stored in a repository of information, each topic i) describing information included by a document of a corpus of documents and ii) distinguishing the document from the remaining documents of the corpus of documents, the plurality of topics including the one or more topics; and
processing each document in the corpus of documents, the processing including;
  identifying one or more identities of the plurality of identities that occur within the document,
  identifying one or more topics of the plurality of topics that occur within the document,
  for each identity that occurs within the document, determining an identity score for the identity with respect to the document, the identity score indicating a degree of relevance between the associated identity and the document,
  for each topic that occurs within the document, determining a topic score for the topic with respect to the document, the topic score indicating a degree of relevance between the associated topic and the document,
  identifying one or more combinations of i) the one or more identities that occur within the document, and ii) the one or more topics that occur within the document,
  for each identified combination, determining an aggregate score for the document based on the identity score associated with the combination for the document and the topic score associated with the combination for the document; and
aggregating, for each identified combination, the aggregate score of each document of the corpus of documents for the identified combination to define a composite score of the identified combination across the corpus of documents.
Dependent claims: 15, 16, 17, 18, 19, 20

21. A computer program product, encoded on a non-transitory computer-readable medium, operable to cause data processing apparatus to perform operations comprising:
receiving a search query;
determining that the search query matches an identity among a plurality of identities, each identity of the plurality of identities corresponding to an expert of one or more experts;
identifying multiple topics associated with the identity based on multiple documents in which the identity is referred, each topic of the multiple topics i) describing information included by a document of the multiple documents and ii) distinguishing the document from the remaining documents of the multiple documents;
for each document of the multiple documents, obtaining a composite score of each topic of the document based on an aggregation of aggregate scores of the topic for the document, the aggregate score of each topic based on an identity score associated with the document and a topic score of each topic associated with the document, the topic score for each topic indicating a degree of relevance between the topic and the document, and the identity score for the identity indicating a degree of relevance between the identity and the document;
ordering the multiple topics based on a respective composite score associated with each document of the multiple documents; and
presenting the multiple topics based on the ordering.
Dependent claims: 22, 23

24. A computer program product, encoded on a non-transitory computer-readable medium, operable to cause data processing apparatus to perform operations comprising:
receiving a search query;
determining that the search query matches a topic among a plurality of topics, each topic of the plurality of topics i) describing information included by a document of multiple documents and ii) distinguishing the document from the remaining documents of the multiple documents;
identifying multiple identities associated with the topic based on multiple documents in which the topic is referred, each identity of the multiple identities corresponding to an expert of one or more experts;
for each document of the multiple documents, obtaining a composite score of each identity of the document based on an aggregation of aggregate scores of the identity for the document, the aggregate score of each identity based on a topic score associated with the document and an identity score of each identity associated with the document, the identity score for each identity indicating a degree of relevance between the identity and the document, and the topic score for the topic indicating a degree of relevance between the topic and the document;
ordering the multiple identities based on a respective composite score associated with each document of the multiple documents; and
presenting the multiple identities based on the ordering.
Dependent claims: 25, 26, 28, 29, 30, 31, 32, 33

27. A system comprising:
one or more processors configured to perform operations including;
identifying a plurality of identities stored in a repository of identities, each identity corresponding to an expert of one or more topics;
identifying a plurality of topics stored in a repository of information, each topic i) describing information included by a document of a corpus of documents and ii) distinguishing the document from the remaining documents of the corpus of documents, the plurality of topics including the one or more topics; and
processing each document in the corpus of documents, the processing including;
  means for identifying one or more identities of the plurality of identities that occur within the document,
  means for identifying one or more topics of the plurality of topics that occur within the document,
  for each identity occurring within the document, means for determining an identity score for the identity with respect to the document, the identity score indicating a degree of relevance between the associated identity and the document,
  for each topic occurring within the document, means for determining a topic score for the topic with respect to the document, the topic score indicating a degree of relevance between the associated topic and the document,
  means for identifying one or more combinations of i) the one or more identities that occur within the document, and ii) the one or more topics that occur within the document,
  for each identified combination, means for determining an aggregate score for the document based on the identity score associated with the combination for the document and the topic score associated with the combination for the document; and
means for aggregating, for each identified combination, the aggregate score of each document of the corpus of documents for the identified combination to define a composite score of the identified combination across the corpus of documents.

34. A system comprising:
one or more processors configured to perform operations including;
receiving a search query;
determining that the search query matches an identity among a plurality of identities, each identity of the plurality of identities corresponding to an expert of one or more experts;
identifying multiple topics associated with the identity based on multiple documents in which the identity is referred, each topic of the multiple topics i) describing information included by a document of the multiple documents and ii) distinguishing the document from the remaining documents of the multiple documents;
for each document of the multiple documents, obtaining a composite score of each topic of the document based on an aggregation of aggregate scores of the topic for the document, the aggregate score of each topic based on an identity score associated with the document and a topic score of each topic associated with the document, the topic score for each topic indicating a degree of relevance between the topic and the document, and the identity score for the identity indicating a degree of relevance between the identity and the document;
ordering the multiple topics based on a respective composite score associated with each document of the multiple documents; and
presenting the multiple topics based on the ordering.
Dependent claims: 35, 36

37. A system comprising:
one or more processors configured to perform operations including;
receiving a search query;
determining that the search query matches a topic among a plurality of topics, each topic of the plurality of topics i) describing information included by a document of multiple documents and ii) distinguishing the document from the remaining documents of the multiple documents;
identifying multiple identities associated with the topic based on multiple documents in which the topic is referred, each identity of the multiple identities corresponding to an expert of one or more experts;
for each document of the multiple documents, obtaining a composite score of each identity of the document based on an aggregation of aggregate scores of the identity for the document, the aggregate score of each identity based on a topic score associated with the document and an identity score of each identity associated with the document, the identity score for each identity indicating a degree of relevance between the identity and the document, and the topic score for the topic indicating a degree of relevance between the topic and the document;
ordering the multiple identities based on a respective composite score associated with each document of the multiple documents; and
presenting the multiple identities based on the ordering.
Dependent claims: 38, 39

Specification