METHODOLOGIES AND ANALYTICS TOOLS FOR LOCATING EXPERTS WITH SPECIFIC SETS OF EXPERTISE
First Claim
1. A method for use with a collection of documents P0, the method comprising:
- generating categories representing fields of expertise derived from the collection of documents P0;
extracting structured fields from the collection of documents P0;
constructing a contingency table having a first axis defined by the extracted structured fields and a second axis defined by the categories; and
using the contingency table to identify a set of experts having a related expertise.
0 Assignments
0 Petitions
Accused Products
Abstract
A method and analytics tools for locating experts with specific sets of expertise are disclosed, the method including providing a collection of documents P0; generating categories representing fields of expertise derived from the collection of documents P0; refining the taxonomy of the categories by applying user domain knowledge; extracting structured fields from the collection of documents P0; constructing a contingency table having a first axis defined by the extracted structured fields and a second axis defined by the categories; and using the contingency table to identify a set of experts having a related expertise. The method may also include a network graph analysis that aids visualization of the relationship between people and expertise.
32 Citations
20 Claims
-
1. A method for use with a collection of documents P0, the method comprising:
-
generating categories representing fields of expertise derived from the collection of documents P0; extracting structured fields from the collection of documents P0; constructing a contingency table having a first axis defined by the extracted structured fields and a second axis defined by the categories; and using the contingency table to identify a set of experts having a related expertise. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method for use with a set of seed documents P0 extracted from a data warehouse, the method comprising:
-
searching the data warehouse to provide a set of additional documents P1 similar to documents of P0, wherein said similarity is determined using a statistical method; generating an initial taxonomy for a combined document set P0+P1 that includes all documents from both the set of seed documents P0 and the set of additional documents P1; and iterating the processes of extracting, searching, and generating using domain knowledge to produce a refined taxonomy from the initial taxonomy. - View Dependent Claims (9, 10, 11, 12, 13)
-
-
14. A computer program product comprising a computer useable medium including a computer readable program, wherein the computer readable program when executed on a computer causes the computer to:
-
find a relationship between categories of a taxonomy and a collection of names of people derived from structured fields used to classify a set of documents from a data warehouse; plot said relationship as a network graph; and perform a network graph analysis to find a set of names from said collection of names of people that is most related to a set of said categories of said taxonomy. - View Dependent Claims (15, 16, 17, 18, 19)
-
-
20. A computer program product for use with a collection of documents P0, the computer program product comprising a computer useable medium including a computer readable program, wherein the computer readable program when executed on a computer causes the computer to:
-
generate categories representing fields of expertise derived from the collection of documents P0; extract structured fields from the collection of documents P0; construct a contingency table having a first axis defined by the extracted structured fields and a second axis defined by the categories; and identify, using the contingency table, a set of experts having a related expertise.
-
Specification