Method and system for mining information based on relationships
First Claim
1. A computer-implemented method for a computer to identify authors that have a relationship, the method comprising:
- providing an indication of authors of papers;
identifying clusters of authors based on co-author relationships between the authors wherein authors within a same cluster have a relationship;
storing in a memory of the computer an indication of the identified clusters of authors; and
calculating an importance of authors within an author clusterwherein the importance of an author is defined recursively based on the importance of the papers authored by the author andwherein the importance of an author and a paper is defined as follows;
Rauthor=W diag(RpaperNTpaper)
Rpaper=WT diag(RauthorNTauthor)where Rauthor is a vector of the importance (or ranking) of the authors, Rpaper is a vector of the importance (or ranking) of the papers, W is an adjacency matrix mapping authors to papers, WT is a transpose of the adjacency matrix W, NTpaper is a matrix of normalization terms for the papers, and NTauthor is a matrix of normalization terms for the authors.
2 Assignments
0 Petitions
Accused Products
Abstract
A method and system for identifying information about people is provided. The information system identifies groups of people that have relationships based on their relationships to documents or more generally to objects. The information system initially is provided with an indication of which people have which relationships to which documents. The information system then identifies clusters of people based on having a relationship to the same objects. The information system may also identify clusters of related objects associated with a cluster of people. When a user wants to identify information about a person, the user can provide the name of that person to the information system. The information system then can retrieve and display the names of the other people who are in the same cluster as the person.
42 Citations
20 Claims
-
1. A computer-implemented method for a computer to identify authors that have a relationship, the method comprising:
-
providing an indication of authors of papers; identifying clusters of authors based on co-author relationships between the authors wherein authors within a same cluster have a relationship; storing in a memory of the computer an indication of the identified clusters of authors; and calculating an importance of authors within an author cluster wherein the importance of an author is defined recursively based on the importance of the papers authored by the author and wherein the importance of an author and a paper is defined as follows;
Rauthor=W diag(RpaperNTpaper)
Rpaper=WT diag(RauthorNTauthor)where Rauthor is a vector of the importance (or ranking) of the authors, Rpaper is a vector of the importance (or ranking) of the papers, W is an adjacency matrix mapping authors to papers, WT is a transpose of the adjacency matrix W, NTpaper is a matrix of normalization terms for the papers, and NTauthor is a matrix of normalization terms for the authors. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer-readable storage medium containing instructions for controlling a computer system to provide information about authors that have a relationship, by a method comprising:
-
accessing indications of authors of papers to identify clusters of authors based on co-author relationships between the authors wherein authors within a same cluster have a co-author relationship; storing an indication of the identified clusters of authors; and calculating an importance of authors within an author cluster wherein the importance of an author is defined recursively based on the importance of the papers authored by the author and wherein the importance of an author and a paper is defined as follows;
Rauthor=W diag(RpaperNTpaper)
Rpaper=WT diag(RauthorNTauthor)where Rauthor is a vector of the importance of the authors, Rpaper is a vector of the importance of the papers, W is an adjacency matrix mapping authors to papers, WT is a transpose of the adjacency matrix W, NTpaper is a matrix of normalization terms for the papers, and NTauthor is a matrix of normalization terms for the authors. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
Specification