Automatic identification of related entities
First Claim
Patent Images
1. A computer-implemented method comprising:
- identifying connections between an entity and at least one or more seed entities from at least one data source, the one or more seed entities received from a device of a user;
generating a set of lists based on the identified connections, each list in the set including one or more entities;
determining a background probability for each entity in the set of lists, the background probability indicating a frequency that each entity appears in the set of lists;
calculating a list score for each list in the set of lists based on a number of seed entities in each list of the set and the determined background probabilities associated with the one or more entities in each list;
identifying a subset of lists from the set of lists based on the calculated list scores;
assigning the calculated list score to the respective one or more entities in each list of the identified subset based on the background probabilities associated with the one or more entities;
ranking the one or more entities in each list of the subset based on the assigned lists scores, so as to identify entities related to the one or more seed entities; and
transmitting at least a portion of the ranked one or more entities from the subset of lists to the user'"'"'s device.
2 Assignments
0 Petitions
Accused Products
Abstract
In one aspect, the present disclosure can be embodied in a method that includes identifying a collection of entities from one or more data sources, calculating a score for subsets of entities from the collection based on one or more seed entities associated with the collection, identifying one or more entities from each of the subsets based on the calculated score, assigning the calculated score to the identified one or more entities from the respective subset, and ranking the one or more entities based on the assigned score, so as to identify entities in the collection that are related to the one or more seed entities.
-
Citations
30 Claims
-
1. A computer-implemented method comprising:
-
identifying connections between an entity and at least one or more seed entities from at least one data source, the one or more seed entities received from a device of a user; generating a set of lists based on the identified connections, each list in the set including one or more entities; determining a background probability for each entity in the set of lists, the background probability indicating a frequency that each entity appears in the set of lists; calculating a list score for each list in the set of lists based on a number of seed entities in each list of the set and the determined background probabilities associated with the one or more entities in each list; identifying a subset of lists from the set of lists based on the calculated list scores; assigning the calculated list score to the respective one or more entities in each list of the identified subset based on the background probabilities associated with the one or more entities; ranking the one or more entities in each list of the subset based on the assigned lists scores, so as to identify entities related to the one or more seed entities; and transmitting at least a portion of the ranked one or more entities from the subset of lists to the user'"'"'s device. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A system for identifying related list entities, the system comprising:
a memory comprising; a list identifier configured to provide instructions to; identify connections between an entity and at least one or more seed entities from at least one data source, the one or more seed entities received from a device of a user, and generate a set of lists based on the identified connections, each list in the set including one or more entities; a list scorer configured to provide instructions to; determine a background probability for each entity in the set of lists, the background probability indicating a frequency that each entity appears in the set of lists, calculate a list score for each list in the set of lists based on a number of seed entities in each list of the set and the determined background probabilities associated with the one or more entities in each list, identify a subset of lists from the set of lists based on the calculated list scores, assign the calculated list score to the respective one or more entities in each list of the identified subset based on the background probabilities associated with the one or more entities, and rank the one or more entities in each list of the subset based on the assigned lists scores, so as to identify entities related to the one or more seed entities; and a list processor configured to provide instructions to transmit at least a portion of the ranked one or more entities from the subset of lists to the user'"'"'s device; and a processor configured to execute instructions provided by the list identifier, the list scorer, and the list processor. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29)
-
30. A machine-readable medium comprising instructions stored therein, which when executed by a system, causes the system to perform operations comprising:
-
identifying social connections for a user of a social networking service based on a social graph associated with the social networking service and one or more seed users provided via an interface of the social networking service, the interface executing at a device of the user; deriving a set of lists based on the identified social connections, each list in the set of lists corresponding to a social circle including a subset of users of the social networking service; determining background probabilities for the subset of users in each list from the set of lists, the background probabilities indicating a frequency that each of the users in the subset appears as a list member in the set of lists; calculating a list score for each list in the set of lists based on a number of the one or more seed users in the subset of users in each list and the determined background probabilities associated with the subset of users in each list; identifying a subset of lists from the set of lists based on the calculated list scores; assigning the calculated list score to the respective users in each list of the identified subset of lists based on the background probabilities associated with the corresponding subset of users; ranking the users in each of the identified subset of lists based on the assigned lists scores, so as to identify users related to the one or more seed users; and transmitting at least a portion of the ranked users from the identified subset of lists to the device of the user via the interface of the social networking service.
-
Specification