Automated assistance for generating relevant and valuable search results for an entity of interest
First Claim
1. A system for identifying relevant information for an entity comprising:
one or more processors; and
memory storing instructions that, when executed by the one or more processors, cause the system to:
generate a plurality of search queries comprising a seed entity and a plurality of entities associated with the seed entity, the plurality of entities including at least one first entity and at least one second entity, the at least one first entity being associated with the seed entity based on a hard link between the at least one first entity and the seed entity, the at least one second entity being associated with the seed entity based on a soft link between the at least one second entity and the seed entity, the soft link being generated based on one or more prior search queries;
conduct searches, with the search queries, in one or more data sources to obtain a plurality of search results, wherein each search result comprises a hit cluster, each hit cluster including properties of a hit entity and properties of one or more entities associated with the hit entity; and
determine a score for each of the hit clusters, taking as input (a) likelihood of match between the seed entity and the hit entity or between an entity associated with the seed entity and an entity associated with the hit entity, (b) presence of a new entity in the search result not present in the search queries and a difference between the new entity and an entity present in the search queries, and (c) characteristic of the new entity in the search result.
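The scoring step recited above can be illustrated with a small sketch. This is not the patented implementation: the `Entity` and `HitCluster` types, the token-overlap matcher `name_similarity`, the `kind_weights` table, and the formula `match * (1 + novelty)` are all assumptions chosen only to show how inputs (a), (b), and (c) might be combined.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Entity:
    name: str
    kind: str = "person"  # hypothetical property standing in for a "characteristic"


@dataclass
class HitCluster:
    hit: Entity
    linked: list[Entity] = field(default_factory=list)


def name_similarity(a: str, b: str) -> float:
    """Jaccard overlap of lowercase tokens; a stand-in for a real matcher."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    return len(ta & tb) / len(ta | tb) if ta | tb else 0.0


def score_cluster(cluster, seed, seed_linked, queried_names, kind_weights):
    # (a) Likelihood of match: seed vs. hit entity, or any seed-linked
    #     entity vs. any hit-linked entity; take the best pair.
    pairs = [(seed, cluster.hit)]
    pairs += [(s, h) for s in seed_linked for h in cluster.linked]
    match = max(name_similarity(s.name, h.name) for s, h in pairs)

    # (b) New entities: present in the result but absent from the queries,
    #     each measured by how different it is from the closest queried name.
    # (c) Characteristic of the new entity: an assumed per-kind weight.
    novelty = 0.0
    for ent in [cluster.hit, *cluster.linked]:
        if ent.name in queried_names:
            continue
        closest = max((name_similarity(ent.name, q) for q in queried_names), default=0.0)
        novelty += (1.0 - closest) * kind_weights.get(ent.kind, 1.0)

    # Assumed combination: novelty only counts when the cluster matches the seed.
    return match * (1.0 + novelty)


seed = Entity("John Smith")
seed_linked = [Entity("555-0100", "phone")]
queried = {"John Smith", "555-0100"}
weights = {"company": 2.0}

with_new = HitCluster(Entity("John Smith"), [Entity("Acme Holdings", "company")])
without_new = HitCluster(Entity("John Smith"))

score_new = score_cluster(with_new, seed, seed_linked, queried, weights)
score_old = score_cluster(without_new, seed, seed_linked, queried, weights)
```

Under these assumptions, the cluster that surfaces a previously unseen company scores higher than the cluster that only repeats the seed, which is the ranking behaviour the claim is directed at.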
Abstract
Systems and methods are provided for identifying relevant information for an entity, referred to as a seed entity. A plurality of search queries can be generated, each comprising a property of the seed entity or of one of the entities associated with the seed entity (seed-linked entities). Preferably, the collection of search queries includes queries representing different properties of the seed entity and properties of different seed-linked entities. Optionally, the collection of search queries is optimized to reduce search burden. Searches can then be conducted with the search queries in one or more data sources to obtain a plurality of search results, wherein each search result comprises a hit entity and one or more entities associated with the hit entity (hit-linked entities). For each of the search results, a score can be determined taking as input (a) likelihood of match between the seed entity and the hit entity or between a seed-linked entity and a hit-linked entity, (b) presence of a new entity in the search result not present in the search queries or a difference between the new entity and an entity present in the search queries, and (c) characteristic of the new entity in the search result. Based on the scores, high-priority search results can be presented to a user for further analysis.
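The query-generation step in the abstract can be sketched as follows. The `LinkedEntity` type, the `build_queries` function, and the choice of case-insensitive de-duplication as the way to "reduce search burden" are illustrative assumptions; the abstract does not specify how the optimization is performed. The `"hard"` and `"soft"` labels follow the link types named in the claims.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LinkedEntity:
    properties: tuple[str, ...]
    link: str  # "hard" or "soft", following the link types in the claims


def build_queries(seed_properties, linked_entities):
    """Return one query per distinct property, preserving first-seen order."""
    candidates = [("seed", p) for p in seed_properties]
    for ent in linked_entities:
        candidates += [(ent.link, p) for p in ent.properties]

    seen = set()
    queries = []
    for source, prop in candidates:
        key = prop.strip().lower()  # assumed normalization for de-duplication
        if key and key not in seen:
            seen.add(key)
            queries.append({"term": prop.strip(), "source": source})
    return queries


queries = build_queries(
    ["John Smith", "john smith ", "555-0100"],
    [
        LinkedEntity(("Acme Holdings",), "hard"),
        LinkedEntity(("555-0100", "Jane Doe"), "soft"),
    ],
)
```

Here six candidate properties collapse to four queries, because the repeated name and the phone number shared with the soft-linked entity are searched only once.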
20 Claims
1. A system for identifying relevant information for an entity comprising:
one or more processors; and
memory storing instructions that, when executed by the one or more processors, cause the system to:
generate a plurality of search queries comprising a seed entity and a plurality of entities associated with the seed entity, the plurality of entities including at least one first entity and at least one second entity, the at least one first entity being associated with the seed entity based on a hard link between the at least one first entity and the seed entity, the at least one second entity being associated with the seed entity based on a soft link between the at least one second entity and the seed entity, the soft link being generated based on one or more prior search queries;
conduct searches, with the search queries, in one or more data sources to obtain a plurality of search results, wherein each search result comprises a hit cluster, each hit cluster including properties of a hit entity and properties of one or more entities associated with the hit entity; and
determine a score for each of the hit clusters, taking as input (a) likelihood of match between the seed entity and the hit entity or between an entity associated with the seed entity and an entity associated with the hit entity, (b) presence of a new entity in the search result not present in the search queries and a difference between the new entity and an entity present in the search queries, and (c) characteristic of the new entity in the search result.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 12
11. A computer-implemented method comprising:
generating, on a suitably programmed computing device, a plurality of search queries comprising a seed entity and a plurality of entities associated with the seed entity, the plurality of entities including at least one first entity and at least one second entity, the at least one first entity being associated with the seed entity based on a hard link between the at least one first entity and the seed entity, the at least one second entity being associated with the seed entity based on a soft link between the at least one second entity and the seed entity, the soft link being generated based on one or more prior search queries;
conducting searches, with the search queries, in one or more data sources to obtain a plurality of search results, wherein each search result comprises a hit cluster, each hit cluster including properties of a hit entity and properties of one or more entities associated with the hit entity; and
determining a score for each of the hit clusters, taking as input (a) likelihood of match between the seed entity and the hit entity or between an entity associated with the seed entity and an entity associated with the hit entity, (b) presence of a new entity in the search result not present in the search queries and a difference between the new entity and an entity present in the search queries, and (c) characteristic of the new entity in the search result.
Dependent claims: 13, 14, 15, 16, 17, 18, 19
20. A non-transitory computer readable medium comprising instructions that, when executed, cause one or more processors to perform:
generating, on a suitably programmed computing device, a plurality of search queries comprising a seed entity and a plurality of entities associated with the seed entity, the plurality of entities including at least one first entity and at least one second entity, the at least one first entity being associated with the seed entity based on a hard link between the at least one first entity and the seed entity, the at least one second entity being associated with the seed entity based on a soft link between the at least one second entity and the seed entity, the soft link being generated based on one or more prior search queries;
conducting searches, with the search queries, in one or more data sources to obtain a plurality of search results, wherein each search result comprises a hit cluster, each hit cluster including properties of a hit entity and properties of one or more entities associated with the hit entity; and
determining a score for each of the hit clusters, taking as input (a) likelihood of match between the seed entity and the hit entity or between an entity associated with the seed entity and an entity associated with the hit entity, (b) presence of a new entity in the search result not present in the search queries and a difference between the new entity and an entity present in the search queries, and (c) characteristic of the new entity in the search result.
Specification