Disambiguation of online social mentions
Abstract
The technology disclosed relates to identifying entity reflections that refer to a same real-world entity. In particular, it relates to using statistical functions to make probabilistic deductions about entity attributes, which are used to construct optimal combinations of entity attributes. These optimal combinations of entity attributes are further used to generate search queries that return more precise search results with greater recall.
20 Claims
1. A method of disambiguating online social mentions of a real-world entity, the method including:
selecting one or more core entity attributes that represent a real-world entity as a first search attribute set for use in searching biographical sources;
receiving, responsive to searching the biographical sources based on the first search attribute set, entity reflections that include supplemental entity attributes for the real-world entity;
using a combination of the selected core entity attributes and one or more supplemental entity attributes to define a second search attribute set that is narrower than the first search attribute set;
receiving, responsive to searching further web sources based on the second search attribute set, more entity reflections that include meta entity attributes for the real-world entity; and
updating the combination to include one or more of the meta entity attributes.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
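The steps of claim 1 can be pictured as a two-pass search in which the second attribute set is stricter than the first. The Python sketch below is illustrative only: the in-memory `BIO_SOURCES` and `WEB_SOURCES` lists, the `search` and `new_attributes` helpers, and the rule of keeping only values on which all hits agree are hypothetical stand-ins, not anything the claim specifies.

```python
from typing import Dict, List

Record = Dict[str, str]

# Hypothetical stand-ins for biographical sources and further web sources.
BIO_SOURCES: List[Record] = [
    {"name": "Jane Doe", "city": "Austin", "employer": "Acme"},
    {"name": "Jane Doe", "city": "Boston", "employer": "Globex"},
]
WEB_SOURCES: List[Record] = [
    {"name": "Jane Doe", "city": "Austin", "employer": "Acme", "handle": "@jdoe_acme"},
    {"name": "Jane Doe", "city": "Austin", "employer": "Initech", "handle": "@other_jane"},
    {"name": "Jane Doe", "city": "Boston", "employer": "Globex", "handle": "@jane_g"},
]


def search(sources: List[Record], attrs: Record) -> List[Record]:
    """Return the records that agree with every attribute in the search set."""
    return [r for r in sources if all(r.get(k) == v for k, v in attrs.items())]


def new_attributes(hits: List[Record], known: Record) -> Record:
    """Collect attributes not yet known, keeping only values all hits agree on."""
    found: Record = {}
    keys = {k for h in hits for k in h if k not in known}
    for key in sorted(keys):
        values = {h[key] for h in hits if key in h}
        if len(values) == 1:
            found[key] = values.pop()
    return found


def disambiguate(core: Record) -> Record:
    # First pass: the core attributes form the first search attribute set.
    first_set = dict(core)
    bio_hits = search(BIO_SOURCES, first_set)
    supplemental = new_attributes(bio_hits, first_set)

    # The combination of core and supplemental attributes defines a second,
    # narrower search attribute set (more constraints, fewer matches).
    combination = {**core, **supplemental}
    second_set = dict(combination)

    # Second pass: search further web sources and fold the meta attributes
    # back into the combination.
    web_hits = search(WEB_SOURCES, second_set)
    combination.update(new_attributes(web_hits, second_set))
    return combination


if __name__ == "__main__":
    print(disambiguate({"name": "Jane Doe", "city": "Austin"}))
```

In this toy data the core attributes alone match two web records, while the second attribute set, which adds the employer learned from the biographical source, matches only one.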

9. A method for creating a unique online social mention of a real-world entity from a plurality of online social mentions, the method including:
selecting one or more core entity attributes that represent a real-world entity as a first search attribute set for searching biographical sources;
electronically receiving, responsive to searching the biographical sources based on the first search attribute set, entity reflections that include supplemental entity attributes for the real-world entity;
combining supplemental attributes having attribute scores above a predefined threshold with core attributes to define a second search attribute set;
electronically receiving, responsive to searching further web sources based on the second search attribute set, more entity reflections that include meta entity attributes for the real-world entity; and
updating the combination to include one or more of the meta entity attributes having attribute scores above the predefined threshold.
Dependent claims: 10, 11

12. A system, including:
one or more processors coupled to memory, the memory loaded with computer instructions that, when executed on the processors, implement actions including:
selecting one or more core entity attributes that represent a real-world entity as a first search attribute set for use in searching biographical sources;
receiving, responsive to searching the biographical sources based on the first search attribute set, entity reflections that include supplemental entity attributes for the real-world entity;
using a combination of the selected core entity attributes and one or more supplemental entity attributes to define a second search attribute set that is narrower than the first search attribute set;
receiving, responsive to searching further web sources based on the second search attribute set, more entity reflections that include meta entity attributes for the real-world entity; and
updating the combination to include one or more of the meta entity attributes.
Dependent claims: 13, 14, 15, 16, 17

18. A system, including:
one or more processors coupled to memory, the memory loaded with computer instructions that, when executed on the processors, implement actions including:
selecting one or more core entity attributes that represent a real-world entity as a first search attribute set for searching biographical sources;
electronically receiving, responsive to searching the biographical sources based on the first search attribute set, entity reflections that include supplemental entity attributes for the real-world entity;
combining supplemental attributes having attribute scores above a predefined threshold with core attributes to define a second search attribute set;
electronically receiving, responsive to searching further web sources based on the second search attribute set, more entity reflections that include meta entity attributes for the real-world entity; and
updating the combination to include one or more of the meta entity attributes having attribute scores above the predefined threshold.

19. A non-transitory computer readable medium storing a plurality of instructions for programming one or more processors to identify entity mentions referencing a same real-world entity, the instructions, when executed on the processors, implementing actions including:
selecting one or more core entity attributes that represent a real-world entity as a first search attribute set for use in searching biographical sources, including in the selection applying one or more probability distribution functions or joint probability distribution functions to estimate resulting cohort size;
generating one or more searches for processing by a plurality of biographical sources using the first search attribute set;
electronically receiving, responsive to the first search attribute set, entity reflections that include supplemental entity attributes for the real-world entity;
selecting one or more extended entity attributes as a second search attribute set for use in searching web sources, including applying one or more further probability distribution functions or joint probability distribution functions to estimate resulting cohort size;
generating one or more further web searches using the second search attribute set;
electronically receiving, responsive to the second search attribute set, more entity reflections that include meta entity attributes for the real-world entity; and
updating the anchor entity candidate to include one or more of the meta entity attributes.
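Claim 19 ties attribute selection to an estimate of the resulting cohort size. One way to picture this is the minimal sketch below, which assumes the attributes are statistically independent so that the joint probability is the product of the marginals. That independence assumption, the `POPULATION` figure, the `FREQ` table, and the function names are all hypothetical and are not taken from the patent.

```python
from itertools import combinations
from math import prod
from typing import Dict, Iterable, Optional, Tuple

POPULATION = 1_000_000  # hypothetical size of the searched population

# Hypothetical marginal probabilities that a random person has the
# same value as the target entity for each attribute.
FREQ: Dict[str, float] = {"name": 0.001, "city": 0.02, "birth_year": 0.015}


def expected_cohort(attrs: Iterable[str]) -> float:
    """Expected number of people matching all attrs, assuming independence."""
    return POPULATION * prod(FREQ[a] for a in attrs)


def smallest_attribute_set(max_cohort: float) -> Optional[Tuple[str, ...]]:
    """Pick the fewest attributes whose expected cohort is at most max_cohort.

    Among sets of equal size, the one with the smallest expected cohort wins.
    Returns None when no combination is selective enough.
    """
    names = sorted(FREQ)
    for size in range(1, len(names) + 1):
        candidates = [
            c for c in combinations(names, size) if expected_cohort(c) <= max_cohort
        ]
        if candidates:
            return min(candidates, key=expected_cohort)
    return None


if __name__ == "__main__":
    for limit in (5000, 50, 0.1):
        print(limit, smallest_attribute_set(limit))
```

With these made-up frequencies, a name alone is expected to match roughly a thousand people, so a tighter cohort limit forces a second attribute into the search set.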

20. A non-transitory computer readable medium storing a plurality of instructions for programming one or more processors to connect entity reflections to real-world entities in an ambiguous environment, the instructions, when executed on the processors, implementing actions including:
selecting one or more core entity attributes that represent a real-world entity as a first search attribute set for use in searching biographical sources, including in the selection applying one or more probability distribution functions or joint probability distribution functions to estimate resulting cohort size;
generating one or more searches for processing by a plurality of biographical sources using the first search attribute set;
electronically receiving, responsive to the first search attribute set, entity reflections that include supplemental entity attributes for the real-world entity;
calculating attribute scores for supplemental attributes using a probability contribution function, wherein the attribute scores specify a quantitative assessment of similarity between the supplemental attributes and the core attributes;
merging supplemental attributes with attribute scores above a predefined threshold with core attributes in an anchor entity candidate data object with extended entity attributes that represent the real-world entity;
selecting one or more extended entity attributes as a second search attribute set for use in searching web sources, including applying one or more further probability distribution functions or joint probability distribution functions to estimate resulting cohort size;
electronically receiving, responsive to the second search attribute set, more entity reflections that include meta entity attributes for the real-world entity;
calculating attribute scores for meta entity attributes using a probability contribution function, wherein the attribute scores specify a quantitative assessment of similarity between the meta entity attributes and the extended entity attributes; and
updating the anchor entity candidate to include one or more of the meta entity attributes with attribute scores above the predefined threshold.
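The score-and-threshold steps of claim 20 can be sketched as follows. The claim text shown here does not define the probability contribution function, so this example swaps in a simple stand-in: the fraction of returned reflections that carry a given attribute value. The function names, the sample reflections, and the 0.5 threshold are hypothetical.

```python
from collections import Counter
from typing import Dict, List, Tuple

Record = Dict[str, str]


def score_attributes(
    reflections: List[Record], known: Record
) -> Dict[Tuple[str, str], float]:
    """Stand-in score: share of reflections carrying each new (key, value) pair."""
    total = len(reflections)
    if total == 0:
        return {}
    counts = Counter(
        (k, v) for r in reflections for k, v in r.items() if k not in known
    )
    return {pair: n / total for pair, n in counts.items()}


def merge_above_threshold(
    anchor: Record, reflections: List[Record], threshold: float
) -> Record:
    """Return a copy of the anchor candidate extended with high-scoring attributes."""
    updated = dict(anchor)
    scores = score_attributes(reflections, anchor)
    # Visit the highest scores first so the best-supported value for a key wins.
    for (key, value), score in sorted(scores.items(), key=lambda kv: -kv[1]):
        if score > threshold and key not in updated:
            updated[key] = value
    return updated


if __name__ == "__main__":
    anchor = {"name": "Jane Doe"}
    reflections = [
        {"name": "Jane Doe", "employer": "Acme"},
        {"name": "Jane Doe", "employer": "Acme"},
        {"name": "Jane Doe", "employer": "Globex", "city": "Austin"},
    ]
    print(merge_above_threshold(anchor, reflections, threshold=0.5))
```

The same `merge_above_threshold` call would serve both passes in this sketch: once with reflections from biographical sources to build the extended attributes, and again with reflections from web sources to add meta attributes.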
Specification