Managing information about entities using clusters of received observations
First Claim
Patent Images
1. A method for managing information about entities, the method comprising:
- receiving an observation, by one or more computers, the observation including an updated piece of information about an entity having a geographic location and a context, wherein the context includes at least one value of an attribute describing the entity to which the updated piece of information relates;
storing, by the one or more computers, the received observation as an immutable observation that includes the updated piece of information about the entity having the geographic location and the context, wherein the immutable observation is not modifiable after storage of the immutable observation;
matching, by the one or more computers, the immutable observation with a first cluster of observations representing the entity using the context;
the matching comprising;
generating a query derived from one or more values of attributes included in the context;
identifying, by the one or more computers, one or more candidate clusters of observations responsive to the generated query;
generating, by the one or more computers, a respective score for the one or more candidate clusters, wherein the respective score is based on a comparison of one or more attribute values from the context and a corresponding one or more attribute values of the one or more candidate clusters; and
matching, by the one or more computers, the immutable observation with a select candidate cluster having the highest respective score; and
associating, by the one or more computers, the immutable observation with the first cluster of observations.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for managing entities using observations. In one aspect, a method includes receiving an observation, the observation including an updated piece of information about an entity and a context, wherein the context includes at least one value of an attribute describing the entity to which the updated piece of information relates; matching the received observation with a first cluster of observations representing the entity using the context; and associating the received observation with the first cluster of observations.
-
Citations
30 Claims
-
1. A method for managing information about entities, the method comprising:
-
receiving an observation, by one or more computers, the observation including an updated piece of information about an entity having a geographic location and a context, wherein the context includes at least one value of an attribute describing the entity to which the updated piece of information relates; storing, by the one or more computers, the received observation as an immutable observation that includes the updated piece of information about the entity having the geographic location and the context, wherein the immutable observation is not modifiable after storage of the immutable observation; matching, by the one or more computers, the immutable observation with a first cluster of observations representing the entity using the context;
the matching comprising;generating a query derived from one or more values of attributes included in the context; identifying, by the one or more computers, one or more candidate clusters of observations responsive to the generated query; generating, by the one or more computers, a respective score for the one or more candidate clusters, wherein the respective score is based on a comparison of one or more attribute values from the context and a corresponding one or more attribute values of the one or more candidate clusters; and matching, by the one or more computers, the immutable observation with a select candidate cluster having the highest respective score; and associating, by the one or more computers, the immutable observation with the first cluster of observations. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method performed by a system of one or more computers, the method comprising:
-
receiving a plurality of observations, each observation including an updated piece of information about a respective entity and a context, wherein the context includes at least one value of an attribute describing the respective entity to which the updated piece of information relates, the plurality of observations including a first observation and a second observation; storing the plurality of received observations as a plurality of immutable observations that includes the updated piece of information about the entity having the geographic location and the context, wherein each immutable observation is not modifiable after storage of the immutable observation; clustering each of the immutable observations based on their contexts into a first cluster of observations, wherein the first cluster is regarded as representing a first entity; receiving additional observations, each additional observation including an updated piece of information about a respective entity and a context, wherein the context includes at least one value of an attribute describing the respective entity to which the updated piece of information relates; storing the plurality of received additional observations as a plurality of immutable additional observations that includes the updated piece of information about the entity having the geographic location and the context, wherein each immutable additional observation is not modifiable after storage of the immutable additional observation; generating a first score for the first cluster, wherein the first score is based on a comparison of the at least one attribute value from the context of the immutable additional observations and a corresponding at least one attribute value of the first cluster; generating a second score for the second cluster, wherein the second score is based on a comparison of the at least one attribute value from the context of the immutable additional observations and a corresponding at least one attribute value of the second cluster; matching the immutable additional observations with the first cluster if the first score is higher than the second score; and matching the immutable additional observations with the second cluster if the second score is higher than the first score.
-
-
11. A computer storage medium encoded with a computer program, the program comprising instructions that when executed by one or more computers cause the one or more computers to perform operations comprising:
-
receiving an observation, the observation including an updated piece of information about an entity and a context, wherein the context includes at least one value of an attribute describing the entity to which the updated piece of information relates; storing the received observation as an immutable observation that includes the updated piece of information about the entity having the geographic location and the context, wherein the immutable observation is not modifiable after storage of the immutable observation; matching the immutable observation with a first cluster of observations representing the entity using the context;
the matching comprising;generating a query derived from one or more values of attributes included in the context; identifying one or more candidate clusters of observations responsive to the generated query; generating a respective score for the one or more candidate clusters, wherein the respective score is based on a comparison of one or more attribute values from the context and a corresponding one or more attribute values of the one or more candidate clusters; and matching the immutable observation with a select candidate cluster having the highest respective score; and associating the immutable observation with the first cluster of observations. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A computer storage medium encoded with a computer program, the program comprising instructions that when executed by one or more computers cause the one or more computers to perform operations comprising:
-
receiving a plurality of observations, each observation including an updated piece of information about a respective entity and a context, wherein the context includes at least one value of an attribute describing the respective entity to which the updated piece of information relates, the plurality of observations including a first observation and a second observation; storing the plurality of received observations as a plurality of immutable observations that includes the updated piece of information about the entity having the geographic location and the context, wherein each immutable observation is not modifiable after storage of the immutable observation; clustering each of the immutable observations based on their contexts into a first cluster of observations, wherein the first cluster is regarded as representing a first entity; receiving additional observations, each additional observation including an updated piece of information about a respective entity and a context, wherein the context includes at least one value of an attribute describing the respective entity to which the updated piece of information relates; storing the plurality of received additional observations as a plurality of immutable additional observations that includes the updated piece of information about the entity having the geographic location and the context, wherein each immutable additional observation is not modifiable after storage of the immutable additional observation; generating a first score for the first cluster, wherein the first score is based on a comparison of the at least one attribute value from the context of the immutable additional observations and a corresponding at least one attribute value of the first cluster; generating a second score for the second cluster, wherein the second score is based on a comparison of the at least one attribute value from the context of the immutable additional observations and a corresponding at least one attribute value of the second cluster; matching the immutable observations with the first cluster if the first score is higher than the second score; and matching the received additional observations with the second cluster if the second score is higher than the first score.
-
-
21. A system comprising one or more computers and one or more storage devices storing instructions that, when executed by the one or more computers, cause the one or more computers to perform operations comprising:
-
receiving an observation, the observation including an updated piece of information about an entity and a context, wherein the context includes at least one value of an attribute describing the entity to which the updated piece of information relates; storing the received observation as an immutable observation that includes the updated piece of information about the entity having the geographic location and the context, wherein the immutable observation is not modifiable after storage of the immutable observation; matching the immutable observation with a first cluster of observations representing the entity using the context, the matching comprising; generating a query derived from one or more values of attributes included in the context; identifying one or more candidate clusters of observations responsive to the generated query; generating a respective score for the one or more candidate clusters, wherein the respective score is based on a comparison of one or more attribute values from the context and a corresponding one or more attribute values of the one or more candidate clusters; and matching the immutable observation with a select candidate cluster having the highest respective score; and associating the immutable observation with the first cluster of observations. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29)
-
-
30. A system comprising one or more computers and one or more storage devices storing instructions that, when executed by the one or more computers, cause the one or more computers to perform operations comprising:
-
receiving a plurality of observations, each observation including an updated piece of information about a respective entity and a context, wherein the context includes at least one value of an attribute describing the respective entity to which the updated piece of information relates, the plurality of observations including a first observation and a second observation; storing the plurality of received observations as a plurality of immutable observations that includes the updated piece of information about the entity having the geographic location and the context, wherein each immutable observation is not modifiable after storage of the immutable observation; clustering each of the immutable observations based on their contexts into a first cluster of observations, wherein the first cluster is regarded as representing a first entity; receiving additional observations, each additional observation including an updated piece of information about a respective entity and a context, wherein the context includes at least one value of an attribute describing the respective entity to which the updated piece of information relates; storing the plurality of received additional observations as a plurality of immutable additional observations that includes the updated piece of information about the entity having the geographic location and the context, wherein each immutable additional observation is not modifiable after storage of the immutable additional observation; generating a first score for the first cluster, wherein the first score is based on a comparison of the at least one attribute value from the context of the immutable additional observations and a corresponding at least one attribute value of the first cluster; generating a second score for the second cluster, wherein the second score is based on a comparison of the at least one attribute value from the context of the immutable additional observations and a corresponding at least one attribute value of the second cluster; matching the immutable additional observations with the first cluster if the first score is higher than the second score; and matching the immutable additional observations with the second cluster if the second score is higher than the first score.
-
Specification