Detecting spatial outliers in a location entity dataset
Abstract
Disclosed herein are one or more embodiments that arrange a plurality of location entities into a hierarchy of location descriptors. One or more of the disclosed embodiments may determine whether one of the location entities is a spatial outlier based at least in part on presence of one or more other location entities within a predetermined distance of the one location entity. Also, the other location entities and the one location entity may share a location descriptor.
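By way of illustration only, the neighbor-count test described above might be sketched as follows. The names `LocationEntity`, `haversine_km`, and `is_spatial_outlier` are hypothetical and do not come from the disclosure. The sketch also assumes that an entity is treated as an outlier when the number of nearby entities sharing its descriptor does not exceed the threshold, and that distance is measured as great-circle distance; the disclosure fixes neither choice.

```python
import math
from dataclasses import dataclass

EARTH_RADIUS_KM = 6371.0  # mean Earth radius used by the haversine formula


@dataclass(frozen=True)
class LocationEntity:
    # Hypothetical record type for this sketch only.
    name: str
    lat: float
    lon: float
    descriptors: tuple  # e.g. ("Seattle", "Washington")


def haversine_km(a, b):
    """Great-circle distance in kilometers between two entities."""
    phi1, phi2 = math.radians(a.lat), math.radians(b.lat)
    dphi = phi2 - phi1
    dlam = math.radians(b.lon - a.lon)
    h = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(h))


def is_spatial_outlier(entity, entities, descriptor, max_km, threshold):
    """Assumed rule: outlier when the neighbor count does NOT exceed threshold."""
    neighbors = sum(
        1
        for other in entities
        if other is not entity
        and descriptor in other.descriptors
        and haversine_km(entity, other) <= max_km
    )
    return neighbors <= threshold


# Three entities clustered in Seattle and one tagged "Seattle" but placed far away.
entities = [
    LocationEntity("Cafe A", 47.6062, -122.3321, ("Seattle", "Washington")),
    LocationEntity("Cafe B", 47.6097, -122.3331, ("Seattle", "Washington")),
    LocationEntity("Cafe C", 47.6205, -122.3493, ("Seattle", "Washington")),
    LocationEntity("Cafe D", 40.7128, -74.0060, ("Seattle", "Washington")),
]
outliers = [e for e in entities
            if is_spatial_outlier(e, entities, "Seattle", 10.0, 1)]
```

With a threshold of zero, the same function reduces to the "presence of one or more other location entities" test: a single qualifying neighbor is enough to keep an entity.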
20 Claims
1. A method comprising:
arranging, by a computing device, a plurality of location entities into a hierarchy of location descriptors, wherein individual ones of the location entities comprise global positioning system (GPS) coordinates captured by a GPS device;
storing the plurality of location entities on a location search server, the location search server being remotely located from the computing device;
determining, by the computing device, whether one location entity of the plurality of location entities is a spatial outlier based on determining whether a number of other location entities within a predetermined distance of the one location entity exceeds a threshold number, the other location entities and the one location entity sharing a location descriptor;
in response to determining that the one location entity is the spatial outlier, deleting, by the computing device, the one location entity; and
sending an update to the location search server such that the plurality of location entities on the location search server reflects the deletion of the spatial outlier.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13
14. An article of manufacture comprising:
a storage device; and
a plurality of executable instructions stored on the storage device which, when executed by a computing device, perform operations including:
arranging a plurality of location entities into a hierarchy of location descriptors, the plurality of location entities including yellow page entities and point of interest entities, wherein individual ones of the location entities comprise global positioning system (GPS) coordinates captured by a GPS device;
storing the plurality of location entities on a location search server, the location search server being remotely located from the computing device;
determining whether one of the yellow page entities is a spatial outlier based at least in part on presence of at least one of the point of interest entities within a predetermined distance of the one of the yellow page entities, the at least one of the point of interest entities and the one of the yellow page entities sharing a location descriptor;
in response to determining that the one of the yellow page entities is the spatial outlier, deleting the one of the yellow page entities; and
sending an update to the location search server such that the plurality of location entities on the location search server reflects the deletion of the spatial outlier.
Dependent claims: 15, 16, 17, 18
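As a non-limiting illustration of the determination recited in claim 14, a yellow page entity might be checked against nearby point of interest entities roughly as follows. The `Entity` type, the `kind` labels, and the function names are hypothetical. The sketch assumes a yellow page entity is treated as an outlier when no point of interest entity sharing its descriptor lies within the distance, and it substitutes an equirectangular distance approximation for whatever distance measure an actual embodiment would use.

```python
import math
from typing import NamedTuple


class Entity(NamedTuple):
    # Hypothetical record type; "yellow_page" and "poi" are assumed labels.
    kind: str
    name: str
    lat: float
    lon: float
    descriptor: str


def approx_km(a, b):
    """Equirectangular distance approximation, adequate over short ranges."""
    mean_lat = math.radians((a.lat + b.lat) / 2)
    dx = math.radians(b.lon - a.lon) * math.cos(mean_lat)
    dy = math.radians(b.lat - a.lat)
    return 6371.0 * math.hypot(dx, dy)


def yellow_page_outliers(entities, max_km):
    """Return yellow page entities with no same-descriptor POI within max_km."""
    pois = [e for e in entities if e.kind == "poi"]
    outliers = []
    for yp in (e for e in entities if e.kind == "yellow_page"):
        has_poi_nearby = any(
            p.descriptor == yp.descriptor and approx_km(yp, p) <= max_km
            for p in pois
        )
        if not has_poi_nearby:  # assumed direction of the test
            outliers.append(yp)
    return outliers


sample = [
    Entity("poi", "Space Needle", 47.6205, -122.3493, "Seattle"),
    Entity("yellow_page", "Plumber A", 47.6097, -122.3331, "Seattle"),
    Entity("yellow_page", "Plumber B", 40.7128, -74.0060, "Seattle"),
]
flagged = yellow_page_outliers(sample, max_km=5.0)
```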
19. A system comprising:
a processor of a computing device; and
logic configured to be executed by the processor to perform operations including:
sending a plurality of location entities to a location search server for storage on the location search server, wherein individual ones of the location entities comprise global positioning system (GPS) coordinates captured by a GPS device, and wherein the location search server is remotely located from the computing device;
segmenting address fields of a plurality of location entities into location descriptors, the segmenting including either or both of:
segmenting based on commas and/or other characters indicating a separation between two or more terms; and
segmenting based at least in part on one or more frameworks and/or dictionaries;
arranging the plurality of location entities into a hierarchy of location descriptors, the arranging including:
inserting a descriptor of each of the plurality of location entities derived from an address field of each of the plurality of location entities as a leaf node in a tree of location descriptors;
determining that at least two leaf nodes refer to a same instance if the at least two leaf nodes share a same location descriptor and if the same location descriptor is shared by a number of descendant nodes of a same parent, the number exceeding a first threshold number; and
combining the at least two leaf nodes, the combining including retaining one of the at least two leaf nodes at a lowest level in the hierarchy in which a number of occurrences of the at least two leaf nodes exceeds a second threshold number;
determining whether one of the plurality of location entities is a spatial outlier based at least in part on determining whether a number of one or more other location entities within a predetermined distance of the one of the plurality of location entities exceeds a third threshold number, the one or more other location entities and the one of the plurality of location entities sharing a location descriptor;
in response to determining that the one of the plurality of location entities is the spatial outlier, deleting the one of the plurality of location entities; and
sending an update to the location search server such that the plurality of location entities on the location search server reflects the deletion of the spatial outlier.
Dependent claims: 20
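For illustration only, the comma-based segmenting and the insertion of descriptors into a tree recited in claim 19 might look like the sketch below. The function names and the nested-dictionary tree are hypothetical. The sketch assumes address fields list descriptors from narrowest to broadest, so it reverses them to build the hierarchy from the top down, and it treats semicolons as one example of "other characters" indicating a separation. Dictionary-based segmenting and the threshold-based combining of leaf nodes are not shown.

```python
import re


def segment_address(address):
    """Split an address field on commas or semicolons into trimmed descriptors."""
    return [part.strip() for part in re.split(r"[,;]", address) if part.strip()]


def insert_entity(tree, address):
    """Insert descriptors broadest-first; the narrowest one becomes the leaf."""
    node = tree
    for descriptor in reversed(segment_address(address)):
        node = node.setdefault(descriptor, {})
    return tree


tree = {}
for addr in [
    "Pike Place Market, Seattle, WA",
    "Space Needle, Seattle, WA",
    "Riverfront Park, Spokane, WA",
]:
    insert_entity(tree, addr)
```

Entities that share a branch of the resulting tree share a location descriptor, which is the grouping the spatial outlier determination relies on.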
Specification