Detecting Spatial Outliers in a Location Entity Dataset
First Claim
Patent Images
1. A method comprising:
- arranging, by a computing device, a plurality of location entities into a hierarchy of location descriptors; and
determining, by the computing device, whether one of the location entities is a spatial outlier based at least in part on presence of one or more other location entities within a predetermined distance of the one location entity, the other location entities and the one location entity sharing a location descriptor.
2 Assignments
0 Petitions
Accused Products
Abstract
Disclosed herein are one or more embodiments that arrange a plurality of location entities into a hierarchy of location descriptors. One or more of the disclosed embodiments may determine whether one of the location entities is a spatial outlier based at least in part on presence of one or more other location entities within a predetermined distance of the one location entity. Also, the other location entities and the one location entity may share a location descriptor.
109 Citations
20 Claims
-
1. A method comprising:
-
arranging, by a computing device, a plurality of location entities into a hierarchy of location descriptors; and determining, by the computing device, whether one of the location entities is a spatial outlier based at least in part on presence of one or more other location entities within a predetermined distance of the one location entity, the other location entities and the one location entity sharing a location descriptor. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. An article of manufacture comprising:
-
a storage medium; and a plurality of executable instructions stored on the storage medium which, when executed by a computing device, perform operations including; arranging a plurality of location entities into a hierarchy of location descriptors, the location entities including yellow page entities and point of interest entities; and determining whether one of the yellow page entities is a spatial outlier based at least in part on presence of at least one of the point of interest entities within a predetermined distance of the one yellow page entity, the one point of interest entity and the one yellow page entity sharing a location descriptor. - View Dependent Claims (16, 17, 18)
-
-
19. A system comprising:
-
a processor; and logic configured to be executed by the processor to perform operations including; segmenting address fields of a plurality of location entities into location descriptors, the segmenting including either or both of; segmenting based on commas and/or other characters indicating a separation between two or more terms; and segmenting based at least in part on one or more frameworks and/or dictionaries; arranging the location entities into a hierarchy of location descriptors, the arranging including; inserting a descriptor of each location entity derived from an address field of each location entity as a leaf node in a tree of location descriptors; determining that at least two leaf nodes refer to a same instance if the nodes share the same descriptor and if the same descriptor is shared by a number of descendant nodes of a same parent, the number exceeding a first threshold; and combining the at least two leaf nodes, the combining including retaining one of the leaf node at a lowest level in the hierarchy in which a number of occurrences of the at least two leaf nodes exceeds a second threshold; determining whether one of the location entities is a spatial outlier based at least in part on presence of one or more other location entities within a predetermined distance of the one location entity, the other location entities and the one location entity sharing a location descriptor; and in response to determining that the one location entity is a spatial outlier, deleting the one location entity. - View Dependent Claims (20)
-
Specification