Generating address term synonyms
First Claim
1. A method performed by one or more computers, the method comprising:
- determining that a plurality of addresses cannot be geocoded by a geocoding system, wherein each address includes a plurality of terms;
generating a plurality of variants of the addresses that can be geocoded by the geocoding system, wherein each variant of a respective address lacks a removed term included in the respective address;
receiving a plurality of name terms for each variant provided by the geocoding system;
associating each removed term with name terms received for all variants that lack the removed term, including determining, for each associated name term of each removed term, a count of the number of variants for which the geocoding system provided the name term;
determining, for each associated name term of each removed term, whether the name term is an address term synonym for the removed term based at least in part on the count of the number of variants.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating address component synonyms. In one aspect, a method includes determining that a plurality of addresses cannot be geocoded by a geocoding system. Variants of the addresses that can be geocoded by the geocoding system are generated, wherein each variant of a respective address lacks a removed term. Name terms for each variant are provided by the geocoding system. Each removed term is associated with name terms received for all variants that lack the removed term, including determining, for each associated name term of each removed term, a count of the number of variants for which the geocoding system provided the name term. Whether a name term is an address term synonym for a removed term is determined based at least in part on the count of the number of variants.
-
Citations
30 Claims
-
1. A method performed by one or more computers, the method comprising:
-
determining that a plurality of addresses cannot be geocoded by a geocoding system, wherein each address includes a plurality of terms; generating a plurality of variants of the addresses that can be geocoded by the geocoding system, wherein each variant of a respective address lacks a removed term included in the respective address; receiving a plurality of name terms for each variant provided by the geocoding system; associating each removed term with name terms received for all variants that lack the removed term, including determining, for each associated name term of each removed term, a count of the number of variants for which the geocoding system provided the name term; determining, for each associated name term of each removed term, whether the name term is an address term synonym for the removed term based at least in part on the count of the number of variants. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system of one or more computers configured to perform operations comprising:
-
determining that a plurality of addresses cannot be geocoded by a geocoding system, wherein each address includes a plurality of terms; generating a plurality of variants of the addresses that can be geocoded by the geocoding system, wherein each variant of a respective address lacks a removed term included in the respective address; receiving a plurality of name terms for each variant provided by the geocoding system; associating each removed term with name terms received for all variants that lack the removed term, including determining, for each associated name term of each removed term, a count of the number of variants for which the geocoding system provided the name term; determining, for each associated name term of each removed term, whether the name term is an address term synonym for the removed term based at least in part on the count of the number of variants. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A non-transitory computer storage medium encoded with a computer program, the program comprising instructions that when executed by data processing apparatus cause the data processing apparatus to perform operations comprising:
-
determining that a plurality of addresses cannot be geocoded by a geocoding system, wherein each address includes a plurality of terms; generating a plurality of variants of the addresses that can be geocoded by the geocoding system, wherein each variant of a respective address lacks a removed term included in the respective address; receiving a plurality of name terms for each variant provided by the geocoding system; associating each removed term with name terms received for all variants that lack the removed term, including determining, for each associated name term of each removed term, a count of the number of variants for which the geocoding system provided the name term; determining, for each associated name term of each removed term, whether the name term is an address term synonym for the removed term based at least in part on the count of the number of variants. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29, 30)
-
Specification