System and method for contextual and free format matching of addresses
First Claim
1. A method for matching a first address and a second address using a processor, the first address and the second address each associated with one or more consumers, the method comprising:
- receiving the first address and the second address at the processor;
contextually identifying a first address part of the first address and a second address part of the second address, using the processor, wherein the first address part and the second address part each have an address part type that is alike, and wherein contextually identifying comprises;
deterministically evaluating a first string in the first address to identify the first address part and a second string in the second address to identify the second address part, using the processor; and
extracting first data from the first address and second data from the second address using the processor, based on the address part type of the first address part and the second address part;
normalizing, using the processor, the first address part to produce a first normalized address part and the second address part to produce a second normalized address part, based on a normalization rule;
comparing the first normalized address part and the second normalized address part, using the processor;
calculating a contextual matching score, based on comparing the first normalized address part and the second normalized address part, using the processor;
performing a free format token analysis of the first address and the second address, using the processor;
calculating a free format matching score, based on performing the free format token analysis of the first address and the second address, using the processor;
calculating an address likeness score, based on the contextual matching score and the free format matching score, using the processor; and
transmitting the address likeness score from the processor.
3 Assignments
0 Petitions
Accused Products
Abstract
A system and method for the matching addresses is provided. Addresses may be received from a search engine or other source for purposes of matching. Address parts in the addresses may be contextually identified. Identified address parts, including their associated data, that have address part types that are alike may be compared to one another and a contextual matching score may be calculated and assigned. A free format token analysis of the addresses may also be performed in parallel with, before, or after, the contextual identification, and a free format matching score may be calculated. An address likeness score may be calculated and assigned based on the contextual matching score and the free format matching score.
-
Citations
18 Claims
-
1. A method for matching a first address and a second address using a processor, the first address and the second address each associated with one or more consumers, the method comprising:
-
receiving the first address and the second address at the processor; contextually identifying a first address part of the first address and a second address part of the second address, using the processor, wherein the first address part and the second address part each have an address part type that is alike, and wherein contextually identifying comprises; deterministically evaluating a first string in the first address to identify the first address part and a second string in the second address to identify the second address part, using the processor; and extracting first data from the first address and second data from the second address using the processor, based on the address part type of the first address part and the second address part; normalizing, using the processor, the first address part to produce a first normalized address part and the second address part to produce a second normalized address part, based on a normalization rule; comparing the first normalized address part and the second normalized address part, using the processor; calculating a contextual matching score, based on comparing the first normalized address part and the second normalized address part, using the processor; performing a free format token analysis of the first address and the second address, using the processor; calculating a free format matching score, based on performing the free format token analysis of the first address and the second address, using the processor; calculating an address likeness score, based on the contextual matching score and the free format matching score, using the processor; and transmitting the address likeness score from the processor. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A method for matching an address with a plurality of candidate addresses, using a processor, the address and the plurality of candidate addresses associated with one or more consumers, the method comprising:
-
receiving the address and the plurality of candidate addresses at the processor; identifying an address part of the address and a plurality of candidate address parts of each of the plurality of candidate addresses, based on a contextual identification analysis of the address and the plurality of candidate addresses, using the processor, wherein the address part and the plurality of candidate address parts each have an address part type that is alike, and wherein identifying comprises; deterministically evaluating a first string in the address to identify the address part and a plurality of strings in each of the plurality of candidate addresses to identify the plurality of candidate address parts, using the processor; and extracting the address data from the address and the plurality of candidate address data from the plurality of candidate addresses using the processor, based on the address part type; comparing address data with a plurality of candidate address data, using the processor, wherein the address data and the plurality of candidate address data is associated with the address part type; calculating a contextual matching score, based on comparing the address data with the plurality of candidate address data, using the processor; performing a free format token analysis of the address and the plurality of candidate addresses, using the processor; calculating a free format matching score, based on performing the free format token analysis, using the processor; calculating an address likeness score, based on the contextual matching score and the free format matching score, using the processor; and transmitting one or more matching addresses from the plurality of candidate addresses from the processor, based on the address likeness score. - View Dependent Claims (13, 14, 15, 16, 17, 18)
-
Specification