Systems and methods for creating standardized street addresses from raw address data
First Claim
1. A computer-based method for processing raw address data into a standardized street address using a single pass of the raw address data through the process using a computer device coupled to a database, the method comprising:
- receiving, at the computer device, raw data from a first party, the raw data including a raw street address;
retrieving a country code from the raw address data;
determining a standardized street address format based on the retrieved country code;
formatting the standardized street address using the determined standardized street address format formatting the raw street address using the computer device, formatting includes inserting a space character between characters of different types;
separating the formatted street address into at least one token using the computer device, each token including one or more characters indicating a portion of the raw street address;
searching for the at least one token in a look-up table, the search limited by the country code;
replacing one of the at least one tokens with a standard token from a lookup table;
operating on the replacement standard token using one or more rules that uses a context of the replacement standard token and an order of appearance of the replacement standard token in the raw street address to correct an incorrect assignment of a street name to the standard token; and
creating a standardized street address from the at least one token based on one or more conditions using a single pass of the raw address data through the process, wherein the one or more conditions facilitate determining at least one of inclusion, exclusion, and position of the at least one token in the standardized street address, the one or more conditions are based on the at least one of the at least one token, the at least one token and an adjacent token, a token type of at least one of the at least one token and an adjacent token, and a context of at least one of the at least one token and an adjacent token.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and system for processing raw address data using a computer device coupled to a database are provided. The method includes receiving, at the computer device, raw data from a first party, the raw data including a raw street address. The method includes formatting the raw street address using the computer device and separating the formatted street address into at least one token using the computer device wherein each token includes one or more characters indicating a portion of the raw street address. The method further includes creating a standardized street address from the at least one token based on one or more conditions, wherein the one or more conditions facilitate determining at least one of inclusion, exclusion, and position of the at least one token in the standardized street address.
-
Citations
8 Claims
-
1. A computer-based method for processing raw address data into a standardized street address using a single pass of the raw address data through the process using a computer device coupled to a database, the method comprising:
-
receiving, at the computer device, raw data from a first party, the raw data including a raw street address; retrieving a country code from the raw address data; determining a standardized street address format based on the retrieved country code; formatting the standardized street address using the determined standardized street address format formatting the raw street address using the computer device, formatting includes inserting a space character between characters of different types; separating the formatted street address into at least one token using the computer device, each token including one or more characters indicating a portion of the raw street address; searching for the at least one token in a look-up table, the search limited by the country code; replacing one of the at least one tokens with a standard token from a lookup table; operating on the replacement standard token using one or more rules that uses a context of the replacement standard token and an order of appearance of the replacement standard token in the raw street address to correct an incorrect assignment of a street name to the standard token; and creating a standardized street address from the at least one token based on one or more conditions using a single pass of the raw address data through the process, wherein the one or more conditions facilitate determining at least one of inclusion, exclusion, and position of the at least one token in the standardized street address, the one or more conditions are based on the at least one of the at least one token, the at least one token and an adjacent token, a token type of at least one of the at least one token and an adjacent token, and a context of at least one of the at least one token and an adjacent token. - View Dependent Claims (2, 3)
-
-
4. A computer system for processing raw address data into a standardized street address using a single pass of the raw address data through the process, the computer system comprising a memory device and a processor in communication with the memory device, the computer system programmed to:
-
receive raw address data from a first party, the raw address data including a raw street address; retrieve a country code from the raw address data; determine a standardized street address format based on the retrieved country code; and format the standardized street address using the determined standardized street address format and format the raw street address by inserting a space character between different types of characters, the different types of characters including at least one of a numeric type character, an alpha type character, and a non alpha-numeric type character, separate the formatted street address into at least one token, each token including one or more characters indicating a portion of the raw street address; search for the at least one token in a look-up table, the search limited by the country code; replace one of the at least one tokens with a standard token from a lookup table;
operate on the replacement standard token using one or more rules that uses a context of the replacement standard token and an order of appearance of the replacement standard token in the raw street address to correct an incorrect assignment of a street name to the standard token; andcreate a standardized street address from the at least one token based on one or more conditions, wherein the standardized street address includes data from the raw address data in consistent positions and excludes data that is inconsistent with predetermined standardized data, and wherein the one or more conditions facilitate determining at least one of indicative of inclusion, exclusion, and position of the at least one token in the standardized street address, the one or more conditions are based on the at least one of the at least one token, the at least one token and an adjacent token, a token type of at least one of the at least one token and an adjacent token, and a context of at least one of the at least one token and an adjacent token. - View Dependent Claims (5)
-
-
6. One or more non-transitory computer-readable storage media having computer-executable instructions embodied thereon, wherein when executed by at least one processor, the computer-executable instructions cause the processor to:
-
receive raw address data from a first party, the raw address data including a raw street address; retrieve a country code from the raw address data; determine a standardized street address format based on the retrieved country code; and format the standardized street address using the determined standardized street address format and format the raw street address by inserting a space character between different types of characters, the different types of characters including at least one of a numeric type character, an alpha type character, and a non alpha-numeric type character; separate the formatted street address into at least one token, each token including one or more characters indicating a portion of the raw street address; search for the at least one token in a look-up table, the search limited by the country code; replace one of the at least one tokens with a standard token from a lookup table; operate on the replacement standard token using one or more rules that uses a context of the replacement standard token and an order of appearance of the replacement standard token in the raw street address to correct an incorrect assignment of a street name to the standard token; and create a standardized street address from the at least one token based on one or more conditions, wherein the standardized street address includes data from the raw address data in consistent positions and excludes data that is inconsistent with predetermined standardized data, and wherein the one or more conditions facilitate determining at least one of indicative of inclusion, exclusion, and position of the at least one token in the standardized street address, the one or more conditions are based on the at least one of the at least one token, the at least one token and an adjacent token, a token type of at least one of the at least one token and an adjacent token, and a context of at least one of the at least one token and an adjacent token, where the processor processes the raw address data into a standardized street address using a single pass of the raw address data through the process. - View Dependent Claims (7, 8)
-
Specification