×

Natural language parsers to normalize addresses for geocoding

  • US 8,868,479 B2
  • Filed: 09/29/2008
  • Issued: 10/21/2014
  • Est. Priority Date: 09/28/2007
  • Status: Active Grant
First Claim
Patent Images

1. A method for normalizing an input address comprising the steps of:

  • under control of a computer system comprising computer hardware;

    receiving an input address indicative of a physical address;

    parsing the input address into components;

    classifying each component with a preliminary address field classification according to;

    one or more predetermined regular expressions and a lexicon of known tokens, thereby generating classified components, wherein said classifying each component is performed by matching each component to the one or more predetermined regular expressions only when there is no match between that component and the lexicon of known tokens;

    determining which of at least one of a plurality of countries and jurisdictions corresponds to the address input;

    selecting a predictive model corresponding to the address input from a plurality of predictive models, each of the plurality of predictive models being an automated country-specific natural language parser uniquely defined for a corresponding one of the plurality of countries and jurisdictions, the selected predictive model comprising a graph having address field nodes and edges connecting the address field nodes, each address field node comprising an address field and a corresponding set of one or more address field classifications each assigned a first probability value, and each edge assigned a second probability value; and

    executing the selected predictive model to update the preliminary address field classification of at least some of the classified components with one of the address fields in the graph based at least partly on the first and second probability values of the address field nodes and the edges that correspond to the preliminary address field classification of each component.

View all claims
  • 9 Assignments
Timeline View
Assignment View
    ×
    ×