×

Method and system for creating and updating an entity name alias table

  • US 9,449,056 B1
  • Filed: 11/01/2012
  • Issued: 09/20/2016
  • Est. Priority Date: 11/01/2012
  • Status: Active Grant
First Claim
Patent Images

1. A computing system implemented method for creating an entity name alias table comprising the following, which when executed individually or collectively by any set of one or more processors perform a process including:

  • obtaining initial identification data including data representing known entity names associated with one or more entities;

    defining a search time window;

    obtaining historical entity name search data indicating entity name searches entered by a searching party in the defined search time window;

    analyzing the historical entity name search data to identify a pair of potentially related entity name searches, each pair of potentially related entity name searches including two associated entity names, wherein analyzing the historical entity name search data to identify a pair of potentially related entity name searches includes;

    transforming at least part of the historical entity name search data into entity name search data strings representing entity names associated with entity name searches;

    determining a raw string distance between entity name search data strings;

    determining a normalized string distance between entity name search data strings, wherein the raw string distance between a first entity name search data string and a second entity name search data string is determined as the number of characters in the first entity name search data string that are different from the characters in the second entity name search data string, wherein the normalized string distance between the first and second entity name search data strings is equal to the raw string distance between the first and second entity name search data strings divided by the square root of the length of the first entity name search data string multiplied by the length of the second entity name search data string;

    defining a threshold normalized string distance;

    identifying a pair of entity name searches as potentially related entity name searches if the entity name search data strings representing the entity names of the pair of entity name searches have a normalized string distance less than the threshold normalized string distance;

    identifying a matched known entity name in the initial identification data that matches a first entity name of the two associated entity names; and

    adding the second entity name of the two associated entity names to an entity name alias table associated with the matched known entity name.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×