Scoring unfielded personal names without prior parsing
First Claim
1. A method of determining a similarity between a name phrase and a comparison name phrase, the method comprising:
- for each name in the name phrase, scoring the name by a computing processor, wherein the scoring is based on field frequency of the name in a name database, wherein the field frequency indicates at least one of a given name frequency and a surname frequency in the database;
using the scoring, by the computing processor, to determine a transition from a given name to a surname in the name phrase, comprising;
calculating, by the computing processor, a transition score for each name in the name phrase for which a preceding name exists, wherein the transition score is calculated using the name and the preceding name, wherein the preceding name precedes the name in an order of names in the name phrase; and
using the transition score, by the computing processor, to determine the primary surname in the name phrase, wherein the transition score indicates a transition from given names in the name phrase to the primary surname in the name phrase;
determining, by the computing processor, a primary given name in the name phrase based on the scoring and the transition;
determining, by the computing processor, a primary surname in the name phrase based on the scoring and the transition; and
using the primary given name and primary surname, by the computing processor, to determine a similarity between the name phrase and a comparison name phrase, wherein the comparison name phrase comprises a comparison given name and a comparison surname.
1 Assignment
0 Petitions
Accused Products
Abstract
A system for determining a similarity between a name phrase and a comparison name phrase, for each name in the name phrase, scores the name. The scoring is based on the field frequency of the name in a name database, where the field frequency indicates a given name frequency and/or a surname frequency in the database. The system uses the scoring to determine a transition from a given name to a surname in the name phrase. The system determines a primary given name and a primary surname in the name phrase based on the scoring and the transition. The system uses the primary given name and primary surname to determine a similarity between the name phrase and a comparison name phrase, where the comparison name phrase comprises a comparison given name and a comparison surname.
18 Citations
7 Claims
-
1. A method of determining a similarity between a name phrase and a comparison name phrase, the method comprising:
-
for each name in the name phrase, scoring the name by a computing processor, wherein the scoring is based on field frequency of the name in a name database, wherein the field frequency indicates at least one of a given name frequency and a surname frequency in the database; using the scoring, by the computing processor, to determine a transition from a given name to a surname in the name phrase, comprising; calculating, by the computing processor, a transition score for each name in the name phrase for which a preceding name exists, wherein the transition score is calculated using the name and the preceding name, wherein the preceding name precedes the name in an order of names in the name phrase; and using the transition score, by the computing processor, to determine the primary surname in the name phrase, wherein the transition score indicates a transition from given names in the name phrase to the primary surname in the name phrase; determining, by the computing processor, a primary given name in the name phrase based on the scoring and the transition; determining, by the computing processor, a primary surname in the name phrase based on the scoring and the transition; and using the primary given name and primary surname, by the computing processor, to determine a similarity between the name phrase and a comparison name phrase, wherein the comparison name phrase comprises a comparison given name and a comparison surname. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
Specification