System and method for adaptive multi-cultural searching and matching of personal names
First Claim
1. A computer-implemented method of comparing input names and a stored names, the method comprising:
- accessing an input name of a particular human culture;
accessing a stored name of the particular human culture;
selecting a first linguisitic algorithm that is based on the stored name and other names in the particular human culture, wherein the first linguistic algorithm is based on one or more linguistic features including one or more of phonological features, orthographic features, semantic features, syntactic features, n-gram based features, the number of elements in names, the types of elements in names, variations in spelling of a name, accounting for exclusion of expected information in a name, positional information of names, inclusion of additional information in a name, and whether surnames match;
selecting a second linguistic algorithm, different from the first linguistic algorithm, that is based on the stored name and other names in the particular human culture, wherein the second linguistic algorithm is based on one or more linguistic features including one or more of phonological features, orthographic features, semantic features, syntactic features, n-gram based features, the number of elements in names, the types of elements in names, variations in spelling of a name, accounting for exclusion of expected information in a name, positional information of names, inclusion of additional information in a name, and whether surnames match, wherein the first linguistic algorithm and the second linguistic algorithm are based on different linguistic features of the particular human culture;
comparing the input name and the stored name using the first linguistic algorithm;
comparing the input name and the stored name using the second linguistic algorithm; and
providing a non-binary indication of whether the input name matches the stored name base on (i) the comparing of the input name and the stored name using the first linguistic algorithm and (ii) the comparing of the input name and the stored name using the second linguistic algorithm.
5 Assignments
0 Petitions
Accused Products
Abstract
An automated name searching system incorporates an automatic name classifier and a multi-path architecture in which different algorithms are applied based on cultural identity of the query name. The name classifier operates with a preemptive list, analysis of morphological elements, length, and linguistic rules. A name regularizer produces a character based computational representation of the name. A pronunciation equivalent representation such as an IPA language representation, and language specific rules to generate name searching keys, are used in a first pass to eliminate database entries which are obviously not matches for the query name. The methods can also be implemented as a callable set of library routines including an intelligent preprocessor and a name evaluator that produces a score comparing a query name and database name, based on a variety of user-adjustable parameters. The user-controlled parameters permit tuning of the search methodologies for specific custom applications.
-
Citations
43 Claims
-
1. A computer-implemented method of comparing input names and a stored names, the method comprising:
-
accessing an input name of a particular human culture;
accessing a stored name of the particular human culture;
selecting a first linguisitic algorithm that is based on the stored name and other names in the particular human culture, wherein the first linguistic algorithm is based on one or more linguistic features including one or more of phonological features, orthographic features, semantic features, syntactic features, n-gram based features, the number of elements in names, the types of elements in names, variations in spelling of a name, accounting for exclusion of expected information in a name, positional information of names, inclusion of additional information in a name, and whether surnames match;
selecting a second linguistic algorithm, different from the first linguistic algorithm, that is based on the stored name and other names in the particular human culture, wherein the second linguistic algorithm is based on one or more linguistic features including one or more of phonological features, orthographic features, semantic features, syntactic features, n-gram based features, the number of elements in names, the types of elements in names, variations in spelling of a name, accounting for exclusion of expected information in a name, positional information of names, inclusion of additional information in a name, and whether surnames match, wherein the first linguistic algorithm and the second linguistic algorithm are based on different linguistic features of the particular human culture;
comparing the input name and the stored name using the first linguistic algorithm;
comparing the input name and the stored name using the second linguistic algorithm; and
providing a non-binary indication of whether the input name matches the stored name base on (i) the comparing of the input name and the stored name using the first linguistic algorithm and (ii) the comparing of the input name and the stored name using the second linguistic algorithm. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42)
-
-
43. An apparatus comprising a computer readable medium having instructions stored thereon that when executed by a machine result in at least the following:
-
accessing an input name of a particular human culture;
accessing a stored name of the particular human culture;
selecting a first linguistic algorithm that is based on the stored name and other names in the particular human culture, wherein the first linguistic algorithm is based on one or more linguistic features including one or more of phonological features, orthographic features, semantic features, syntactic features, n-gram based features, the number of elements in names, the types of elements in names, variations in spelling of a name, accounting for exclusion of expected information in a name, positional information of names, inclusion of additional information in a name, and whether surnames match;
selecting a second linguistic algorithm, different from the first linguistic algorithm, that is based on the stored name and other names in the particular human culture, wherein the second linguistic algorithm is based on one or more linguistic features including one or more of phonological features, orthographic features, semantic features, syntactic features, n-gram based features, the number of elements in names, the types of elements in names, variations in spelling of a name, accounting for exclusion of expected information in a name, positional information of names, inclusion of additional information in a name, and whether surnames match, wherein the first linguistic algorithm and the second linguistic algorithm are based on different linguistic features of the particular human culture;
comparing the input name and the stored name using the first linguistic algorithm;
comparing the input name and the stored name using the second linguistic algorithm; and
providing a non-binary indication of whether the input name matches the stored name based on (i) the comparing of the input name and the stored name using the first linguistic algorithm and (ii) the comparing of the input name and the stored name using the second linguistic algorithm.
-
Specification