Searching and matching of data
First Claim
Patent Images
1. A method implemented by a computer comprising at least one processor and at least one memory for facilitating searching and matching of data, comprising:
- receiving an input data string including one or more ideographic elements;
converting the input data string to a Latin-based input data string at least in part by deconstructing the one or more ideographic elements into constituent radicals or strokes and cross-referencing a constituent radical or stroke to a pre-defined Latin character so as to generate one or more sets of Latin characters;
generating one or more input keys based on the Latin-based input data string, including replacing any element in the Latin-based input data string that has a corresponding sounds-alike element to generate a phonetic key;
searching, using the one or more input keys, a reference database stored in a memory device for one or more candidate records, wherein similar records in the database are indexed by a common lookup key; and
if the one or more candidate records are found, determining a match score of the one or more candidate records.
1 Assignment
0 Petitions
Accused Products
Abstract
Described herein is a technology for facilitating searching and matching of data. In some implementations, an input data string, including one or more ideographic elements, is received and converted to a Latin-based input data string. One or more input keys may be generated based on the Latin-based input data string. A reference database may be searched for one or more candidate records indexed by at least one of the one or more input keys. If the one or more candidate records are found, a match score is determined for the one or more candidate records.
-
Citations
29 Claims
-
1. A method implemented by a computer comprising at least one processor and at least one memory for facilitating searching and matching of data, comprising:
-
receiving an input data string including one or more ideographic elements; converting the input data string to a Latin-based input data string at least in part by deconstructing the one or more ideographic elements into constituent radicals or strokes and cross-referencing a constituent radical or stroke to a pre-defined Latin character so as to generate one or more sets of Latin characters; generating one or more input keys based on the Latin-based input data string, including replacing any element in the Latin-based input data string that has a corresponding sounds-alike element to generate a phonetic key; searching, using the one or more input keys, a reference database stored in a memory device for one or more candidate records, wherein similar records in the database are indexed by a common lookup key; and if the one or more candidate records are found, determining a match score of the one or more candidate records. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27)
-
-
28. A non-transitory computer usable medium having a computer readable program code tangibly embodied therein, the computer readable program code adapted to be executed by a processor to implement a method for facilitating searching and matching of data, comprising:
-
receiving an input data string including one or more ideographic elements; converting the input data string to a Latin-based input data string at least in part by deconstructing the one or more ideographic elements into constituent radicals or strokes and cross-referencing a constituent radical or stroke to a pre-defined Latin character so as to generate one or more sets of Latin characters; generating one or more input keys based on the Latin-based input data string, including replacing any element in the Latin-based input data string that has a corresponding sounds-alike element to generate a phonetic key; searching, using the one or more input keys, a reference database for one or more candidate records, wherein similar records in the database are indexed by a common lookup key; and if the one or more candidate records are found, determining a match score of the one or more candidate records.
-
-
29. A system for facilitating searching and matching of data, comprising:
-
a non-transitory memory device for storing computer readable program code; and a processor in communication with the memory device, the processor being operative with the computer readable program code to; receive an input data string including one or more ideographic elements; convert the input data string to a Latin-based input data string at least in part by deconstructing the one or more ideographic elements into constituent radicals or strokes and cross-referencing a constituent radical or stroke to a pre-defined Latin character so as to generate one or more sets of Latin characters; generate one or more input keys based on the Latin-based input data string, including replacing any element in the Latin-based input data string that has a corresponding sounds-alike element to generate a phonetic key; search, using the one or more input keys, a reference database for one or more candidate records, wherein similar records in the database are indexed by a common lookup key; and if the one or more candidate records are found, determining a match score of the one or more candidate records.
-
Specification