System and method for searching and matching data having ideogrammatic content
First Claim
1. A computerized method of searching and matching input data to stored data, the method comprising:
- receiving input data comprising a search string having a plurality of elements, at least some of the elements forming part of an ideogrammatic writing system;
converting a subset of the plurality of elements to a set of terms using at least one method selected from the group consisting of polylogogrammatic semantic disambiguation, hanzee acronym expansion, kanji acronym expansion, and business word recognition;
generating an optimized plurality of keys from the set of terms;
retrieving stored data based on the optimized keys corresponding to most likely candidates for match to the input data; and
selecting a best match from the match candidates.
4 Assignments
0 Petitions
Accused Products
Abstract
A method of searching and matching non-phonetic or ideogrammatic input data to stored data, including the steps of receiving input data comprising a search string having a plurality of elements, converting a subset of the elements into a set of terms, generating an optimized plurality of keys from the set of terms, retrieving stored data based on the optimized keys corresponding to most likely candidates for match, and selecting a best match from the plurality of candidates. At least some of the ideogrammatic elements form part of an ideogrammatic writing system. The method may also include dividing the search string into a plurality of overlapping sub-segments and identifying sub-segments having inferred semantic meaning as well as sub-segments having no semantic meaning in the ideogrammatic writing system, and using the various sub-segments to generate the optimized keys.
60 Citations
20 Claims
-
1. A computerized method of searching and matching input data to stored data, the method comprising:
-
receiving input data comprising a search string having a plurality of elements, at least some of the elements forming part of an ideogrammatic writing system;
converting a subset of the plurality of elements to a set of terms using at least one method selected from the group consisting of polylogogrammatic semantic disambiguation, hanzee acronym expansion, kanji acronym expansion, and business word recognition;
generating an optimized plurality of keys from the set of terms;
retrieving stored data based on the optimized keys corresponding to most likely candidates for match to the input data; and
selecting a best match from the match candidates. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A computer readable medium having instructions for performing a method of searching and matching input data to stored data, the method comprising:
-
receiving input data comprising a search string having a plurality of elements, at least some of the elements forming part of an ideogrammatic writing system;
converting a subset of the plurality of elements to a set of terms using at least one method selected from the group consisting of polylogogrammatic semantic disambiguation, hanzee acronym expansion, kanji acronym expansion, and business word recognition;
generating an optimized plurality of keys from the set of terms;
retrieving stored data based on the optimized keys corresponding to most likely candidates for match to the input data; and
selecting a best match from the match candidates.
-
Specification