SYSTEM AND METHOD FOR UTILIZING MULTIPLE ENCODINGS TO IDENTIFY SIMILAR LANGUAGE CHARACTERS
First Claim
1. A method for identifying a similarity between language characters, comprising the steps of:
- receiving a pair of language characters at a language character match engine, wherein the language character match engine executes on one or more microprocessor,wherein each language character has a unique structure, andwherein the language character match engine is adapted to receive encoding configuration information from each of a plurality of encoding components to encode the pair of language characters based on the unique structure of each language character;
encoding, based on the unique structure of each language character, the pair of language characters according to each of the plurality of encoding components to generate a pair of string identification characters for each of the plurality of encoding components, and wherein each string of identification characters represents the unique structure of one of the pair of language characters;
comparing the pair of string identification characters to one another to generate a similarity score for each pair of string identification characters;
combining the similarity scores to generate a composite similarity score; and
determining, based on the composite similarity score, a similarity between the pair of language characters.
2 Assignments
0 Petitions
Accused Products
Abstract
Described herein are systems and methods for identifying the similarity between language characters. As described herein, a pair of language characters is received at a language character match engine. The language character match engine is adapted to receive encoding configuration information from each of a plurality of encoding components, and is adapted to encode the pair of language characters based on the unique structure of each language character to generate a pair of string identification characters for each encoding component. Thereafter, each pair of string identification characters is compared to one another to generate a similarity score, and the similarity score for each pair of string identification characters is combined to create a composite similarity score. The composite similarity score represents a similarity between the pair of language characters, and is used to identify the similarity between the pair of language characters.
44 Citations
20 Claims
-
1. A method for identifying a similarity between language characters, comprising the steps of:
-
receiving a pair of language characters at a language character match engine, wherein the language character match engine executes on one or more microprocessor, wherein each language character has a unique structure, and wherein the language character match engine is adapted to receive encoding configuration information from each of a plurality of encoding components to encode the pair of language characters based on the unique structure of each language character; encoding, based on the unique structure of each language character, the pair of language characters according to each of the plurality of encoding components to generate a pair of string identification characters for each of the plurality of encoding components, and wherein each string of identification characters represents the unique structure of one of the pair of language characters; comparing the pair of string identification characters to one another to generate a similarity score for each pair of string identification characters; combining the similarity scores to generate a composite similarity score; and determining, based on the composite similarity score, a similarity between the pair of language characters. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system for identifying a similarity between language characters, comprising:
-
a language character match engine that executes on one or more microprocessor and is adapted to receive a pair of language characters, wherein each language character has a unique structure; wherein the language character match engine is adapted to receive encoding configuration information from each of a plurality of encoding components to configure the language character match engine to encode the pair of language characters, and wherein when the language character match engine receives the encoding configuration information, the language character match engine encodes, based on the unique structure of each language character, the pair of language characters to generate a pair of string identification characters for each of the plurality of encoding components, and wherein each string of identification characters represents the unique structure of one of the pair of language characters; compares the pair of string identification characters to one another to generate a similarity score for each pair of string identification characters; combines the similarity scores to generate a composite similarity score; and determines, based on the composite similarity score, a similarity between the pair of language characters. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A non-transitory computer readable storable medium storing one or more sequences of instructions for identifying a similarity between language characters, wherein said instructions, when executed by one or more processors, cause the one or more processors to execute the steps of:
-
receiving a pair of language characters at a language character match engine, wherein each language character has a unique structure, and wherein the language character match engine is adapted to receive encoding configuration information from each of a plurality of encoding components to encode the pair of language characters based on the unique structure of each language character; encoding, based on the unique structure of each language character, the pair of language characters according to each of the plurality of encoding components to generate a pair of string identification characters for each of the plurality of encoding components, and wherein each string of identification characters represents the unique structure of one of the pair of language characters; comparing the pair of string identification characters to one another to generate a similarity score for each pair of string identification characters; combining the similarity scores to generate a composite similarity score; and determining, based on the composite similarity score, a similarity between the pair of language characters.
-
Specification