Compressed unicode normalization with inversion
Abstract
Any string in any character set with an arbitrary-leveled weight-based comparison system is transformed into a bitstring in such a way that two transformed strings can be compared byte-by-byte. The resulting bitstring has the minimum possible maximum length. The transformed bitstrings can be inverted, meaning the original string can be recovered from the transformed string.
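As a rough illustration of the idea in the abstract, the following minimal sketch builds a byte-comparable key from a two-level weight table and recovers the string from it. Everything named here (WEIGHTS, INVERSE, SEPARATOR, to_key, from_key, and the weight values) is hypothetical and chosen for the example. The layout is the conventional "all primaries, separator, all secondaries" sort key, and it does not attempt the compression or the minimum-length property described above.

```python
# Hypothetical two-level weight table: character -> (primary, secondary).
# All weights are in 1..255 so that 0 can serve as a level separator.
WEIGHTS = {
    "a": (1, 1), "A": (1, 2),
    "b": (2, 1), "B": (2, 2),
    "c": (3, 1), "C": (3, 2),
}
# Reverse lookup used for inversion; assumes each weight pair is unique.
INVERSE = {pair: ch for ch, pair in WEIGHTS.items()}
SEPARATOR = 0  # sorts below every real weight


def to_key(text: str) -> bytes:
    """Concatenate all primary weights, a separator, then all secondary weights."""
    primaries = bytes(WEIGHTS[ch][0] for ch in text)
    secondaries = bytes(WEIGHTS[ch][1] for ch in text)
    return primaries + bytes([SEPARATOR]) + secondaries


def from_key(key: bytes) -> str:
    """Recover the original string from a key produced by to_key."""
    split = key.index(SEPARATOR)
    primaries, secondaries = key[:split], key[split + 1:]
    return "".join(INVERSE[pair] for pair in zip(primaries, secondaries))
```

With this layout, sorting by to_key uses plain byte comparison: primary differences decide first, and secondary weights only matter when every primary weight ties. Inversion works here only because each (primary, secondary) pair in the assumed table maps back to exactly one character.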
25 Claims
1. A method for use in a computer environment, comprising:
receiving a string of characters; and
performing a normalizing transformation on the string of characters to generate a bitstring.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 17, 18, 19
11. A method of comparing a first string of characters and a second string of characters, comprising:
performing a normalizing transformation on each string of characters to generate a first bitstring having a first primary weight and a first secondary weight, and a second bitstring having a second primary weight and a second secondary weight;
comparing the first primary weight and the second primary weight;
if the first primary weight is greater than the second primary weight then determining that the first bitstring is greater than the second bitstring;
if the first primary weight is less than the second primary weight then determining that the first bitstring is less than the second bitstring; and
if the first primary weight is equal to the second primary weight then performing a comparison based on the first secondary weight and the second secondary weight.
Dependent claims: 12, 13, 14, 15, 16
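The comparison steps recited in claim 11 can be sketched as follows. This is only an illustration: the Weighted container and the compare function are hypothetical names, each weight is assumed to be a byte sequence, and the secondary comparison is assumed to follow the same greater/less rule as the primary one, which the claim itself does not spell out.

```python
from typing import NamedTuple


class Weighted(NamedTuple):
    """Hypothetical container for the two weights carried by a bitstring."""
    primary: bytes
    secondary: bytes


def compare(first: Weighted, second: Weighted) -> int:
    """Return 1, -1, or 0 as first is greater than, less than, or equal to second."""
    if first.primary > second.primary:
        return 1
    if first.primary < second.primary:
        return -1
    # Primary weights tie, so the secondary weights decide (assumed rule).
    if first.secondary > second.secondary:
        return 1
    if first.secondary < second.secondary:
        return -1
    return 0
```

For example, two inputs whose primary weights are both b"\x01\x02" are ordered by their secondary weights alone, while any difference in primary weight settles the result without looking at the secondary level.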
20. A string processing system, comprising:
a first data source using a string of characters;
a weight table comprising a plurality of characters, including the characters in the string of characters, each character having an associated code, primary weight, and secondary weight; and
a processor to perform a normalizing transformation on the string of characters to generate a bitstring.
Dependent claims: 21, 22, 23, 24, 25
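One possible in-memory shape for the weight table recited in claim 20 is sketched below. The names WeightEntry, build_table, covers, and TABLE are hypothetical, the weight values are made up, and using the Unicode code point as the "associated code" is an assumption rather than something the claim specifies.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class WeightEntry:
    code: int       # character code; assumed here to be the Unicode code point
    primary: int    # primary weight (illustrative value)
    secondary: int  # secondary weight (illustrative value)


def build_table(rows):
    """Index (character, primary, secondary) rows by character."""
    return {ch: WeightEntry(ord(ch), p, s) for ch, p, s in rows}


def covers(table, text: str) -> bool:
    """Check that every character of text has an entry in the table."""
    return all(ch in table for ch in text)


# Example table: "e" and "E" share a primary weight and differ at the secondary level.
TABLE = build_table([("e", 5, 1), ("E", 5, 2), ("f", 6, 1)])
```

The covers check mirrors the claim's requirement that the table include the characters in the string being processed; a string containing a character without an entry would need to be rejected or handled separately before any transformation.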
Specification