Unicode converter
First Claim
1. A method for converting a source string into a target string, said method comprising:
- (a) receiving a source string having a first character encoding;
(b) sequentially dividing the source string into text elements, each text element including one or more characters of the source string, and at least one of the text elements including a plurality of characters of the source string;
(c) obtaining attribute information for the text elements after or during said dividing (b);
(d) looking up in a mapping table a conversion code associated with a second character encoding for each of the text elements said looking up (c) in the mapping table of the conversion code for each of the text elements includes the operations of identifying a requested variant;
selecting one of a plurality of mapping tables based on the attribute information, the requested variant, and a length of the text element; and
looking up the conversion code from the selected one of the mapping tables; and
(e) combining the conversion codes for the text elements so as to form a target string of the second character encoding.
3 Assignments
0 Petitions
Accused Products
Abstract
A character code conversion system that provides round trip fidelity, while ensuring that the resulting character codes are interchangeable with other platforms. The code conversion system is able to map a single source character or a sequence of characters to either a single target character or a sequence of target characters. With round trip fidelity, source text can be converted to target text and then back again to the original source text. The interchangability is ensured by maximizing the use of standard target characters, and by minimizing the use of private characters. The code conversion is particularly useful for converting to/from Unicode characters from/to other character sets.
150 Citations
31 Claims
-
1. A method for converting a source string into a target string, said method comprising:
-
(a) receiving a source string having a first character encoding; (b) sequentially dividing the source string into text elements, each text element including one or more characters of the source string, and at least one of the text elements including a plurality of characters of the source string; (c) obtaining attribute information for the text elements after or during said dividing (b); (d) looking up in a mapping table a conversion code associated with a second character encoding for each of the text elements said looking up (c) in the mapping table of the conversion code for each of the text elements includes the operations of identifying a requested variant;
selecting one of a plurality of mapping tables based on the attribute information, the requested variant, and a length of the text element; and
looking up the conversion code from the selected one of the mapping tables; and(e) combining the conversion codes for the text elements so as to form a target string of the second character encoding. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A code conversion system for converting a source string to a target string, said system comprising:
-
a converter for controlling the conversion of the source string having a first character encoding into the target string having a second character encoding; a scanner, operatively connected to said converter, for dividing the source string into text elements, each text element including one or more characters of the source string, and at least one of the text elements including a plurality of characters of the source string; a mapping table for storing target encodings for text elements of the source encoding, said mapping table includes a plurality of mapping portions, each of the mapping portions stores the target encodings for text elements of a different length, and an appropriate one of the mapping portions is utilized for each of the text elements defending on the lengths of the text elements; and a lookup handler, operatively connected to said converter and said mapping table, for looking up in said mapping table a conversion code associated with a second character encoding for each of the text elements. - View Dependent Claims (15, 16, 17, 18, 19, 20, 26, 27)
-
-
21. A computer readable medium containing program instructions for converting a source string into a target string, said computer readable medium comprising:
-
computer readable code configured to cause a computer to effect receiving a source string having a first character encoding; computer readable code configured to cause a computer to effect dividing the source string into text elements, each text element including one or more characters of the source string, and at least one of the text elements including a plurality of characters of the source string; computer readable code configured to cause a computer to effect obtaining attribute information for the text elements; computer readable code configured to cause a computer to effect looking up in a conversion code associated with a second character encoding for each of the text elements, said computer readable code operates to effect looking up in the mapping table the conversion code associated with the second character encoding for each of the text elements by operating to select at least one of a plurality of mapping tables based on the attribute information and a length of the text element, and then looking up the conversion code from the selected one of the mapping tables; and computer readable code configured to cause a computer to effect combining the conversion codes for the text elements so as to form a target string of the second character encoding. - View Dependent Claims (29, 30, 31)
-
-
22. A method for converting a source string into a target string, said method comprising:
-
(a) receiving a source string having a first character encoding; (b) sequentially dividing the source string into text elements, each text element including one or more characters of the source string, and at least one of the text elements including a plurality of characters of the source string; (c) obtaining attribute information for the text elements after or during said dividing (b) (d) looking up in a mapping table a conversion code associated with a second character encoding for each of the text elements, said looking up (d) in the mapping table of the conversion code for each of the text elements includes the operations of (d1) selecting one of a plurality of mapping tables based on the attribute information and a length of the text element, and (d2) looking up the conversion code from the selected one of the mapping tables; and (e) combining the conversion codes for the text elements so as to form a target string of the second character encoding. - View Dependent Claims (23, 24, 25, 28)
-
Specification