Method and device for phonetically encoding Chinese textual data for data processing entry
First Claim
1. An operator controllable electronic data entry device for encoding in digital-coded form, for output to word processing equipment in accordance with an encoding method, at least one desired block of a plurality of blocks of Chinese text characters, each of said blocks containing one or more characters in a linear order, said data entry device comprising:
- input means for inputting at least one of a plurality of encoding sequences of data, said at least one encoding sequence including phonetic and character position data which are used to encode said desired block in accordance with said encoding method, said input means comprising a means for inputting said phonetic data which corresponds to the pronunciation of each Chinese text character making up said at least one desired block and a means for inputting said character position data indicating a linear position which each text character making up said at least one desired block occupies within a sequence of characters constituting a phrase in the Chinese language;
said input means being selectively operable by said operator to input said encoding sequences of data, said input means generating a respective plurality of electronic, digitally-coded signal strings corresponding to said encoding sequences of data;
memory means containing a plurality of memory locations, each said location corresponding to one of said plurality of signal strings generated by the input means and each said location operative to store digitally-coded data for at least one Chinese character, by which coded data said at least one character can be visually reproduced or otherwise processed as required by the word processing equipment into which the data entry is made; and
processing means for receiving said signal strings generated by said input means, for searching said memory means for said corresponding memory locations, for retrieving said coded data stored in said memory means, and for outputting said data into the word processing equipment into which the data entry is made.
0 Assignments
0 Petitions
Accused Products
Abstract
A method and device for coded entry of Chinese character text data into a word processing, display, printing, telecommunication, etc. system. In the principal embodiment of the invention, an electronic input keyboard is used that has keys marked with phonetic notations suitable to represent Chinese speech sounds, as well as a set of "character position keys," operated by the following encoding rules: (1) the text is divided into blocks of characters, where one block may contain one or more characters, each block to be encoded by one uninterrupted typing sequence; (2) if the pronunciation of a block is unique, encoding is done simply by entering on the keyboard the phonetic data of the character(s) making up the block; (3) if the pronunciation of a block is not unique, first the phonetic data of a string of characters making up a longer block is entered, the pronunciation of that longer block being unique and the block to be encoded being a part of the longer block, and then by using the "character position keys" the operator enters the "position data," that is, the position(s) which the character(s) of the block to be encoded occupy within that longer block. In an alternative embodiment, part or all of the phonetic data of the characters are entered into the encoding apparatus, not by keyboard means, but by the use of an acoustic speech sound analyzer.
-
Citations
19 Claims
-
1. An operator controllable electronic data entry device for encoding in digital-coded form, for output to word processing equipment in accordance with an encoding method, at least one desired block of a plurality of blocks of Chinese text characters, each of said blocks containing one or more characters in a linear order, said data entry device comprising:
-
input means for inputting at least one of a plurality of encoding sequences of data, said at least one encoding sequence including phonetic and character position data which are used to encode said desired block in accordance with said encoding method, said input means comprising a means for inputting said phonetic data which corresponds to the pronunciation of each Chinese text character making up said at least one desired block and a means for inputting said character position data indicating a linear position which each text character making up said at least one desired block occupies within a sequence of characters constituting a phrase in the Chinese language;
said input means being selectively operable by said operator to input said encoding sequences of data, said input means generating a respective plurality of electronic, digitally-coded signal strings corresponding to said encoding sequences of data;memory means containing a plurality of memory locations, each said location corresponding to one of said plurality of signal strings generated by the input means and each said location operative to store digitally-coded data for at least one Chinese character, by which coded data said at least one character can be visually reproduced or otherwise processed as required by the word processing equipment into which the data entry is made; and processing means for receiving said signal strings generated by said input means, for searching said memory means for said corresponding memory locations, for retrieving said coded data stored in said memory means, and for outputting said data into the word processing equipment into which the data entry is made. - View Dependent Claims (2, 3, 4)
-
-
5. A method for identifying, within a text, Chinese characters for coded entry of said characters in a word processing system by a data entry device comprising:
- at least (a) a keyboard having actuable keys for inputting phonetic data, indicating a pronunciation of said characters, and actuable keys for inputting character position data, indicating the linear position which said characters occupy within a sequence or block of characters constituting a phrase in the Chinese language, and outputting a plurality of digital coded strings of signals in response to a sequence of actuating said keys, (b) a memory having a plurality of locations corresponding uniquely to each of said plurality of digital coded strings and storing therein at each location a digital code representing at least one Chinese character, and (c) a processor means responsive to digital coded strings from said keyboard to search said memory for locations corresponding to said strings and to retrieve digital codes representing Chinese characters therefrom, said method comprising the steps of;
dividing the text in which said characters are to be identified into blocks of characters, each block containing one or more characters; and executing on said keyboard for each of said blocks one uninterrupted identifying typing sequence, wherein (i) the identifying typing sequence for the characters contained in a block having a unique pronunciation comprises typing on said keyboard, in a sequence as speech sounds are pronounced when the characters are spoken, the phonetic data of each character being contained in said block, and (ii) the identifying typing sequence for the characters contained in a block having a pronunciation that is not unique comprises (a) first typing the phonetic data of the characters making up a block longer than said block having a pronounciation that is not unique, said longer block being selectable from a plurality of blocks which constitute phrases in the Chinese language and have a pronunciation that is unique, said block having a pronounciation that is not unique being a part of said longer block, and (b) then on the actuable keys for character position data typing, as part of the same identifying typing sequence, position data specifying the linear position which each character of said block having a pronounciation that is not unique occupies within the sequence of characters making up said longer block.
- at least (a) a keyboard having actuable keys for inputting phonetic data, indicating a pronunciation of said characters, and actuable keys for inputting character position data, indicating the linear position which said characters occupy within a sequence or block of characters constituting a phrase in the Chinese language, and outputting a plurality of digital coded strings of signals in response to a sequence of actuating said keys, (b) a memory having a plurality of locations corresponding uniquely to each of said plurality of digital coded strings and storing therein at each location a digital code representing at least one Chinese character, and (c) a processor means responsive to digital coded strings from said keyboard to search said memory for locations corresponding to said strings and to retrieve digital codes representing Chinese characters therefrom, said method comprising the steps of;
-
6. An electronic data entry device for encoding in digital-coded form, for output into word processing equipment, at least one desired block of a plurality of blocks of Chinese text characters, each of said blocks containing one or more characters in linear order and each of said characters and each of said blocks having a pronounciation identifiable by phonetic data, and the encoding of said at least one desired block proceeding as follows when utilizing said data entry device:
-
in the case in which the pronunciation of said at least one desired block is unique, encoding said block on the basis of a specific sequence of phonetic data of each said character making up said block, said phonetic data representing the pronunciation of said character, and in the case in which the pronunciation of said at least one desired block is not unique, encoding said block on the basis of (1) a specific sequence of phonetic data representing the pronunciation of a block of characters longer than said desired block, said longer block also being one of said plurality of blocks of Chinese text characters, the pronunciation of said longer block being unique and said at least one desired block being a part of the linear sequence of characters in said longer blocks and (2) a specific sequence of character position data representing the linear position which each of the characters making up said at least one desired block occupies within said longer block, said data entry device comprising; (a) a first input means for inputting, when a user determines that the pronunciation of said at least one desired block is unique, at least one of a plurality of sequences of said phonetic data representing the pronunciation of each of the characters making up said at least one desired block, said sequence of phonetic data being used to encode said block, and generating at lease one of a respective plurality of strings of electronic signals, said at least one string of signals carrying said at least one sequence of phonetic data and being used to encode said desired block in digital-coded form; (b) a second input means for additionally inputting, when a user determines that the pronunciation of said at least one desired block is not unique, at least one of a plurality of sequences of said character position data representing the linear position of each character contained in said at least one desired block within a block longer than said desired block, said sequence of phonetic data and sequence of character position data being inputted in sequence by said first and second input means to uniquely encode said at least one desired block, and said first and second input means generating in the same sequence at least one of a respective plurality of strings of electronic signals corresponding to the input sequences of phonetic and character position data, said at least one string of signals carrying one sequence of phonetic and character position data encoding said at least one desired block in digital-coded form; (c) memory means containing a plurality of memory locations, each said location corresponding to one of said plurality of signal strings generated by said first input means or said first and second input means in sequence, and each said location operative to store digitally-coded data for at least one Chinese character, said coded data being used when visually reproducing or otherwise processing said at least one character by the word processing equipment into which the data entry is made; and (d) processing means for receiving said signal strings generated by said first input means or said first and second input means in sequence, for searching said memory means for the corresponding memory locations, for retrieving said coded data stored in said memory means, and for outputting said data into the word processing equipment into which the data entry is made. - View Dependent Claims (7, 8, 9)
-
-
10. A computer-implemented method of identifying a block of Chinese characters, said block containing one or more characters for which information related thereto is stored in memory at an individually addressable location, comprising the steps of:
-
a) inputting a first string of signals comprising at least encoded phonetic data for said characters in said block, b) comparing said input string with the addresses of individual locations in memory, c) upon detection of an identity between said input string and an address in memory, retrieving the information stored at the corresponding memory location, d) determining whether said information represents a single character or multiple characters and, in response thereto; (i) utilizing said information to generate a block comprising one text character when said information is determined to represent a single character, and (ii) dividing said information into a plurality of single character codes and utilizing said single character codes to generate a block comprising a sequence of text characters, when said information is determined to represent multiple characters. - View Dependent Claims (11, 12, 13)
-
-
14. A method of selecting and inputting in digital-coded form a desired block of Chinese characters into an electronic information processing system, said desired block being one of a plurality of blocks of characters in the Chinese language, each of said blocks containing one or more characters in linear order with each of said characters having a corresponding pronounciation,
said method utilizing (i) phonetic data input means for inputting respective phonetic data of respective Chinese characters, said phonetic data for each character corresponding to the pronunciation of said character, and a digital code carried by the respective signal string generated by said phonetic data input means in response to being selectively actuated serving as a phonetic identifier code for said character, further a sequence of pronunciations of characters making up a block being the pronunciation of said block and a sequence of phonetic identifier codes for the characters making up a block being the phonetic identifier code for said block, said method further utilizing (ii) character position data input means for inputting the linear position or sequence data of a character or characters within a linear block of characters as said characters make up a phrase or a text in the Chinese language, said phonetic data input means and said character position data input means being further connected through a data processing means to an addressable memory means and to said information processing system into which the data entry is made, said method comprising the steps of: -
(a) providing a location in said memory means for at least each of the blocks, of said plurality of blocks, having a non-identical pronunciation and identifying each of said locations with the respective phonetic identifier code for said block, said phonetic identifier code serving as the address of said location; (b) storing character code information at each of said locations, the number of pieces of component parts of character code information and the sequence in which said pieces are stored at each location being the same as the number of characters and the sequence thereof which make up the respective block of which the phonetic identifier code serves as the address of said location, each piece of character code information standing for one Chinese character and being the digital code information used by said information processing system to visually reproduce or transmit or otherwise process said Chinese character; (c) in the case in which the pronunciation of said desired block is unique, no other block of characters in Chinese having the same pronunciation as said desired block, (i) selectively operating said phonetic data input means to input a string of signals carrying the phonetic identifier code of said block, (ii) comparing in said data processing means said phonetic identifier code with the addresses of locations in said memory means, (iii) upon detecting an identity between said phonetic identifier code and an address in said memory means, retrieving by said data processing means the character code information stored at said location, and (iv) outputting from said data processing means a signal string carrying said information into said information processing system into which the data entry is made; and (d) in the case in which the pronunciation of said desired block is not unique, there being at least one other block in the Chinese language containing at least one different character but having the same pronunciation as said desired block, (i) selectively operating said phonetic data input means to input a first string of signals carrying the phonetic identifier code of a block of characters longer than said desired block and selected from among said plurality of blocks of characters in the Chinese language, the pronunciation of said longer block being unique and said desired block to be selected being a part of said longer block, (ii) selectively operating said character position data input means to input a second string of signals carrying the data indicating the linear position of each character of said desired block within the linear sequence of characters in said longer block, (iii) comparing in said data processing means said phonetic identifier code carried by said first string of signals with the addresses of locations in said memory means, (iv) upon detecting an identity between said phonetic identifier code carried by said first input string and an address in said memory means, retrieving the character code information stored at said memory location, and (v) dividing the signal string carrying said character code information retrieved from said memory location into segments of strings, each of said segments carrying one piece of character code information, and feeding into said information processing system, in the sequence in which said linear position data had been inputted, only those of said segments which carry the character code information for the character or characters making up the desired block, as specified by said input of linear character position data. - View Dependent Claims (15)
-
-
16. An electronic keyboard data entry method for Chinese character texts, comprising the steps of:
-
(a) dividing the text into blocks of characters, each block containing at least one character, and performing the data entry character block by character block, each block being entered by one uninterrupted typing sequence; (b) if the pronunciation of a first of said character blocks to be entered is unique in the language, entering, by selective actuation of the keys of said electronic input keyboard, a first string of phonetic data of each character contained in said character block; (c) if the pronunciation of said first of said character blocks is not unique, entering on said keyboard a second string of phonetic data of a longer block containing at least one more character than said first one to be entered, said longer block having a unique pronunciation and including said character block to be entered, and then entering on the same keyboard position data indicating position(s) which the character(s) of said character block to be entered occupy within said longer block; and (d) repeating steps (b) and (c) for the remaining blocks of the text to be entered. - View Dependent Claims (17, 18, 19)
-
Specification