Dictionary retrieval device
Abstract
A dictionary retrieval device comprises a conversion character definition form that provides group IDs for character subsets, a character-group ID conversion part that replaces a character with a group ID, an input character string conversion part that converts the input character string received from the input part to an input group ID string, a dictionary conversion part that converts a word dictionary to a converted word dictionary defined by notation group ID strings, and a dictionary retrieval part that searches the converted word dictionary using the input group ID string. By treating the characters of each subset defined in the conversion character definition form as the same element, the device can retrieve a word that previously could not be retrieved from the dictionary because of an input error.
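The retrieval flow summarized in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the confusable-character subsets, the `G0`/`G1`/`G2` group ID format, the sample word list, and every function name are assumptions made for illustration. Characters outside every subset are treated here as their own singleton group, which is also an assumption.

```python
from collections import defaultdict

# Hypothetical conversion character definition: each subset of easily
# confused characters shares one group ID. The subsets are illustrative.
CONFUSABLE_SUBSETS = [
    {"0", "O", "o"},
    {"1", "l", "I"},
    {"5", "S", "s"},
]


def build_char_to_group(subsets):
    """Map every character in a subset to that subset's group ID."""
    char_to_group = {}
    for group_id, subset in enumerate(subsets):
        for ch in subset:
            char_to_group[ch] = f"G{group_id}"
    return char_to_group


def to_group_ids(text, char_to_group):
    """Replace each character with its group ID.

    Characters that belong to no subset stand for themselves (assumption).
    """
    return tuple(char_to_group.get(ch, ch) for ch in text)


def convert_dictionary(words, char_to_group):
    """Build the converted word dictionary: group ID string -> words."""
    converted = defaultdict(list)
    for word in words:
        converted[to_group_ids(word, char_to_group)].append(word)
    return converted


def retrieve(input_string, converted, char_to_group):
    """Look up the input by its group ID string instead of its raw characters."""
    return converted.get(to_group_ids(input_string, char_to_group), [])


CHAR_TO_GROUP = build_char_to_group(CONFUSABLE_SUBSETS)
WORD_DICTIONARY = ["solid", "SOS", "lose"]  # illustrative word dictionary
CONVERTED = convert_dictionary(WORD_DICTIONARY, CHAR_TO_GROUP)

# "so1id" contains the digit 1 where the dictionary word has the letter l.
print(retrieve("so1id", CONVERTED, CHAR_TO_GROUP))  # ['solid']
```

Because the dictionary is converted once and stored, each lookup costs one conversion of the input plus one hash lookup, regardless of how many characters in the input were mistyped within their subsets.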
15 Claims
1. A dictionary retrieval device for converting an input character string input from an input part of a computer and outputting the converted character string to an output part of the computer, comprising:
conversion character definition means for classifying a character set into subsets, and providing group IDs for each subset;
character group ID conversion means for converting from each character of the character string to a group ID by using said conversion character definition means;
input character string conversion means for converting the input character string input from the input part to a group ID string using said character-group ID conversion means;
first storage means for storing a word dictionary containing words which are significant character strings appearing at the input part;
dictionary conversion means for converting all words in said word dictionary to the group ID string using said character group ID conversion means;
second storage means for storing a converted word dictionary containing words which are converted by said dictionary conversion means; and
dictionary retrieval means for retrieving the converted word dictionary expressed by the group ID for the group ID string converted by said input character string conversion means.
Dependent claims: 2, 3, 4, 5, 6
7. A morphology analysis device for analyzing a character string input into a computer, comprising:
dictionary retrieval means for retrieving a dictionary, said dictionary retrieval means including:
  conversion character definition means for classifying a character set into subsets, and providing group IDs for each subset,
  character-group ID conversion means for replacing each character of the character string with group IDs by using said conversion character definition means,
  input character string conversion means for converting the character string to an input group ID string using said character-group ID conversion means,
  first storage means for storing a word dictionary containing words which are significant character strings,
  dictionary conversion means for converting a notation character string of each word which is defined in the word dictionary to a notation group ID string using said character-group ID conversion means,
  second storage means for storing a converted word dictionary containing words which are converted by said dictionary conversion means, and
  dictionary retrieval means for retrieving the converted word dictionary expressed by the notation group ID for the input group ID string converted by said input character string conversion means;
grammar rule supply means for supplying a grammar rule; and
grammar checking means for executing a morphology analysis by referring to the grammar rule and the converted word dictionary, and for outputting the result of the morphology analysis together with dictionary information.
8. A character string correction device for character strings in sentences inputted into a computer, comprising:
dictionary retrieval means for retrieving a dictionary, said dictionary retrieval means including:
  conversion character definition means for classifying a character set [C = {C1, C2, ..., Cn}] into subsets [(G1 ⊂ C)], and providing group IDs for each subset,
  character-group ID conversion means for replacing each character of the character string with group IDs by using said conversion character definition means,
  input character string conversion means for converting the character strings to input group ID strings using said character-group ID conversion means,
  first storage means for storing a word dictionary containing words which are significant character strings,
  dictionary conversion means for converting a notation character string of each word which is defined in the word dictionary to a notation group ID string using said character-group ID conversion means,
  second storage means for storing a converted word dictionary containing words which are converted by said dictionary conversion means, and
  dictionary retrieval means for retrieving the converted word dictionary expressed by the notation group ID for each of the input group ID strings converted by said input character string conversion means;
grammar rule supply means for supplying a grammar rule;
grammar checking means for executing a morphology analysis by referring to the grammar rule and the converted word dictionary, and for outputting the result of the morphology analysis together with dictionary information; and
morphology composition means provided between said grammar checking means and the output part for outputting sentences by composing the results of the morphology analysis by said grammar checking means.
9. A post-processing device in a computer for character recognition comprising:
input means for inputting candidate character strings which have plural character candidates for each character of an input character string output from a character recognition process device to an expanded dictionary retrieval part;
dictionary retrieval means for retrieving a dictionary, and including:
  conversion character definition means for classifying a character set [C = {C1, C2, ..., Cn}] into subsets [(G1 ⊂ C)], and providing group IDs for each subset,
  character-group ID conversion means for replacing each character of the input character string with group IDs by using said conversion character definition means,
  input character string conversion means for converting the input character string to an input group ID string using said character-group ID conversion means,
  first storage means for storing a word dictionary containing words which are significant character strings,
  dictionary conversion means for converting a notation character string of each word which is defined in the word dictionary to a notation group ID string using said character-group ID conversion means,
  second storage means for storing a converted word dictionary containing words which are converted by said dictionary conversion means, and
  dictionary retrieval means for retrieving the converted word dictionary expressed by the notation group ID for the input group ID string converted by said input character string conversion means;
grammar rule supply means for supplying a grammar rule;
grammar checking means for executing the morphology analysis by referring to the grammar rule and the converted word dictionary, and for outputting the result of the morphology analysis together with dictionary information; and
morphology composition means provided between said grammar checking means and the output part for outputting sentences by composing the results of the morphology analysis by said grammar checking means by using an evaluation function.
10. A computer to translate an input character string of characters into a converted word, comprising:
conversion character definition means for classifying a character set into subsets, providing group identifiers for each of the subsets, and creating a notation group identification string using the group identifiers corresponding to characters in the converted word;
character group identification conversion means for converting the input character string into an input group identification string by replacing each of the characters of the input character string with one of the group identifiers; and
dictionary retrieval means for retrieving the converted word corresponding to the notation group identification string matching the input group identification string.
Dependent claims: 11
12. A method to translate an input character string of characters into a converted word in a computer, comprising the steps of:
(a) classifying a character set into subsets;
(b) providing group identifiers for each of the subsets;
(c) creating a notation group identification string using the group identifiers corresponding to characters in the converted word;
(d) converting the input character string into an input group identification string by replacing each of the characters of the input character string with one of the group identifiers; and
(e) retrieving the converted word corresponding to the notation group identification string matching the input group identification string.
Dependent claims: 13, 14
15. A method to translate an input character string of characters into a converted word in a computer, comprising the steps of:
(a) prestoring notation group identification strings representing converted words using group identifiers assigned to subsets of a character set;
(b) converting the input character string into an input group identification string by replacing each of the characters of the input character string with one of the group identifiers; and
(c) retrieving the converted word corresponding to one of the notation group identification strings matching the input group identification string.
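Claims 7 through 9 combine the group ID lookup with a grammar rule and, in claim 9, an evaluation function. The following Python sketch shows one way these pieces might fit together. It is an illustration only: the group table, the part-of-speech tags, the adjacency-pair grammar rule in `ALLOWED`, and the "fewest morphemes" evaluation function are all assumptions, since the claims do not specify any of them.

```python
# Hypothetical group table: confusable characters share one group ID.
CHAR_TO_GROUP = {"0": "G0", "O": "G0", "o": "G0",
                 "1": "G1", "l": "G1", "I": "G1"}


def to_group_ids(text):
    """Replace each character with its group ID (or itself if ungrouped)."""
    return tuple(CHAR_TO_GROUP.get(ch, ch) for ch in text)


# Hypothetical word dictionary: notation -> part of speech.
WORD_DICTIONARY = {"cold": "adj", "old": "adj", "milk": "noun", "c": "noun"}

# Converted word dictionary: notation group ID string -> (word, part of speech).
CONVERTED = {}
for word, pos in WORD_DICTIONARY.items():
    CONVERTED.setdefault(to_group_ids(word), []).append((word, pos))

# Hypothetical grammar rule: which part-of-speech pairs may be adjacent.
ALLOWED = {("START", "adj"), ("START", "noun"),
           ("adj", "noun"), ("noun", "adj"), ("noun", "END")}


def analyze(text):
    """Enumerate segmentations into dictionary words that satisfy ALLOWED."""
    ids = to_group_ids(text)
    results = []

    def extend(start, prev_pos, path):
        if start == len(ids):
            if (prev_pos, "END") in ALLOWED:
                results.append(list(path))
            return
        for end in range(start + 1, len(ids) + 1):
            for word, pos in CONVERTED.get(ids[start:end], []):
                if (prev_pos, pos) in ALLOWED:
                    path.append((word, pos))
                    extend(end, pos, path)
                    path.pop()

    extend(0, "START", [])
    return results


def evaluate(analysis):
    """Hypothetical evaluation function: fewer morphemes scores higher."""
    return -len(analysis)


def compose(text):
    """Compose an output sentence from the best-scoring analysis, if any."""
    analyses = analyze(text)
    if not analyses:
        return None
    best = max(analyses, key=evaluate)
    return " ".join(word for word, _ in best)


# The digit 1 stands in for the letter l in both words.
print(compose("co1dmi1k"))  # cold milk
```

For the input `co1dmi1k`, the sketch finds two grammatical analyses, `c` + `old` + `milk` and `cold` + `milk`, and the evaluation function selects the shorter one. A real evaluation function would more plausibly weigh word frequency or recognition confidence.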
Specification