Japanese language sentence dividing method and apparatus
First Claim
Patent Images
1. A Japanese language sentence dividing apparatus comprising:
- dictionary means containing definitions, rules, tables, and words;
first sentence dividing means coupled to said dictionary means for dividing an inputted Japanese language sentence by referring to said dictionary means;
detecting means coupled to said first sentence dividing means for detecting when said first sentence dividing means encounters a word which is not registered in said dictionary means;
temporary dividing means coupled to said detecting means for dividing a character string containing at least a word which is not registered in said dictionary means into one or more realizable forms of partial character strings, each string containing at least one character, at dividing points in response to said detecting means and said dictionary means;
means for matching each of the partial character strings located between dividing points in each of said realizable forms from said temporary dividing means with a word in said dictionary means;
evaluation means for evaluating said realizable forms by counting the number of characters contained in said partial character string having successfully matched a word in said dictionary means, anda second sentence dividing means for dividing said inputted Japanese language sentence containing words which are not registered in said dictionary means, in a way that results in a best division of said sentence.
1 Assignment
0 Petitions
Accused Products
Abstract
A Japanese language sentence containing a word not registered in an electronic dictionary is divided by following a series of predetermined rules. When more than one division of such a sentence is possible, an evaluation is made in order to determine the best division of the sentence containing a word not registered in the dictionary.
49 Citations
10 Claims
-
1. A Japanese language sentence dividing apparatus comprising:
-
dictionary means containing definitions, rules, tables, and words; first sentence dividing means coupled to said dictionary means for dividing an inputted Japanese language sentence by referring to said dictionary means; detecting means coupled to said first sentence dividing means for detecting when said first sentence dividing means encounters a word which is not registered in said dictionary means; temporary dividing means coupled to said detecting means for dividing a character string containing at least a word which is not registered in said dictionary means into one or more realizable forms of partial character strings, each string containing at least one character, at dividing points in response to said detecting means and said dictionary means; means for matching each of the partial character strings located between dividing points in each of said realizable forms from said temporary dividing means with a word in said dictionary means; evaluation means for evaluating said realizable forms by counting the number of characters contained in said partial character string having successfully matched a word in said dictionary means, and a second sentence dividing means for dividing said inputted Japanese language sentence containing words which are not registered in said dictionary means, in a way that results in a best division of said sentence. - View Dependent Claims (2)
-
-
3. A method of dividing a Japanese language sentence comprising the steps of:
-
providing dictionary means containing definitions, rules, tables and words; dividing, in a first sentence dividing means, an inputted Japanese language sentence by referring to said dictionary means; detecting when said first sentence dividing means encounters a word which is not registered in said dictionary means; temporarily dividing a character string containing at least one word which is not registered in said dictionary means into one or more realizable forms of partial character strings, each string containing at least one character, at dividing points in response to the detecting of a not registered word; matching each of the partial character strings located between dividing points in each of said realizable forms with a word in said dictionary means; evaluating said realizable forms by counting the number of characters contained in said partial character string having successfully matched a word in said dictionary means, and dividing, in a second sentence dividing means, said inputted Japanese language sentence containing words which are not registered in said dictionary means, in a way that results in the best division of said sentence. - View Dependent Claims (4)
-
-
5. A Japanese language sentence dividing apparatus comprising:
-
memory means; first sentence dividing means for dividing an inputted Japanese language sentence by character type according to character type definition and division determination rules contained in said memory means and providing a first output; second sentence dividing means for dividing said first output by function word strings according to a function word table contained in said memory means and providing a second output; third sentence dividing means for dividing said second output by content word dictionary and function word table according to a content word dictionary and function word table contained in said memory means and providing a third output; fourth sentence dividing means for dividing said third output according to unregistered word deduction rules according to the content word dictionary, the function word table, unregistered word deduction rules and affix table and connection rules contained in said memory means and providing a fourth output, and adjustment means for adjusting said fourth output by compound word synthesizing rules according to compound word synthesizing rules contained in said memory means. - View Dependent Claims (6, 7)
-
-
8. A method of dividing a Japanese language sentence comprising the steps of:
-
providing a memory means; dividing an inputted Japanese language sentence by character type according to character type definitions and division determination rules contained in said memory means; dividing the sentence divided by character type by function word strings according to a function word table contained in said memory means; dividing the sentence divided by function word strings by content word dictionary and function word table according to a content word dictionary and function word table contained in said memory means; dividing the sentence divided by content word dictionary and function word table according to unregistered word deduction rules according to the content word dictionary, the function word table, unregistered word deduction rules and affix table and connection rules contained in said memory means, and adjusting the sentence divided according to unregistered word deduction rules by compound word synthesizing rules according to compound word synthesizing rules contained in said memory means. - View Dependent Claims (9, 10)
-
Specification