Apparatus for and method of analyzing chinese
First Claim
1. An apparatus for analyzing Chinese language, comprising:
- an input unit inputting a Chinese sentence;
a morpheme analyzer dividing the Chinese sentence into words;
a dependency relationship analyzer analyzing a dependency relationship between a parent word being a dependency destination of each of the words and a child word being a dependent from each of the words, the words being obtained by dividing the input Chinese sentence;
a memory unit that stores lihe-word information that registers a first word being a Chinese morpheme and capable of being a part of a lihe-word with a plurality of second words each forming a lihe-word with the first word, does not include words inserted between the first word and the second word, and lihe-words formed by the first word and the second words in the lihe-word information are not grouped based on similarity with other lihe-words;
a lihe-word processor detecting the first word and a second word from the dependency relationship, based on the lihe-word information without using the similarity of the lihe-word with other lihe-words, and the lihe-word processor changing a dependency destination of a word depending on both the first word and the second word to the lihe-word formed by combining the first word with the second word in the dependency relationship;
a generating unit generating a sentence based on the lihe-word formed by combining the first word with the second word and the word depending on the lihe-word by changing the dependency destination; and
an output unit outputting the sentence.
4 Assignments
0 Petitions
Accused Products
Abstract
An apparatus for analyzing Chinese according to one aspect of the present invention includes a dependency structure analyzer analyzing a dependency relationship between words by extracting a parent word being a dependency destination of each of the words and a child word being a dependent from each of the words. The words are obtained by dividing a Chinese sentence. The apparatus also includes a lihe-word processor referring to lihe-word information that includes a first word and capable of being a part of a lihe-word and a second word forming the lihe-word with the first word. The lihe-word processor detects the first word and the second word from the words analyzed, and then changes a dependency destination of a word depending on both the first word and the second word to the lihe-word formed by combining the first word with the second word.
12 Citations
16 Claims
-
1. An apparatus for analyzing Chinese language, comprising:
-
an input unit inputting a Chinese sentence; a morpheme analyzer dividing the Chinese sentence into words; a dependency relationship analyzer analyzing a dependency relationship between a parent word being a dependency destination of each of the words and a child word being a dependent from each of the words, the words being obtained by dividing the input Chinese sentence; a memory unit that stores lihe-word information that registers a first word being a Chinese morpheme and capable of being a part of a lihe-word with a plurality of second words each forming a lihe-word with the first word, does not include words inserted between the first word and the second word, and lihe-words formed by the first word and the second words in the lihe-word information are not grouped based on similarity with other lihe-words; a lihe-word processor detecting the first word and a second word from the dependency relationship, based on the lihe-word information without using the similarity of the lihe-word with other lihe-words, and the lihe-word processor changing a dependency destination of a word depending on both the first word and the second word to the lihe-word formed by combining the first word with the second word in the dependency relationship; a generating unit generating a sentence based on the lihe-word formed by combining the first word with the second word and the word depending on the lihe-word by changing the dependency destination; and an output unit outputting the sentence. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A method of analyzing Chinese language implemented by a computer processor executing instructions, comprising:
-
inputting, by a computer receiving input from a user interface, a Chinese sentence; dividing, by the computer, the sentence into words; analyzing, by the computer, a dependency relationship between a parent word being a dependency destination of each of the words and a child word being a dependent from each of the words; detecting, by the computer, a first word being a Chinese morpheme and capable of being a part of a lihe-word and a second word forming the lihe-word with the first word, from the dependency relationship, based on a lihe-word information in a memory unit, without using the similarity of the lihe-word with other lihe-words, wherein the lihe-word information registers the first word with a plurality of second words each forming a lihe-word with the first word, but does not include words inserted between the first word and the second word, and the lihe-word formed by the first word and the second words in the lihe-word information not being grouped based on similarity with other lihe-words; changing, by the computer, a dependency destination of a word depending on both the first word and the second word to the lihe-word formed by combining the first word with the second word in the dependency relationship; generating, by the computer, a sentence based on the lihe-word formed by combining the first word with the second word and the word depending on the lihe-word by changing the dependency destination; and outputting the sentence.
-
-
15. A computer program product having a non-transitory computer readable medium including programmed instructions, wherein the instructions, when executed by a computer, cause the computer to perform a method which comprises:
-
inputting a Chinese sentence; dividing the sentence into words; analyzing a dependency relationship between a parent word being a dependency destination of each of the words and a child word being a dependent from each of the words; detecting, by the computer, a first word being a Chinese morpheme and capable of being a part of a lihe-word and a second word forming the lihe-word with the first word, from the dependency relationship, based on a lihe-word information in a memory unit without using the similarity of the lihe-word with other lihe-words, wherein the lihe-word information registers the first word with a plurality of second words each forming a lihe-word with the first word, but does not include words inserted between the first word and the second word, and the lihe-word formed by the first word and a second word in the lihe-word information being not grouped based on similarity with other lihe-words; changing a dependency destination of a word depending on both the first word and the second word to the lihe-word formed by combining the first word with the second word in the dependency relationship; generating a sentence based on the lihe-word formed by combining the first word with the second word and the word depending on the lihe-word by changing the dependency destination; and outputting the sentence. - View Dependent Claims (16)
-
Specification