Sentence processing apparatus and method thereof, utilizing dictionaries to interpolate elliptic characters or symbols
Abstract
A document or sentence processing apparatus having an input unit for inputting characters, a display unit for displaying the input characters, and a processing unit for converting and editing the input characters. The processing unit has a candidate word extraction unit and a determination unit. The candidate word extraction unit extracts candidates for words whose characters have been omitted, and/or for words that have been omitted entirely, by referring to a vocabulary dictionary storing words and their usage frequencies and to a dictionary of transition between words defining information on the transitions between words and the probabilities of those transitions, and by searching the vocabulary dictionary for the characters before and after the elliptic character included in the input sentence. The determination unit selects a single word among the extracted candidate words by referring to the dictionary of transition between words.
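The extraction and determination described in the abstract can be sketched roughly as follows. This is an illustrative sketch only, not the implementation from the specification: the `~` elliptic symbol, the sample dictionary contents, the function names, and the scoring rule (product of the incoming and outgoing transition probabilities, with usage frequency as a tie-breaker) are all assumptions made for the example.

```python
# Hypothetical vocabulary dictionary: word -> usage frequency.
VOCAB = {"character": 12, "charter": 3, "chapter": 7, "recognition": 9}

# Hypothetical dictionary of transition between words:
# (previous word, next word) -> transition probability.
TRANSITIONS = {
    ("handwritten", "character"): 0.6,
    ("handwritten", "chapter"): 0.1,
    ("character", "recognition"): 0.7,
    ("chapter", "recognition"): 0.05,
}

ELLIPSIS = "~"  # assumed elliptic symbol standing in for omitted characters


def extract_candidates(abbrev, vocab):
    """Return vocabulary words that match the characters written
    before and after the elliptic symbol."""
    prefix, _, suffix = abbrev.partition(ELLIPSIS)
    return [
        word for word in vocab
        if len(word) >= len(prefix) + len(suffix)
        and word.startswith(prefix)
        and word.endswith(suffix)
    ]


def determine(prev_word, candidates, next_word, vocab, transitions):
    """Select a single candidate using the transition dictionary.
    Usage frequency breaks ties (an assumption of this sketch)."""
    def score(word):
        p_in = transitions.get((prev_word, word), 0.0)
        p_out = transitions.get((word, next_word), 0.0)
        return (p_in * p_out, vocab[word])

    return max(candidates, key=score) if candidates else None


candidates = extract_candidates("ch~er", VOCAB)
best = determine("handwritten", candidates, "recognition", VOCAB, TRANSITIONS)
```

With the sample data, the abbreviated input `ch~er` matches three vocabulary words, and the transition dictionary narrows them down to one in the context "handwritten ... recognition".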
6 Claims
1. A sentence processing apparatus comprising:
an input unit for inputting characters, a display unit for displaying said input characters, and a processing unit for converting and editing said input characters, wherein said processing unit includes:
candidate word extraction means which extracts candidate words for an elliptic word by referring to a vocabulary dictionary storing a word and its usage frequency, to a dictionary of transition between words defining information on transition between words and a probability of transition between words, and by searching the characters before and after the elliptic character included in the input sentence in the vocabulary dictionary, and
determination means which selects a single word among said extracted candidate words by referring to said dictionary of transition between words.
2. A sentence processing apparatus according to claim 1, wherein
said input unit includes a tablet for allowing an input of words by handwriting, and said processing unit includes recognition means for extracting and recognizing stroke information input by handwriting.
3. A sentence processing apparatus according to claim 1, wherein
said processing unit includes vocabulary dictionary building means for decomposing an input sentence into individual words, and storing an occurrence count of said individual word in said sentence and said individual word into said vocabulary dictionary.
4. A sentence processing apparatus according to claim 1, wherein
said processing unit includes means for building a dictionary of transition between words for decomposing an input sentence into individual words, and storing a transition count between said individual words in said sentence and said individual word into said dictionary of transition between words.
5. A sentence processing method comprising:
a step of decomposing an input sentence into individual words, and storing an occurrence count of an individual word in said sentence and said individual word into a vocabulary dictionary,
a step of storing a transition count between said individual words into a dictionary of transition between words and searching a class of a particle for said individually decomposed word,
a step of extracting candidate words of omitted words by referring to said vocabulary dictionary on characters before and after an elliptic symbol included in said input sentence, and
a step of determining a single word among said candidate words extracted on a basis of said dictionary of transition between words.
6. A sentence processing method comprising:
a step of decomposing an input sentence into individual words, and storing an occurrence count of an individual word in said sentence and said individual word into a vocabulary dictionary,
a step of storing a transition count between said individual words into a dictionary of transition between words and searching a class of a particle for said individually decomposed word,
a step of extracting a candidate of omitted words by referring to said vocabulary dictionary on characters before and after an elliptic symbol included in said input sentence, and
a step of determining a single word among said candidate words extracted on a basis of said dictionary of transition between words,
wherein in a case where said determined word is found in said vocabulary dictionary, an occurrence count of said determined word is modified and said dictionary of transition between words is modified on a basis of information on transition between words.
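The dictionary-building steps of claims 3 to 6 and the update step at the end of claim 6 might look roughly like the sketch below. It is illustrative only and rests on assumptions not stated in the claims: words are split on whitespace (the claims do not say how a sentence is decomposed), transition probabilities are taken as relative counts, the particle-class lookup is omitted, and all function names are hypothetical.

```python
from collections import Counter, defaultdict


def build_dictionaries(sentences):
    """Decompose each sentence into words, then count word occurrences
    (vocabulary dictionary) and word-to-word transitions
    (dictionary of transition between words)."""
    vocab = Counter()
    transition_counts = defaultdict(Counter)
    for sentence in sentences:
        words = sentence.lower().split()  # assumption: whitespace tokenization
        vocab.update(words)
        for prev_word, next_word in zip(words, words[1:]):
            transition_counts[prev_word][next_word] += 1
    return vocab, transition_counts


def transition_probability(transition_counts, prev_word, next_word):
    """Estimate P(next_word | prev_word) as a relative count
    (an assumption of this sketch)."""
    total = sum(transition_counts[prev_word].values())
    return transition_counts[prev_word][next_word] / total if total else 0.0


def record_determined_word(vocab, transition_counts, prev_word, word, next_word):
    """Update step: if the determined word is already in the vocabulary
    dictionary, modify its occurrence count and the transition counts
    around it. Returns False when the word is unknown."""
    if word not in vocab:
        return False
    vocab[word] += 1
    transition_counts[prev_word][word] += 1
    transition_counts[word][next_word] += 1
    return True


vocab, transition_counts = build_dictionaries(["the input unit", "the display unit"])
record_determined_word(vocab, transition_counts, "the", "input", "unit")
```

Storing raw counts rather than probabilities keeps the update step a simple increment; probabilities are derived on demand from the current counts.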
Specification