Natural language processing method
First Claim
1. A natural language processing method comprising:
- a reception step of receiving a sentence appended with information associated with a pause setting position;
a part-of-speech acquisition step of acquiring parts of speech of respective words in the sentence;
an acquisition step of acquiring frequencies of occurrence of arrangements of the parts of speech for respective part-of-speech sequence groups with the same arrangements of parts of speech corresponding to arrangements of the words;
a count step of counting the number of pause setting positions each of which is present between parts of speech in the part-of-speech sequence for respective part-of-speech sequence groups on the basis of the information associated with the pause setting position; and
a calculation step of calculating pause insertability values using the frequencies of occurrence and the number of setting positions for respective part-of-speech sequence groups.
1 Assignment
0 Petitions
Accused Products
Abstract
A sentence appended with information associated with a pause setting position is input (S201), and a morphological analysis process is applied to the sentence to divide the sentence into words and to determine parts of speech of respective words (S203). Part-of-speech sequences, each of which includes parts of speech of a total of N (N≧2) words before and after each word boundary, are obtained for respective word boundaries, and the frequencies of occurrence of arrangements of the parts of speech are calculated for respective groups of part-of-speech sequences with the same arrangements of parts of speech (S206). Pause counts, each of which indicates the number of times of setting of a pause setting position indicated by the pause setting position data between parts of speech in the part-of-speech sequence, are calculated for respective groups of the part-of-speech sequences with the same arrangements of parts of speech (S208). Pause insertability values are calculated using the frequencies of occurrence and pause counts for respective groups (S210).
-
Citations
7 Claims
-
1. A natural language processing method comprising:
-
a reception step of receiving a sentence appended with information associated with a pause setting position;
a part-of-speech acquisition step of acquiring parts of speech of respective words in the sentence;
an acquisition step of acquiring frequencies of occurrence of arrangements of the parts of speech for respective part-of-speech sequence groups with the same arrangements of parts of speech corresponding to arrangements of the words;
a count step of counting the number of pause setting positions each of which is present between parts of speech in the part-of-speech sequence for respective part-of-speech sequence groups on the basis of the information associated with the pause setting position; and
a calculation step of calculating pause insertability values using the frequencies of occurrence and the number of setting positions for respective part-of-speech sequence groups. - View Dependent Claims (2, 3, 4, 6, 7)
-
-
5. A natural language processing apparatus comprising:
-
reception means for receiving a sentence appended with information associated with a pause setting position;
part-of-speech acquisition means for acquiring parts of speech of respective words in the sentence;
acquisition means for acquiring frequencies of occurrence of arrangements of the parts of speech for respective part-of-speech sequence groups with the same arrangements of parts of speech corresponding to arrangements of the words;
count means for counting the number of pause setting positions each of which is present between parts of speech in the part-of-speech sequence for respective part-of-speech sequence groups on the basis of the information associated with the pause setting position; and
calculation means for calculating pause insertability values using the frequencies of occurrence and the number of setting positions for respective part-of-speech sequence groups.
-
Specification