Chinese prosodic words forming method and apparatus
Abstract
The present invention provides a method and an apparatus for forming Chinese prosodic words. The method comprises the steps of: inputting Chinese text; performing word segmentation and part-of-speech annotation on the input Chinese text to generate an initial prosodic word sequence; inserting grids representing prosodic word boundaries for all the words in the initial prosodic word sequence to generate a grid prosodic word sequence; annotating, based on the prosodic word forming means, the grids ready to be deleted in the grid prosodic word sequence; judging, based on the prosodic word forming means, which of the grids ready to be deleted actually need to be deleted; deleting those grids from the grid prosodic word sequence; and forming the words between every two adjacent remaining grids into prosodic words. The invention avoids, as far as possible, prosodic word insertion errors that would render the pronunciation hard to understand or unnatural, and reduces the number of insertion errors at prosodic word boundaries.
13 Claims
1. A method of forming Chinese prosodic words, characterized in that said method comprises the steps of:
inputting Chinese text;
performing word segmentation and part-of-speech annotation on the input Chinese text to generate an initial prosodic word sequence;
inserting grids representing prosodic word boundaries for all the words in the initial prosodic word sequence to generate a grid prosodic word sequence;
annotating the grids ready to be deleted in the grid prosodic word sequence based on the prosodic word forming means;
judging, based on the prosodic word forming means, which of the grids ready to be deleted actually need to be deleted;
deleting the grids which actually need to be deleted from the grid prosodic word sequence; and
forming the words between every two adjacent remaining grids into prosodic words.
Dependent claims: 2, 3, 4, 5.
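The steps of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the claim does not spell out the "prosodic word forming means", so the rules here (`FUNCTION_TAGS`, `MAX_SYLLABLES`, and all function names) are hypothetical stand-ins. The sketch also assumes the text is already segmented and tagged, that one Chinese character counts as one syllable, and that a grid is modelled as a boolean between two adjacent words.

```python
from typing import List, Tuple

Word = Tuple[str, str]  # (word, part-of-speech tag) from segmentation + POS annotation

# Hypothetical stand-ins for the "prosodic word forming means".
FUNCTION_TAGS = {"u", "p", "c"}  # assumed tags: particle, preposition, conjunction
MAX_SYLLABLES = 3                # assumed cap on prosodic word length


def insert_grids(words: List[Word]) -> List[bool]:
    """Insert a grid (boundary) between every pair of adjacent words."""
    return [True] * max(len(words) - 1, 0)


def annotate_candidates(words: List[Word]) -> List[bool]:
    """Mark grids ready to be deleted: a neighbour is a one-character function word."""
    marks = []
    for (lw, lt), (rw, rt) in zip(words, words[1:]):
        left_clitic = len(lw) == 1 and lt in FUNCTION_TAGS
        right_clitic = len(rw) == 1 and rt in FUNCTION_TAGS
        marks.append(left_clitic or right_clitic)
    return marks


def judge_deletions(words: List[Word], candidates: List[bool]) -> List[bool]:
    """Decide which candidate grids are actually deleted (length cap check)."""
    delete = []
    current_len = len(words[0][0]) if words else 0
    for i, is_candidate in enumerate(candidates):
        next_len = len(words[i + 1][0])
        if is_candidate and current_len + next_len <= MAX_SYLLABLES:
            delete.append(True)
            current_len += next_len   # the merged unit keeps growing
        else:
            delete.append(False)
            current_len = next_len    # a new unit starts after a kept grid
    return delete


def form_prosodic_words(words: List[Word]) -> List[str]:
    """Run grid insertion, annotation, judging, deletion, and word forming."""
    if not words:
        return []
    grids = insert_grids(words)
    candidates = annotate_candidates(words)
    for i, drop in enumerate(judge_deletions(words, candidates)):
        if drop:
            grids[i] = False          # delete the grid
    result = [words[0][0]]
    for has_grid, (word, _tag) in zip(grids, words[1:]):
        if has_grid:
            result.append(word)       # boundary kept: start a new prosodic word
        else:
            result[-1] += word        # boundary deleted: merge into the previous one
    return result


tagged = [("我", "r"), ("的", "u"), ("朋友", "n"), ("喜欢", "v"), ("吃", "v"), ("苹果", "n")]
print(form_prosodic_words(tagged))  # ['我的', '朋友', '喜欢', '吃', '苹果']
```

In this example both grids around 的 are annotated as ready to be deleted, but only the first is actually deleted: merging 我的 with 朋友 would exceed the assumed length cap. Separating annotation from judging in this way is what lets the method start from a boundary after every word and remove only the ones that pass both checks.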
6. An apparatus of forming Chinese prosodic words, characterized in that said apparatus comprises:
an input part for inputting Chinese text;
a word segmentation and part-of-speech annotating part for performing word segmentation and part-of-speech annotation on the input Chinese text to generate an initial prosodic word sequence;
a prosodic word grid insert part for inserting grids representing prosodic word boundaries for all the words in the initial prosodic word sequence to generate a grid prosodic word sequence;
a prosodic word grid delete part for annotating the grids ready to be deleted in the grid prosodic word sequence based on the prosodic word forming means, judging, based on the prosodic word forming means, which of the grids ready to be deleted actually need to be deleted, and deleting the grids which actually need to be deleted from the grid prosodic word sequence; and
a prosodic word generating part for forming the words between every two adjacent remaining grids into prosodic words.
Dependent claims: 7, 8, 9, 10, 11.
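The division of the apparatus into parts can be pictured as a small object that wires replaceable components together. The Python sketch below is illustrative only: `ProsodicWordApparatus`, `toy_segmenter`, and `particle_deleter` are hypothetical names, the segmenter merely parses pre-annotated `word/tag` tokens instead of performing real segmentation, and the grid delete part is collapsed into a single callable with one assumed rule (drop the grid in front of a particle tagged `u`).

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Word = Tuple[str, str]  # (word, part-of-speech tag)


def toy_segmenter(text: str) -> List[Word]:
    """Stand-in segmentation part: expects pre-annotated 'word/tag' tokens."""
    pairs = []
    for token in text.split():
        word, tag = token.rsplit("/", 1)
        pairs.append((word, tag))
    return pairs


def particle_deleter(words: List[Word], grids: List[bool]) -> List[bool]:
    """Stand-in grid delete part: drop the grid in front of a particle ('u')."""
    return [kept and words[i + 1][1] != "u" for i, kept in enumerate(grids)]


@dataclass
class ProsodicWordApparatus:
    segment: Callable[[str], List[Word]]
    delete_grids: Callable[[List[Word], List[bool]], List[bool]]

    def run(self, text: str) -> List[str]:
        words = self.segment(text)               # segmentation and POS annotating part
        if not words:
            return []
        grids = [True] * (len(words) - 1)        # grid insert part
        grids = self.delete_grids(words, grids)  # grid delete part
        out = [words[0][0]]                      # generating part
        for kept, (word, _tag) in zip(grids, words[1:]):
            if kept:
                out.append(word)
            else:
                out[-1] += word
        return out


apparatus = ProsodicWordApparatus(toy_segmenter, particle_deleter)
print(apparatus.run("我/r 的/u 朋友/n"))  # ['我的', '朋友']
```

Keeping each part behind its own callable means a different segmenter or a different set of deletion rules can be swapped in without touching the grid insertion or word generation logic.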
12. A program of forming Chinese prosodic words, characterized in that said program comprises:
inputting Chinese text;
performing word segmentation and part-of-speech annotation on the input Chinese text to generate an initial prosodic word sequence;
inserting grids representing prosodic word boundaries for all the words in the initial prosodic word sequence to generate a grid prosodic word sequence;
annotating the grids ready to be deleted in the grid prosodic word sequence based on the prosodic word forming means;
judging, based on the prosodic word forming means, which of the grids ready to be deleted actually need to be deleted;
deleting the grids which actually need to be deleted from the grid prosodic word sequence; and
forming the words between every two adjacent remaining grids into prosodic words.
13. A readable storage medium storing a Chinese prosodic word forming program, characterized in that said readable storage medium stores a program comprising:
inputting Chinese text;
performing word segmentation and part-of-speech annotation on the input Chinese text to generate an initial prosodic word sequence;
inserting grids representing prosodic word boundaries for all the words in the initial prosodic word sequence to generate a grid prosodic word sequence;
annotating the grids ready to be deleted in the grid prosodic word sequence based on the prosodic word forming means;
judging, based on the prosodic word forming means, which of the grids ready to be deleted actually need to be deleted;
deleting the grids which actually need to be deleted from the grid prosodic word sequence; and
forming the words between every two adjacent remaining grids into prosodic words.
Specification