Automatic speech segmentation and verification method and system
First Claim
1. An automatic speech segmentation and verification method comprising:
- a retrieving step, for retrieving a recorded speech corpus, the recorded speech corpus corresponding to a known text script, the known text script defining phonetic information with N phonetic units;
a segmenting step, for segmenting the recorded speech corpus into N test speech unit segments referring to the phonetic information of the N phonetic units in the known text script;
a segment-confidence-measure verifying step, for verifying segment confidence measures of N cutting points of the test speech unit segments to determine if the N cutting points of the test speech unit segments are correct;
a phonetic-confidence-measure verifying step, for verifying phonetic confidence measures of the test speech unit segments to determine if the test speech unit segments correspond to the known text script; and
a determining step, for determining acceptance of the phonetic unit by comparing a combination of segment reliability and the phonetic confidence measures of the test speech unit segments to a predetermined threshold value;
wherein if the combined confidence measure is greater than the predetermined threshold value, the phonetic is accepted.
1 Assignment
0 Petitions
Accused Products
Abstract
An automatic speech segmentation and verification system and method is disclosed, which has a known text script and a recorded speech corpus corresponding to the known text script. A speech unit segmentor segments the recorded speech corpus into N test speech unit segments referring to the phonetic information of the known text script. Then, a segmental verifier is applied to obtain a confidence measure of syllable segmentation for verifying the correctness of the cutting points of test speech unit segments. A phonetic verifier obtains a confidence measure of syllable verification by using verification models for verifying whether the recorded speech corpus is correctly recorded. Finally, a speech unit inspector integrates the confidence measure of syllable segmentation and the confidence measure of syllable verification to determine whether the test speech unit segment is accepted or not.
41 Citations
18 Claims
-
1. An automatic speech segmentation and verification method comprising:
-
a retrieving step, for retrieving a recorded speech corpus, the recorded speech corpus corresponding to a known text script, the known text script defining phonetic information with N phonetic units;
a segmenting step, for segmenting the recorded speech corpus into N test speech unit segments referring to the phonetic information of the N phonetic units in the known text script;
a segment-confidence-measure verifying step, for verifying segment confidence measures of N cutting points of the test speech unit segments to determine if the N cutting points of the test speech unit segments are correct;
a phonetic-confidence-measure verifying step, for verifying phonetic confidence measures of the test speech unit segments to determine if the test speech unit segments correspond to the known text script; and
a determining step, for determining acceptance of the phonetic unit by comparing a combination of segment reliability and the phonetic confidence measures of the test speech unit segments to a predetermined threshold value;
wherein if the combined confidence measure is greater than the predetermined threshold value, the phonetic is accepted. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. An automatic speech segmentation and verification system comprising:
-
a database for storing a known text script and a recorded speech corpus corresponding to the known text script, and the known text script has phonetic information with N speech unit segment wherein N is a positive integer;
a speech unit segmentor for segmenting the recorded speech corpus into N test speech unit segments referring to the phonetic information of the known text script;
a segmental verifier for verifying the correctness of the cutting points of test speech unit segments by obtaining a segmental confidence measure;
a phonetic verifier for obtaining a confidence measure of syllable verification by using verification models for verifying whether the recorded speech corpus is correctly recorded; and
a speech unit inspector for integrating the confidence measure of syllable segmentation and the confidence measure of syllable verification to determine whether the test speech unit segment is accepted. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
Specification