METHOD AND APPARATUS FOR RECOGNIZING SPEECH
First Claim
1. A method of recognizing speech comprising, the steps of:
- determining a boundary between phonemes included in character sequences that are phonetically input to detect each phoneme interval;
calculating reliability according to a probability that a phoneme indicated by the detected phoneme interval corresponds to a phoneme included in a predefined phoneme model;
calculating a phoneme alignment cost with respect to the character sequences based on the calculated reliability and a pre-trained and stored phoneme recognition probability distribution; and
performing phoneme alignment based on the calculated phoneme alignment cost to perform speech recognition on the input character sequences.
1 Assignment
0 Petitions
Accused Products
Abstract
Provided are an apparatus and method for recognizing speech, in which reliability with respect to phoneme-recognized phoneme sequences is calculated and performance of speech recognition is enhanced using the calculated results. The method of recognizing speech includes the steps of: determining a boundary between phonemes included in character sequences that are phonetically input to detect each phoneme interval; calculating reliability according to a probability that a phoneme indicated by the detected phoneme interval corresponds to a phoneme included in a predefined phoneme model; calculating a phoneme alignment cost with respect to the character sequences based on the calculated reliability and a pre-trained and stored phoneme recognition probability distribution; and performing phoneme alignment based on the calculated phoneme alignment cost to perform speech recognition on the input character sequences. As a result, reliability with respect to the phoneme-recognized phoneme sequences can be calculated, and the performance of speech recognition can be enhanced using the calculated results.
-
Citations
20 Claims
-
1. A method of recognizing speech comprising, the steps of:
-
determining a boundary between phonemes included in character sequences that are phonetically input to detect each phoneme interval; calculating reliability according to a probability that a phoneme indicated by the detected phoneme interval corresponds to a phoneme included in a predefined phoneme model; calculating a phoneme alignment cost with respect to the character sequences based on the calculated reliability and a pre-trained and stored phoneme recognition probability distribution; and performing phoneme alignment based on the calculated phoneme alignment cost to perform speech recognition on the input character sequences. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. An apparatus for recognizing speech, comprising:
-
a phoneme interval detector for detecting each phoneme interval by determining a boundary between phonemes included in phonetically input character sequences; a reliability determination unit for calculating reliability according to probabilities that a phoneme indicated by each detected phoneme interval corresponds to each phoneme included in a predefined phoneme model; a reliability-based phoneme error model for storing a phoneme recognition probability distribution obtained by pre-training that a phonetically input phoneme is recognized as a phoneme; and a word recognition unit for calculating a phoneme alignment cost with respect to the character sequences based on the calculated reliability and the phoneme recognition probability distribution, and performing phoneme alignment based on the calculated phoneme alignment cost to perform speech recognition with respect to the character sequences. - View Dependent Claims (15, 16, 17, 18, 19, 20)
-
Specification