Speech recognition method and apparatus
First Claim
Patent Images
1. A speech recognition apparatus comprising:
- (a) a memory for storing;
(i) a word dictionary having recognition target words, said word dictionary comprising a tree structure in which the recognition target words share a predetermined speech unit;
(ii) a first acoustic model which expresses a reference pattern of the speech unit by one or more states; and
(iii) a second acoustic model which is lower in precision than said first acoustic model;
(b) selection means for selecting a state of interest from the tree structure;
(c) checking means for checking the number of branches of the selected state; and
(d) likelihood calculation means for calculating a likelihood of an acoustic feature parameter for states immediately succeeding the selected state using the first acoustic model, if the number of branches of the selected state is equal to or more than a predetermined value, and otherwise calculating a likelihood of an acoustic feature parameter for states immediately succeeding the selected state using the second acoustic model;
wherein in calculating a likelihood with respect to a state of interest by using said second acoustic model, if likelihood calculation using said first acoustic model has been performed for a state having the same speech unit alignment as that of the state of interest, said likelihood calculation means reuses a result of the likelihood calculation as a result of likelihood calculation for the state of interest.
1 Assignment
0 Petitions
Accused Products
Abstract
A speech recognition apparatus includes a word dictionary having recognition target words, a first acoustic model which expresses a reference pattern of a speech unit by one or more states, a second acoustic model which is lower in precision than said first acoustic model, selection means for selecting one of said first acoustic model and said second acoustic model on the basis of a parameter associated with a state of interest, and likelihood calculation means for calculating a likelihood of an acoustic feature parameter with respect to said acoustic model selected by said selection means.
-
Citations
8 Claims
-
1. A speech recognition apparatus comprising:
-
(a) a memory for storing; (i) a word dictionary having recognition target words, said word dictionary comprising a tree structure in which the recognition target words share a predetermined speech unit; (ii) a first acoustic model which expresses a reference pattern of the speech unit by one or more states; and (iii) a second acoustic model which is lower in precision than said first acoustic model; (b) selection means for selecting a state of interest from the tree structure; (c) checking means for checking the number of branches of the selected state; and (d) likelihood calculation means for calculating a likelihood of an acoustic feature parameter for states immediately succeeding the selected state using the first acoustic model, if the number of branches of the selected state is equal to or more than a predetermined value, and otherwise calculating a likelihood of an acoustic feature parameter for states immediately succeeding the selected state using the second acoustic model; wherein in calculating a likelihood with respect to a state of interest by using said second acoustic model, if likelihood calculation using said first acoustic model has been performed for a state having the same speech unit alignment as that of the state of interest, said likelihood calculation means reuses a result of the likelihood calculation as a result of likelihood calculation for the state of interest. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A speech recognition method of performing speech recognition in a speech recognition apparatus by using (i) a word dictionary having recognition target words, the word dictionary comprising a tree structure in which the recognition target words share a predetermined speech unit, (ii) a first acoustic model which expresses a reference pattern of the speech unit by one or more states, and (iii) a second acoustic model which is lower in precision than the first acoustic model, the method comprising the steps of:
-
selecting a state of interest from the tree structure; checking the number of branches of the selected state; and calculating a likelihood of an acoustic feature parameter for states immediately succeeding the selected state using the first acoustic model, if the number of branches of the selected state is equal to or more than a predetermined value, and otherwise calculating a likelihood of an acoustic feature parameter for states immediately succeeding the selected state using the second acoustic model; wherein in calculating a likelihood with respect to a state of interest by using said second acoustic model, if likelihood calculation using said first acoustic model has been performed for a state having the same speech unit alignment as that of the state of interest, said likelihood calculation means reuses a result of the likelihood calculation as a result of likelihood calculation for the state of interest. - View Dependent Claims (7, 8)
-
Specification