Word boundary acoustic units
First Claim
1. A speech recognition system for recognizing an input utterance of spoken words, the system comprising:
- a set of word models for modeling vocabulary to be recognized, each word model being associated with a word in the vocabulary, each word in the vocabulary considered as a sequence of phones including a first phone and a last phone, wherein each word model begins in the middle of the first phone of its associated word and ends in the middle of the last phone of its associated word;
a set of word connecting models for modeling acoustic transitions between the middle of a word'"'"'s last phone and the middle of an immediately succeeding word'"'"'s first phone; and
a recognition engine for processing the input utterance in relation to the set of word models and the set of word connecting models to cause recognition of the input utterance.
7 Assignments
0 Petitions
Accused Products
Abstract
A speech recognition system recognizes an input utterance of spoken words. The system includes a set of word models for modeling vocabulary to be recognized, each word model being associated with a word in the vocabulary, each word in the vocabulary considered as a sequence of phones including a first phone and a last phone, wherein each word model begins in the middle of the first phone of its associated word and ends in the middle of the last phone of its associated word; a set of word connecting models for modeling acoustic transitions between the middle of a word'"'"'s last phone and the middle of an immediately succeeding word'"'"'s first phone; and a recognition engine for processing the input utterance in relation to the set of word models and the set of word connecting models to cause recognition of the input utterance.
-
Citations
36 Claims
-
1. A speech recognition system for recognizing an input utterance of spoken words, the system comprising:
-
a set of word models for modeling vocabulary to be recognized, each word model being associated with a word in the vocabulary, each word in the vocabulary considered as a sequence of phones including a first phone and a last phone, wherein each word model begins in the middle of the first phone of its associated word and ends in the middle of the last phone of its associated word;
a set of word connecting models for modeling acoustic transitions between the middle of a word'"'"'s last phone and the middle of an immediately succeeding word'"'"'s first phone; and
a recognition engine for processing the input utterance in relation to the set of word models and the set of word connecting models to cause recognition of the input utterance. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method of a speech recognition system for recognizing an input utterance of spoken words, the method comprising:
-
modeling vocabulary to be recognized with a set of word models, each word model being associated with a word in the vocabulary, each word in the vocabulary being considered as a sequence of phones including a first phone and a last phone, wherein each word model begins in the middle of the first phone of its associated word and ends in the middle of the last phone of its associated word;
modeling acoustic transitions between the middle of a word'"'"'s last phone and the middle of an immediately succeeding word'"'"'s first phone with a set of word connecting models; and
processing with a recognition engine the input utterance in relation to the set of word models and the set of word connecting models to cause recognition of the input utterance. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. An improved speech recognition system of the type employing word models, wherein the improvement comprises:
-
a set of word models for modeling vocabulary to be recognized, each word model being associated with a word in the vocabulary, each word in the vocabulary considered as a sequence of phones including a first phone and a last phone, wherein each word model begins in the middle of the first phone of its associated word and ends in the middle of the last phone of its associated word; and
a set of word connecting models for modeling acoustic transitions between the middle of a word'"'"'s last phone and the middle of an immediately succeeding word'"'"'s first phone. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26, 27)
-
-
28. An improved method of a speech recognition system for recognizing an input utterance of spoken words, the improvement comprising:
-
modeling vocabulary to be recognized with a set of word models, each word model being associated with a word in the vocabulary, each word in the vocabulary being considered as a sequence of phones including a first phone and a last phone, wherein each word model begins in the middle of the first phone of its associated word and ends in the middle of the last phone of its associated word; and
modeling acoustic transitions between the middle of a word'"'"'s last phone and the middle of an immediately succeeding word'"'"'s first phone with a set of word connecting models. - View Dependent Claims (29, 30, 31, 32, 33, 34, 35, 36)
-
Specification