Meaning token dictionary for automatic speech recognition
Abstract
Systems and methods of automatic speech recognition are described. In one aspect, an automatic speech recognition system includes a speech recognition dictionary and a speech recognizer. The speech recognition dictionary includes a plurality of meaning tokens each associated with one or more pronunciations of one or more vocabulary words and signifying a single meaning. The speech recognizer is configured to convert spoken input into a sequence of meaning tokens contained in the speech recognition dictionary and corresponding to a sequence of vocabulary words most likely to have been spoken by a user.
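The dictionary described above can be illustrated with a minimal sketch. It assumes the recognizer has already produced the most likely vocabulary words as text, so a plain word lookup stands in for the acoustic decoding against pronunciations that the patent describes. The names MeaningToken, DICTIONARY, and to_meaning_tokens, the sample words, and the label values are hypothetical and do not come from the patent; the labels field loosely mirrors the application-specific category labels recited in the claims.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class MeaningToken:
    """A single meaning shared by several differently pronounced words."""
    name: str
    # Hypothetical application-specific category labels carried by the token.
    labels: Tuple[str, ...] = ()


# Illustrative tokens and labels (assumed, not taken from the patent).
YES = MeaningToken("YES", labels=("CONFIRMATION",))
NO = MeaningToken("NO", labels=("REJECTION",))

# Several vocabulary words map to the same meaning token.
DICTIONARY: Dict[str, MeaningToken] = {
    "yes": YES, "yeah": YES, "yep": YES,
    "no": NO, "nope": NO, "nah": NO,
}


def to_meaning_tokens(words: List[str]) -> List[MeaningToken]:
    """Convert recognized words into meaning tokens.

    Words that are not in the dictionary are skipped in this sketch.
    """
    tokens = []
    for word in words:
        token = DICTIONARY.get(word.lower())
        if token is not None:
            tokens.append(token)
    return tokens
```

In this sketch, to_meaning_tokens(["yeah"]) and to_meaning_tokens(["yes"]) return the same token, so an application only has to handle YES and can read token.labels instead of enumerating every spoken variant.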
23 Claims
1. An automatic speech recognition system, comprising:
a speech recognition dictionary comprising a plurality of meaning tokens, wherein a single meaning token has a same meaning associated with plural different spoken words that have different pronunciations but similar spoken meanings; and
a speech recognizer configured to convert spoken input into a sequence of meaning tokens contained in the speech recognition dictionary and corresponding to a sequence of vocabulary words most likely to have been spoken by a user, wherein different spoken inputs having different spoken words but similar meanings are converted into a same meaning token or same sequence of meaning tokens, and wherein at least one meaning token encodes one or more labels identifying one or more respective application-specific categories for use by an application program.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
12. An automatic speech recognition method, comprising:
converting spoken input into a sequence of meaning tokens contained in a speech recognition dictionary and corresponding to a sequence of vocabulary words most likely to have been spoken by a user, wherein the speech recognition dictionary comprises a plurality of meaning tokens, and a single meaning token has a same meaning associated with plural different spoken words that have different pronunciations but similar spoken meanings such that different spoken inputs having different spoken words but similar spoken meanings are converted into a same meaning token or same sequence of meaning tokens, and wherein at least one meaning token encodes one or more labels identifying one or more respective application-specific categories for use by an application program.
Dependent claims: 13, 14, 15, 16, 17, 18, 19, 20, 21
22. A computer program for automatically recognizing speech, the computer program residing on a computer-readable medium and comprising computer-readable instructions for causing a computer to:
convert spoken input into a sequence of meaning tokens contained in a speech recognition dictionary and corresponding to a sequence of vocabulary words most likely to have been spoken by a user, wherein the speech recognition dictionary resides on the computer-readable medium and comprises a plurality of meaning tokens, and a single meaning token has a same meaning associated with plural different spoken words that have different pronunciations but similar spoken meanings such that different spoken inputs having different spoken words but similar spoken meanings are converted into a same meaning token or same sequence of meaning tokens, and wherein at least one meaning token encodes one or more labels identifying one or more respective application-specific categories for use by an application program.
Dependent claim: 23
Specification