Apparatus and method for synthesized audible response to an utterance in speaker-independent voice recognition
Abstract
When a speaker-independent voice-recognition (SIVR) system recognizes a spoken utterance that matches a phonetic representation of a speech element belonging to a predefined vocabulary, it may play a synthesized speech fragment as a means for the user to verify that the utterance was correctly recognized. When a speech element in the vocabulary has more than one possible pronunciation, the system may select the one most closely matching the user's utterance, and play a synthesized speech fragment corresponding to that particular representation.
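The selection step described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration, not taken from the patent: the vocabulary entries, the ARPAbet-style phoneme symbols, the use of edit distance as the closeness measure, and the names `VOCABULARY`, `edit_distance`, `select_representation`, and `synthesize`. A real SIVR system would score acoustic features of the digitized voice signal rather than compare clean phoneme sequences, and would drive an actual speech synthesizer.

```python
from typing import Dict, List, Sequence, Tuple

Phonemes = Tuple[str, ...]

# Hypothetical vocabulary: each speech element maps to one or more
# phonetic representations (alternative pronunciations).
VOCABULARY: Dict[str, List[Phonemes]] = {
    "either": [("IY", "DH", "ER"), ("AY", "DH", "ER")],
    "data": [("D", "EY", "T", "AH"), ("D", "AE", "T", "AH")],
    "call": [("K", "AO", "L")],
}


def edit_distance(a: Sequence[str], b: Sequence[str]) -> int:
    """Levenshtein distance between two phoneme sequences (assumed metric)."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        curr = [i]
        for j, y in enumerate(b, 1):
            curr.append(min(prev[j] + 1,              # deletion
                            curr[j - 1] + 1,          # insertion
                            prev[j - 1] + (x != y)))  # substitution
        prev = curr
    return prev[-1]


def select_representation(
    utterance: Sequence[str],
    vocabulary: Dict[str, List[Phonemes]] = VOCABULARY,
) -> Tuple[str, Phonemes]:
    """Return the (speech element, representation) pair closest to the utterance."""
    candidates = [(element, rep)
                  for element, reps in vocabulary.items()
                  for rep in reps]
    return min(candidates, key=lambda pair: edit_distance(utterance, pair[1]))


def synthesize(representation: Phonemes) -> str:
    """Stand-in for a synthesizer: returns the phoneme string that would be voiced."""
    return " ".join(representation)


# The user said "either" with the AY pronunciation; the response echoes that variant.
element, rep = select_representation(("AY", "DH", "ER"))
response = synthesize(rep)
```

Because the search runs over every pronunciation variant rather than one canonical entry per word, the audible confirmation follows the pronunciation the user actually used.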
24 Claims
1. A method comprising:
- selecting one of a plurality of phonetic representations of speech elements of a predefined vocabulary that most closely matches an utterance, wherein said plurality of phonetic representations includes multiple phonetic representations of any of said speech elements having different possible pronunciations; and
- synthesizing an audible speech fragment according to said one of said phonetic representations.
Dependent claims: 2, 3, 4, 5, 6
7. An apparatus comprising:
- a processor to select one of a plurality of phonetic representations of speech elements of a predefined vocabulary that most closely matches a portion of an incoming digitized voice signal corresponding to an utterance, wherein said plurality of phonetic representations includes multiple phonetic representations of any of said speech elements having different possible pronunciations, and to synthesize an outgoing digitized voice signal according to said one of said phonetic representations.
Dependent claims: 8, 9, 10, 11, 12, 13
14. A voice-operated, mobile cellular telephone comprising:
- a transceiver;
- an antenna; and
- a processor to select one of a plurality of phonetic representations of speech elements of a predefined vocabulary that most closely matches a portion of an incoming digitized voice signal corresponding to an utterance, wherein said plurality of phonetic representations includes multiple phonetic representations of any of said speech elements having different possible pronunciations, and to synthesize an outgoing digitized voice signal according to said one of said phonetic representations.
Dependent claims: 15, 16, 17, 18, 19, 20
21. An article comprising a computer-readable storage medium having stored thereon instructions that, when executed by a processor, result in:
- selecting one of a plurality of phonetic representations of speech elements of a predefined vocabulary that most closely matches an utterance, wherein said plurality of phonetic representations includes multiple phonetic representations of any of said speech elements having different possible pronunciations; and
- synthesizing an audible speech fragment according to said one of said phonetic representations.
Dependent claims: 22, 23, 24
Specification