Mobile terminal controllable by spoken utterances
First Claim
Patent Images
1. A network server for mobile terminals which are controllable by spoken utterances, comprising:
- a unit for providing acoustic models for automatic recognition of the spoken utterances, the unit for providing acoustic models translating a textual transcription of a spoken utterance into a sequence of phonetic transcription units and the sequence of phonetic transcription units into a sequence of phonetic recognition units, the sequence of phonetic recognition units forming an acoustic model of the spoken utterance; and
an interface for transmitting the acoustic models to the mobile terminals.
1 Assignment
0 Petitions
Accused Products
Abstract
A mobile terminal (100) which is controllable by spoken utterances like proper names or command words is described. The mobile terminal (100) comprises an interface (200) for receiving from a network server (300) acoustic models for automatic speech recognition and an automatic speech recognizer (110) for recognizing the spoken utterances based on the received acoustic models. The invention further relates to a network server (300) for mobile terminals (100) which are controllable by spoken utterances and to a method for obtaining acoustic models for a mobile terminal (100) controllable by spoken utterances.
-
Citations
27 Claims
-
1. A network server for mobile terminals which are controllable by spoken utterances, comprising:
-
a unit for providing acoustic models for automatic recognition of the spoken utterances, the unit for providing acoustic models translating a textual transcription of a spoken utterance into a sequence of phonetic transcription units and the sequence of phonetic transcription units into a sequence of phonetic recognition units, the sequence of phonetic recognition units forming an acoustic model of the spoken utterance; and
an interface for transmitting the acoustic models to the mobile terminals. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A network server for mobile terminals which are controllable by spoken utterances, comprising:
-
a unit for providing acoustic models for automatic recognition of spoken utterances;
a speech synthesizer for generating voice prompts of textual transcriptions, the voice prompts being usable as acoustic feedback; and
an interface for transmitting the acoustic models and the voice prompts to the mobile terminals. - View Dependent Claims (11, 14, 15, 16, 17, 18, 19)
-
-
12. A network server for mobile terminals which are controllable by spoken utterances, comprising:
-
a unit for providing acoustic models for automatic recognition of the spoken utterances;
a voice prompt database for storing voice prompts corresponding to the spoken utterances, the voice prompts being utilized as acoustic feedback;
an interface in communication with the unit for providing acoustic models and the voice prompt database, the interface enabling transmission of the acoustic models and the voice prompts to the mobile terminals.
-
-
13. A mobile terminal controllable by spoken utterances, comprising:
-
an interface for receiving from a network server acoustic models which were created on the basis of textual transcriptions of the spoken utterances, the received acoustic models being comprised of a sequence of phonetic recognition units, each phonetic recognition unit being derived from a corresponding phonetic transcription unit; and
an automatic speech recognizer for recognizing the spoken utterances based on the phonetic recognition units of the received acoustic models.
-
-
20. A method for obtaining acoustic models for automatic speech recognition in a mobile terminal controllable by spoken utterances, comprising:
-
providing acoustic models by a network server, one or more of the provided acoustic models being obtained by translating a textual transcription of a spoken utterance into a sequence of phonetic transcription units and the sequence of phonetic transcription units into a sequence of phonetic recognition units, the sequence of phonetic recognition units forming the acoustic model of the spoken utterance;
transmitting the acoustic models from the network server to the mobile terminal; and
automatically recognizing the spoken utterances within the mobile terminal based on the phonetic recognition units of the acoustic models transmitted by the network server. - View Dependent Claims (21, 22, 23, 24, 25, 27)
-
-
26. A computer program product comprising program code portions for performing when the computer program product is run on a network server the steps of
providing acoustic models, one or more of the provided acoustic models being obtained by translating a textual transcription of a spoken utterance into a sequence of phonetic transcription units and the sequence of phonetic transcription units into a sequence of phonetic recognition units, the sequence of phonetic recognition units forming the acoustic model of the spoken utterance; transmitting the acoustic models from the network server to a mobile terminal to enable automatic recognition of the spoken utterances within the mobile terminal based on the phonetic recognition units of the acoustic models transmitted by the network server.
Specification