Focused language models for improved speech input of structured documents
Abstract
An e-mail message processor is provided for use with a personal digital assistant. It accepts input speech and converts it to text using a focused language model, which is downloaded over a cellular phone connection from an Internet server that provides the model based upon a topic for the intended e-mail message. The text generated from the input speech can be summarized by the e-mail message processor and edited by the user. The resulting e-mail message can then be transmitted, again via the cellular connection, to an Internet e-mail server for delivery to a recipient.
32 Claims
1. A speech recognition processor for processing input speech and converting to text, comprising:
topic determination means for determining a topic of the input speech;
register determination means for determining a register of an outgoing message based on a user-specified register, wherein said register determining means is one or more of a keypad user interface device, a touch-based user interface device, and a speech recognition interface device that allows a user to input said register;
speech input means for allowing a user to input a speech message;
a language model retrieval section for retrieving a focused language model based upon said topic and said register;
a speech recognition module which uses said retrieved focused language model to convert said speech message to text; and
a display section for displaying said text.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
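The retrieval and recognition elements of claim 1 can be illustrated with a minimal Python sketch. It assumes, purely for illustration, that a focused language model is a table of unigram word probabilities keyed by a (topic, register) pair, and that the recognizer has already produced several candidate transcriptions to choose between. The names `FOCUSED_MODELS`, `retrieve_focused_model`, `pick_transcription`, and the probability values are hypothetical and do not come from the patent.

```python
import math
from typing import Dict, List, Tuple

# Hypothetical store of focused models: (topic, register) -> unigram probabilities.
# The words and probabilities below are made-up illustration data.
FOCUSED_MODELS: Dict[Tuple[str, str], Dict[str, float]] = {
    ("meeting", "formal"): {
        "please": 0.2, "confirm": 0.2, "the": 0.2, "meeting": 0.2, "time": 0.2,
    },
    ("meeting", "informal"): {
        "see": 0.25, "you": 0.25, "at": 0.25, "lunch": 0.25,
    },
}

# Small probability assigned to words the focused model has never seen.
UNSEEN_PROB = 1e-4


def retrieve_focused_model(topic: str, register: str) -> Dict[str, float]:
    """Look up the focused model for a topic and register (raises KeyError if absent)."""
    return FOCUSED_MODELS[(topic, register)]


def score(words: List[str], model: Dict[str, float]) -> float:
    """Sum of log-probabilities of the words under the unigram model."""
    return sum(math.log(model.get(word, UNSEEN_PROB)) for word in words)


def pick_transcription(candidates: List[str], topic: str, register: str) -> str:
    """Return the candidate transcription the focused model considers most likely."""
    model = retrieve_focused_model(topic, register)
    return max(candidates, key=lambda text: score(text.split(), model))


if __name__ == "__main__":
    candidates = [
        "please confirm the meeting time",
        "police confirm the meeting thyme",
    ]
    print(pick_transcription(candidates, "meeting", "formal"))
```

Because the model is narrowed to one topic and register, acoustically similar but off-topic words ("police", "thyme") receive only the unseen-word probability, so the on-topic candidate wins. A real recognizer would use a far richer model than unigrams; the sketch only shows how the topic and register select the model.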
10. A personal digital computer device, comprising:
a housing including a display screen and an input keypad disposed on an outer surface thereof;
a microphone unit disposed in said housing;
a transmitter/receiver device disposed in said housing;
a processor for processing input speech, including topic determining means for determining a topic of the input speech, register determination means for determining a register of an outgoing message based on a user-specified register, speech input means for allowing a user to input a voice message via said microphone, a language model retrieval section adapted for accessing an internet server via said transmitter/receiver device for retrieving a language model from the internet server based upon said topic and said register, a speech recognition module which uses said retrieved language model to convert said speech message to text, and a display section for displaying said text on said display screen, wherein said register determining means is one or more of a keypad user interface device, a touch-based user interface device, and a speech recognition interface device that allows a user to input said register.
Dependent claims: 11, 12, 13, 14, 15, 16, 17
18. A personal computer implemented speech recognition e-mail processor, comprising:
topic determining means for determining a topic of an outgoing e-mail message;
register determination means for determining a register of an outgoing message based on a user-specified register, wherein said register determining means is one or more of a keypad user interface device, a touch-based user interface device, and a speech recognition interface device that allows a user to input said register;
speech input means for allowing a user to input a voice message;
a language model retrieval section for retrieving a language model based upon said topic and said register;
a speech recognition section which uses said retrieved language model to convert said voice message to text;
a display section for displaying said text in an e-mail template; and
a transmission section for transmitting said e-mail template via a cellular-internet connection.
Dependent claims: 19, 20, 21, 22, 23, 24
25. A personal computer implemented speech recognition e-mail processor, comprising:
register determining means for determining a register of an outgoing e-mail message based on a user-specified register;
register inferred for an outgoing message replying to a received message based on one or more of: (a) metadata describing how the received message was formatted, and (b) forms of address present in the received message;
speech input means for allowing a user to input a speech message;
a language model retrieval section for retrieving a language model based upon said register;
a speech recognition section which uses said retrieved language model to convert said speech message to text;
a display section for displaying said text in an e-mail template; and
a transmission section for transmitting said e-mail template via a cellular-internet connection, wherein said register determining means is one or more of a keypad user interface device and a touch-based user interface device that allows a user to input said register.
Dependent claims: 26, 27, 28
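Claim 25 recites inferring the register of a reply from (a) metadata describing how the received message was formatted and (b) forms of address in the received message. The following Python sketch shows one way such an inference could look. The greeting lists, the `has_letterhead` metadata key, and the function name `infer_register` are assumptions made for illustration; the claim does not specify any particular heuristics.

```python
from typing import Mapping

# Hypothetical greeting prefixes treated as signals of register (illustrative only).
FORMAL_ADDRESS = ("dear ", "to whom it may concern")
INFORMAL_ADDRESS = ("hi ", "hi,", "hey ", "hey,")


def infer_register(body: str, metadata: Mapping[str, object]) -> str:
    """Guess "formal" or "informal" for a reply to the received message.

    Forms of address in the first line are checked first; formatting
    metadata is used only as a fallback when no greeting is recognized.
    """
    stripped = body.strip()
    first_line = stripped.splitlines()[0].lower() if stripped else ""

    # (b) forms of address present in the received message
    if first_line.startswith(FORMAL_ADDRESS):
        return "formal"
    if first_line.startswith(INFORMAL_ADDRESS):
        return "informal"

    # (a) metadata describing how the received message was formatted;
    # "has_letterhead" is an assumed, hypothetical metadata flag.
    return "formal" if metadata.get("has_letterhead") else "informal"


if __name__ == "__main__":
    print(infer_register("Dear Dr. Smith,\nThank you for your note.", {}))
    print(infer_register("Hey Sam,\nlunch?", {}))
    print(infer_register("See attached.", {"has_letterhead": True}))
```

The inferred value could then be offered as a default that the user confirms or overrides through the keypad or touch interface, consistent with the user-specified register recited in the same claim.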
29. A personal computer implemented speech recognition e-mail processor, comprising:
register determining means for determining a register of an outgoing e-mail message based on a user-specified register;
register inferred for an outgoing message replying to a received message based on one or more of: (a) metadata describing how the received message was formatted, and (b) forms of address present in the received message;
speech input means for allowing a user to input a speech message;
a language model retrieval section for retrieving a language model based upon said register;
a speech recognition section which uses said retrieved language model to convert said speech message to text;
a display section for displaying said text in an e-mail template; and
a transmission section for transmitting said e-mail template via a cellular-internet connection, wherein said register determining means is a speech recognition interface device that allows a user to input said register.
Dependent claims: 30, 31, 32
Specification