Application text entry in a mobile environment using a speech processing facility
Abstract
In embodiments of the present invention improved capabilities are described for a mobile environment speech processing facility. The present invention may provide for the entering of text into a software application resident on a mobile communication facility, where recorded speech may be presented by the user using the mobile communications facility's resident capture facility. Transmission of the recording may be provided through a wireless communication facility to a speech recognition facility, and may be accompanied by information related to the software application. Results may be generated utilizing the speech recognition facility that may be independent of structured grammar, and may be based at least in part on the information relating to the software application and the recording. The results may then be transmitted to the mobile communications facility, where they may be loaded into the software application.
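The message that accompanies the recording can be pictured with a small sketch. This is only an illustration under stated assumptions: the field names (`app_id`, `device_id`, `in_app_context`), the JSON encoding, and the helper names are hypothetical and do not come from the patent, which does not prescribe a wire format.

```python
import base64
import json
from dataclasses import dataclass, asdict


@dataclass
class ApplicationInfo:
    """Information relating to the software application (hypothetical field names)."""
    app_id: str           # identity of the application
    device_id: str        # identity of the mobile communication facility
    in_app_context: dict  # contextual information within the application


def build_request(recording: bytes, info: ApplicationInfo) -> str:
    """Bundle the recorded speech and the application information into one JSON message."""
    return json.dumps({
        "audio_b64": base64.b64encode(recording).decode("ascii"),
        "application": asdict(info),
    })


def load_results(text_fields: dict, field_name: str, results: str) -> None:
    """Load recognition results into an application text field (modeled here as a dict)."""
    text_fields[field_name] = results
```

A caller would build the request on the device, send it over whatever transport is available, and pass the returned text to `load_results`. The transport itself is left out because the source does not specify one.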
26 Claims
1. A method of entering text into a software application resident on a mobile communication facility comprising:
recording speech presented by a user using a mobile communication facility resident capture facility;
transmitting the recording through a wireless communication facility to a speech recognition facility;
transmitting contextual information relating to the software application to the speech recognition facility, wherein the contextual information relating to the software application includes an identity of the application, an identity of the mobile communication facility, and contextual information within the application;
wherein the speech recognition facility transmits the contextual information within the application to a first server, wherein the first server decides to process said contextual information within the application or alternatively, the first server transmits the contextual information within the application to a second server and wherein the speech recognition facility further transmits the identity of the application to the second server;
selecting at least one statistical language model from a plurality of language models;
generating results utilizing the speech recognition facility using the at least one statistical language model based at least in part on the information relating to the software application and the recording;
transmitting the results to the mobile communications facility; and
loading the results into the software application.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 26
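The selecting step of claim 1 might be sketched as follows. This is a minimal illustration, not the patented selection logic: the `LanguageModel` record, the matching rule, and the `fallback` parameter are all assumptions introduced here, since the claim only requires that at least one model be chosen from a plurality.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LanguageModel:
    """Hypothetical descriptor for one statistical language model."""
    name: str
    apps: frozenset      # application identities this model suits
    contexts: frozenset  # in-application contexts this model suits


def select_language_models(models, app_id, context, fallback="general"):
    """Return the models matching the application identity and in-app context.

    If nothing matches, return the model named `fallback` (when one exists).
    """
    matches = [m for m in models if app_id in m.apps and context in m.contexts]
    if not matches:
        matches = [m for m in models if m.name == fallback]
    return matches
```

With a registry holding a general model and an SMS dictation model, a request from the SMS application's message body would select the dictation model, while an unknown application would fall back to the general one.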
13. A method for entering text into a software application resident on a mobile communication facility comprising:
recording speech presented by a user into a mobile communication facility resident capture facility;
transmitting the recording through a wireless communication facility to a speech recognition facility;
transmitting information relating to the software application to the speech recognition facility, wherein the information relating to the software application includes an identity of the application, an identity of the mobile communication facility, and an identity of a text box within the application;
wherein the speech recognition facility transmits contextual information within the application to a first server, wherein the first server decides to process said contextual information within the application or alternatively, the first server transmits the contextual information within the application to a second server and wherein the speech recognition facility further transmits the identity of the application to the second server;
selecting an initial set of a plurality of language models for processing the recorded speech;
generating results utilizing the speech recognition facility and the initial set of language models;
transmitting the results to the mobile communications facility; and
loading the results into the software application.
Dependent claims: 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24
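One way to picture how a text box identity could lead to an initial set of language models, and how that set could then shape the results, is the toy sketch below. Everything in it is assumed for illustration: the unigram models, the mapping from text box to model names, and the premise that an acoustic front end has already produced candidate transcripts. The claim itself does not say how the models are built or scored.

```python
import math

# Hypothetical unigram language models: word -> probability.
LANGUAGE_MODELS = {
    "search": {"pizza": 0.4, "near": 0.3, "me": 0.3},
    "message": {"see": 0.3, "you": 0.3, "near": 0.1, "me": 0.3},
}

# Hypothetical mapping from text box identity to an initial set of model names.
INITIAL_SETS = {
    "search_box": ["search"],
    "message_body": ["message"],
}

UNSEEN = 1e-6  # floor probability for words a model has never seen


def score(words, model):
    """Log-probability of a word sequence under a unigram model."""
    return sum(math.log(model.get(w, UNSEEN)) for w in words)


def recognize(candidates, text_box_id):
    """Pick the candidate transcript that the initial set of models scores highest."""
    models = [LANGUAGE_MODELS[name] for name in INITIAL_SETS[text_box_id]]
    return max(candidates, key=lambda c: max(score(c.split(), m) for m in models))
```

Given the candidates "pizza near me" and "see you near me", the search box models favor the first and the message body models favor the second, which shows why the text box identity is useful context.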
25. A system, comprising:
a mobile communication device for recording speech and running a resident software module;
a speech recognition facility remote from a mobile communication facility;
a communications facility for transmitting the recorded speech and information relating to the software module to the speech recognition facility;
wherein the speech recognition facility generates results by processing the recorded speech by selecting at least one statistical language model, at least one of which is a large vocabulary statistical language model, from a plurality of language models and based at least in part on the information relating to the software application, wherein the information relating to the software application includes an identity of the application, an identity of the mobile communication facility, and contextual information within the application;
wherein the speech recognition facility transmits the contextual information within the application to a first server, wherein the first server decides to process said contextual information within the application or alternatively, the first server transmits the contextual information within the application to a second server and wherein the speech recognition facility further transmits the identity of the application to the second server.
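The first server and second server arrangement recited in the independent claims can be sketched roughly as below. The class names and, in particular, the rule the first server uses to decide whether to process the context (a set of supported context keys) are assumptions made for this example; the claims state only that the first server either processes the contextual information or passes it on, and that the speech recognition facility also sends the application identity to the second server.

```python
class SecondServer:
    """Receives the application identity, plus any context the first server forwards."""

    def __init__(self):
        self.app_ids = []
        self.contexts = []

    def receive_app_id(self, app_id):
        self.app_ids.append(app_id)

    def receive_context(self, context):
        self.contexts.append(context)


class FirstServer:
    """Processes in-application context itself or forwards it to the second server."""

    def __init__(self, second_server, supported_keys):
        self.second_server = second_server
        self.supported_keys = set(supported_keys)  # hypothetical decision rule
        self.processed = []

    def receive_context(self, context):
        if set(context) <= self.supported_keys:
            self.processed.append(context)
            return "processed"
        self.second_server.receive_context(context)
        return "forwarded"


class SpeechRecognitionFacility:
    """Sends the context to the first server and the app identity to the second."""

    def __init__(self, first_server, second_server):
        self.first_server = first_server
        self.second_server = second_server

    def dispatch(self, app_id, context):
        outcome = self.first_server.receive_context(context)
        self.second_server.receive_app_id(app_id)
        return outcome
```

In this sketch the application identity reaches the second server directly from the speech recognition facility, while the in-application context reaches it only when the first server declines to process it.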
Specification