System and method of providing speech processing in user interface
First Claim
1. A method comprising:
receiving, via touch provided on a touch screen of a device, an indication associated with a specific field displayed in a user interface on the touch screen, the indication signaling that speech, which is associated with the specific field, will follow;
receiving the speech via the device and generating speech data based on the speech;
generating, by the device, a request for speech recognition, wherein the request comprises:
(1) an application identifier identifying a speech recognizer on a public network node;
(2) a location parameter specific to a current location of the device, the device being associated with a speaker of the speech; and
(3) a grammar parameter associated with a home location of the speaker of the speech, the grammar parameter identifying a particular grammar;
transmitting the speech data and the request to the public network node for speech recognition using the speech recognizer;
receiving, at the device, text associated with the speech data from the speech recognizer; and
inserting the text into the specific field.
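The request recited in the claim carries three elements: an application identifier, a location parameter, and a grammar parameter. The following is a minimal sketch of how a device might assemble such a request, not the patented implementation. The parameter names (`appid`, `location`, `grammar`), the endpoint URL, and the sample values are all assumptions made for illustration; the claim does not specify any wire format.

```python
from dataclasses import dataclass
from urllib.parse import urlencode, urlsplit, parse_qs


@dataclass(frozen=True)
class RecognitionRequest:
    """The three request elements from the claim (field names are hypothetical)."""
    app_id: str    # (1) identifies the speech recognizer on the public network node
    location: str  # (2) current location of the device
    grammar: str   # (3) grammar associated with the speaker's home location

    def to_url(self, endpoint: str) -> str:
        # Encode the three parameters as a query string. The speech data itself
        # would travel separately, for example as the HTTP request body.
        query = urlencode({
            "appid": self.app_id,
            "location": self.location,
            "grammar": self.grammar,
        })
        return f"{endpoint}?{query}"


# Hypothetical values: a recognizer id, a latitude/longitude pair, and a grammar name.
request = RecognitionRequest(
    app_id="demo-recognizer",
    location="40.7128,-74.0060",
    grammar="directory-home-region",
)
url = request.to_url("https://speech.example.com/recognize")

# Round-trip the query string to confirm all three parameters survive encoding.
parsed = parse_qs(urlsplit(url).query)
```

Keeping the location and grammar parameters separate mirrors the claim's distinction between where the device currently is and which grammar, tied to the speaker's home location, the recognizer should apply.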
Abstract
Disclosed are systems, methods, and computer-readable media for enabling speech processing in a user interface of a device. The method includes: receiving an indication of a field in a user interface of a device, the indication also signaling that speech will follow; receiving the speech from the user at the device, the speech being associated with the field; transmitting the speech as a request to a public, common network node that receives and processes speech; processing the transmitted speech and returning text associated with the speech to the device; and inserting the text into the field. Upon a second indication from the user, the system processes the text in the field as programmed by the user interface. The present disclosure provides a speech mashup application for a user interface of a mobile or desktop device that does not require expensive speech processing technologies.
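The flow in the abstract (a first indication selecting a field, speech being converted to text and inserted, then a second indication triggering the user interface's own processing) can be sketched as a small state holder. This is an illustrative sketch only: the class `SpeechEnabledForm`, its method names, and the stub recognizer standing in for the network node are all hypothetical and do not come from the patent.

```python
from typing import Callable, Dict, Optional


class SpeechEnabledForm:
    """Tracks which field is awaiting speech and fills it with recognized text."""

    def __init__(self,
                 recognize: Callable[[bytes], str],
                 on_submit: Callable[[Dict[str, str]], str]) -> None:
        self.fields: Dict[str, str] = {}
        self._recognize = recognize   # stands in for the remote speech recognizer
        self._on_submit = on_submit   # whatever the user interface is programmed to do
        self._active_field: Optional[str] = None

    def first_indication(self, field_name: str) -> None:
        # The user selects a field, signaling that speech for it will follow.
        self._active_field = field_name

    def speech_received(self, audio: bytes) -> None:
        # Send the audio for recognition and insert the returned text into the field.
        if self._active_field is None:
            raise RuntimeError("no field was selected before speech arrived")
        self.fields[self._active_field] = self._recognize(audio)
        self._active_field = None

    def second_indication(self) -> str:
        # The user confirms; the interface processes the text now in the fields.
        return self._on_submit(dict(self.fields))


def stub_recognizer(audio: bytes) -> str:
    # Placeholder for the network round trip; always returns the same text.
    return "pizza near me"


form = SpeechEnabledForm(
    recognize=stub_recognizer,
    on_submit=lambda fields: f"searching for: {fields['query']}",
)
form.first_indication("query")
form.speech_received(b"\x00\x01")
result = form.second_indication()
```

Passing the recognizer in as a callable keeps the form logic independent of how the speech is actually processed, which matches the abstract's point that the device itself does not need its own speech processing technology.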
23 Claims
1. A method comprising:
receiving, via touch provided on a touch screen of a device, an indication associated with a specific field displayed in a user interface on the touch screen, the indication signaling that speech, which is associated with the specific field, will follow;
receiving the speech via the device and generating speech data based on the speech;
generating, by the device, a request for speech recognition, wherein the request comprises:
(1) an application identifier identifying a speech recognizer on a public network node;
(2) a location parameter specific to a current location of the device, the device being associated with a speaker of the speech; and
(3) a grammar parameter associated with a home location of the speaker of the speech, the grammar parameter identifying a particular grammar;
transmitting the speech data and the request to the public network node for speech recognition using the speech recognizer;
receiving, at the device, text associated with the speech data from the speech recognizer; and
inserting the text into the specific field.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14
15. A device comprising:
a touch screen;
a processor; and
a computer-readable medium storing instructions which, when executed by the processor, cause the processor to perform operations comprising:
receiving, via touch provided on the touch screen, an indication associated with a specific field displayed in a user interface on the touch screen, the indication signaling that speech, which is associated with the specific field, will follow;
receiving the speech via the device and generating speech data based on the speech;
generating, by the device, a request for speech recognition, wherein the request comprises:
(1) an application identifier identifying a speech recognizer on a public network node;
(2) a location parameter specific to a current location of the device, the device being associated with a speaker of the speech; and
(3) a grammar parameter associated with a home location of the speaker of the speech, the grammar parameter identifying a particular grammar;
transmitting the speech data and the request to the public network node for speech recognition using the speech recognizer;
receiving, at the device, text associated with the speech data from the speech recognizer; and
inserting the text into the specific field.
Dependent claims: 16, 17, 18, 19, 20, 21, 22
23. A computer-readable storage device having instructions stored which, when executed by a computing device, cause the computing device to perform operations comprising:
receiving, via touch provided on a touch screen of a device, an indication associated with a specific field displayed in a user interface on the touch screen, the indication signaling that speech, which is associated with the specific field, will follow;
receiving the speech via the device and generating speech data based on the speech;
generating, by the device, a request for speech recognition, wherein the request comprises:
(1) an application identifier identifying a speech recognizer on a public network node;
(2) a location parameter specific to a current location of the device, the device being associated with a speaker of the speech; and
(3) a grammar parameter associated with a home location of the speaker of the speech, the grammar parameter identifying a particular grammar;
transmitting the speech data and the request to the public network node for speech recognition using the speech recognizer;
receiving, at the device, text associated with the speech data from the speech recognizer; and
inserting the text into the specific field.
Specification