SYSTEM AND METHOD OF PROVIDING SPEECH PROCESSING IN USER INTERFACE
First Claim
1. A method comprising:
receiving, via touch provided on a touch screen of a device, an indication associated with a specific field displayed in a user interface on the touch screen, the indication signaling that speech, which is associated with the specific field, will follow;
receiving the speech via the device;
generating speech data based on the speech;
generating a request for speech recognition, wherein the request comprises:
(1) an application identifier identifying a speech recognizer;
(2) a current location of the device; and
(3) a grammar parameter associated with a home location of a speaker of the speech;
transmitting the speech data and the request to a network node for speech recognition using the speech recognizer;
receiving, at the device, a transcription of the speech from the speech recognizer; and
inserting the transcription into the specific field.
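The request recited in claim 1 carries three elements alongside the speech data. The following is a minimal sketch of how a client might package them, assuming a URL query-string encoding. The class name, parameter names (`appId`, `location`, `grammar`), and the wire format are hypothetical illustrations and are not taken from the claims.

```python
from dataclasses import dataclass
from urllib.parse import urlencode


@dataclass(frozen=True)
class RecognitionRequest:
    """The three request elements of claim 1 (all field names are hypothetical)."""
    app_id: str           # (1) application identifier identifying a speech recognizer
    device_location: str  # (2) current location of the device, e.g. "lat,lon"
    grammar: str          # (3) grammar parameter tied to the speaker's home location

    def to_query(self) -> str:
        # Serialize as URL query parameters; this wire format is an assumption.
        return urlencode({
            "appId": self.app_id,
            "location": self.device_location,
            "grammar": self.grammar,
        })


def insert_transcription(fields: dict, field_name: str, transcription: str) -> dict:
    """Return a copy of the form with the transcription placed in the chosen field."""
    updated = dict(fields)
    updated[field_name] = transcription
    return updated
```

In this sketch the speech data itself would travel separately (for example as a request body), and `insert_transcription` stands in for the final step of writing the returned text into the specific field the user touched.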
Abstract
Disclosed are systems, methods and computer-readable media for enabling speech processing in a user interface of a device. The method includes: receiving an indication of a field in a user interface of a device, the indication also signaling that speech will follow; receiving the speech from the user at the device, the speech being associated with the field; transmitting the speech as a request to a public, common network node that receives and processes speech; processing the transmitted speech and returning text associated with the speech to the device; and inserting the text into the field. Upon a second indication from the user, the system processes the text in the field as programmed by the user interface. The present disclosure provides a speech mash up application for a user interface of a mobile or desktop device that does not require expensive speech processing technologies.
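The two-indication flow described in the abstract can be sketched as a small state holder: a first indication selects a field and signals that speech will follow, the returned text is inserted into that field, and a second indication hands the field contents to whatever the user interface is programmed to do. This is only an illustrative sketch. The class, its method names, and the injected `recognize` and `on_submit` callables are hypothetical, with `recognize` standing in for the round trip to the network node.

```python
from typing import Callable, Dict, Optional


class SpeechEnabledForm:
    """Hypothetical form that fills one field per utterance."""

    def __init__(
        self,
        recognize: Callable[[bytes], str],
        on_submit: Callable[[Dict[str, str]], str],
    ) -> None:
        self.fields: Dict[str, str] = {}
        self._recognize = recognize    # stand-in for the network speech service
        self._on_submit = on_submit    # what the user interface does with the text
        self._armed_field: Optional[str] = None

    def first_indication(self, field_name: str) -> None:
        # The user selects a field, signaling that speech for it will follow.
        self._armed_field = field_name

    def speech_received(self, audio: bytes) -> None:
        # Send the speech for recognition and insert the returned text.
        if self._armed_field is None:
            raise RuntimeError("no field was selected before speech arrived")
        self.fields[self._armed_field] = self._recognize(audio)
        self._armed_field = None

    def second_indication(self) -> str:
        # The user confirms; process the field contents as the interface dictates.
        return self._on_submit(dict(self.fields))
```

Passing the recognizer in as a callable keeps the sketch independent of any particular speech service, which mirrors the abstract's point that the device itself does not need its own speech processing technology.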
20 Claims
9. The method of claim 9, wherein the grammar parameter controls a compilation of a plurality of grammars.
12. A system comprising:
a processor; and
a computer-readable storage medium having instructions stored which, when executed by the processor, cause the processor to perform operations comprising:
receiving, via touch provided on a touch screen of a device, an indication associated with a specific field displayed in a user interface on the touch screen, the indication signaling that speech, which is associated with the specific field, will follow;
receiving the speech via the device;
generating speech data based on the speech;
generating a request for speech recognition, wherein the request comprises:
(1) an application identifier identifying a speech recognizer;
(2) a current location of the device; and
(3) a grammar parameter associated with a home location of a speaker of the speech;
transmitting the speech data and the request to a network node for speech recognition using the speech recognizer;
receiving, at the device, a transcription of the speech from the speech recognizer; and
inserting the transcription into the specific field.
20. A computer-readable storage device having instructions stored which, when executed by a computing device, cause the computing device to perform operations comprising:
receiving, via touch provided on a touch screen of a device, an indication associated with a specific field displayed in a user interface on the touch screen, the indication signaling that speech, which is associated with the specific field, will follow;
receiving the speech via the device;
generating speech data based on the speech;
generating a request for speech recognition, wherein the request comprises:
(1) an application identifier identifying a speech recognizer;
(2) a current location of the device; and
(3) a grammar parameter associated with a home location of a speaker of the speech;
transmitting the speech data and the request to a network node for speech recognition using the speech recognizer;
receiving, at the device, a transcription of the speech from the speech recognizer; and
inserting the transcription into the specific field.
Specification