Speech recognition using loosely coupled components
First Claim
1. A system comprising:
an audio capture component, the audio capture component comprising means for capturing a first audio signal representing first speech of a user to produce a first captured audio signal;
a speech recognition processing component comprising means for performing automatic speech recognition on the first captured audio signal to produce first speech recognition results;
a first result processing component, the first result processing component comprising first means for processing the first speech recognition results to produce first result output;
a second result processing component, the second result processing component comprising second means for processing the first speech recognition results to produce second result output;
a context sharing component comprising means for identifying a first one of the first and second result processing components as being associated with a first context of the user at a first time, the context sharing component further comprising:
means for receiving credentials from the user;
means for identifying, based on the credentials, a list of at least one result processing component authorized for use on behalf of the user at the first time;
means for determining that the at least one result processing component in the list is associated with the context of the user at the first time; and
means for identifying a location of the at least one result processing component associated with the context of the user at the first time; and
speech recognition result provision means for providing, via a method selected based on the identified location, the first speech recognition results to the identified first one of the first and second result processing components.
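As an illustration only, the context sharing and result provision elements of claim 1 might be sketched in Python as follows. Every name here (`ResultProcessor`, `AUTHORIZED`, `select_target`, `delivery_method`, `provide_results`, the credential token, and the location and method labels) is hypothetical; the claim does not prescribe any data structure, credential scheme, or transport.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class ResultProcessor:
    name: str
    location: str  # hypothetical labels: "local" or "remote"
    context: str   # the user context this component is associated with


# Hypothetical registry: credential token -> components authorized for that user.
AUTHORIZED = {
    "token-alice": [
        ResultProcessor("editor", "local", "dictation"),
        ResultProcessor("terminal-app", "remote", "terminal-session"),
    ],
}


def select_target(credentials: str, user_context: str) -> Optional[ResultProcessor]:
    """Return the first authorized component associated with the user's current context."""
    for processor in AUTHORIZED.get(credentials, []):
        if processor.context == user_context:
            return processor
    return None


def delivery_method(processor: ResultProcessor) -> str:
    """Choose a provision method from the component's identified location (labels are assumptions)."""
    return "in-process call" if processor.location == "local" else "network message"


def provide_results(
    credentials: str, user_context: str, results: str
) -> Optional[Tuple[str, str, str]]:
    """Route recognition results to the component matching the user's context, if any."""
    target = select_target(credentials, user_context)
    if target is None:
        return None
    return (target.name, delivery_method(target), results)
```

In this sketch, a call such as `provide_results("token-alice", "terminal-session", "hello")` routes the results to the remote component and reports that a network message would be used, while unknown credentials yield `None`.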
Abstract
An automatic speech recognition system includes an audio capture component, a speech recognition processing component, and a result processing component which are distributed among two or more logical devices and/or two or more physical devices. In particular, the audio capture component may be located on a different logical device and/or physical device from the result processing component. For example, the audio capture component may be on a computer connected to a microphone into which a user speaks, while the result processing component may be on a terminal server which receives speech recognition results from a speech recognition processing server.
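The loose coupling described in the abstract can be pictured with a small Python sketch in which the three components exchange data only through queues. The queues stand in for whatever links connect the separate logical or physical devices; the function names, the queue-based transport, and the placeholder "recognition" step (which merely decodes bytes) are all assumptions made for illustration, not details from the patent.

```python
import queue

# Queues stand in for links between devices; a real deployment might use
# network connections instead (an assumption, not stated in the abstract).
audio_link = queue.Queue()
result_link = queue.Queue()


def audio_capture(samples: bytes) -> None:
    """Runs on the device attached to the microphone: forwards captured audio."""
    audio_link.put(samples)


def speech_recognition() -> None:
    """Runs on a recognition server: placeholder recognition that just decodes bytes."""
    captured = audio_link.get()
    result_link.put(captured.decode("utf-8"))


def result_processing() -> str:
    """Runs on, for example, a terminal server: consumes the recognition results."""
    return result_link.get().upper()


audio_capture(b"hello world")
speech_recognition()
output = result_processing()
```

Because each function touches only its own queue endpoints, any of the three could be moved to a different device without changing the others, which is the property the abstract emphasizes.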
6 Claims
4. A computer-implemented method for use with a system,
wherein the system comprises:
an audio capture component;
a speech recognition processing component;
a first result processing component;
a second result processing component;
a context sharing component; and
speech recognition result provision means;
wherein the method comprises:
(A) using the audio capture component to capture a first audio signal representing first speech of a user to produce a first captured audio signal;
(B) using the speech recognition processing component to perform automatic speech recognition on the first captured audio signal to produce first speech recognition results;
(C) using the first result processing component to process the first speech recognition results to produce first result output;
(D) using the second result processing component to process the first speech recognition results to produce second result output;
(E) using the context sharing component to identify a first one of the first and second result processing components as being associated with a first context of the user at a first time, wherein using the context sharing component to identify further comprises:
receiving credentials from the user;
identifying, based on the credentials, a list of at least one result processing component authorized for use on behalf of the user at the first time;
determining that the at least one result processing component in the list is associated with the context of the user at the first time; and
identifying a location of the at least one result processing component associated with the context of the user at the first time; and
(F) using the speech recognition result provision means to provide, via a method selected based on the identified location, the first speech recognition results to the identified first one of the first and second result processing components.
Specification