Speech recognition using loosely coupled components
DCFirst Claim
Patent Images
1. A system comprising:
- an audio capture component, the audio capture component comprising means for capturing a first audio signal representing first speech of a user to produce a first captured audio signal;
a speech recognition processing component comprising means for performing automatic speech recognition on the first captured audio signal to produce first speech recognition results;
a first result processing component, the first result processing component comprising first means for processing the first speech recognition results to produce first result output;
a second result processing component, the second result processing component comprising second means for processing the first speech recognition results to produce second result output;
a context sharing component comprising means for identifying a first one of the first and second result processing components as being associated with a first context of the user at a first time, the context sharing component further comprising;
means for identifying a list of at least one result processing component authorized for use on behalf of the user at the first time; and
means for determining that the at least one result processing component in the list is associated with the context of the user at the first time; and
speech recognition result provision means for providing the first speech recognition results to the identified first one of the first and second result processing components.
4 Assignments
Litigations
1 Petition
Accused Products
Abstract
An automatic speech recognition system includes an audio capture component, a speech recognition processing component, and a result processing component which are distributed among two or more logical devices and/or two or more physical devices. In particular, the audio capture component may be located on a different logical device and/or physical device from the result processing component. For example, the audio capture component may be on a computer connected to a microphone into which a user speaks, while the result processing component may be on a terminal server which receives speech recognition results from a speech recognition processing server.
4 Citations
42 Claims
-
1. A system comprising:
-
an audio capture component, the audio capture component comprising means for capturing a first audio signal representing first speech of a user to produce a first captured audio signal; a speech recognition processing component comprising means for performing automatic speech recognition on the first captured audio signal to produce first speech recognition results; a first result processing component, the first result processing component comprising first means for processing the first speech recognition results to produce first result output; a second result processing component, the second result processing component comprising second means for processing the first speech recognition results to produce second result output; a context sharing component comprising means for identifying a first one of the first and second result processing components as being associated with a first context of the user at a first time, the context sharing component further comprising; means for identifying a list of at least one result processing component authorized for use on behalf of the user at the first time; and means for determining that the at least one result processing component in the list is associated with the context of the user at the first time; and speech recognition result provision means for providing the first speech recognition results to the identified first one of the first and second result processing components. - View Dependent Claims (2)
-
-
3. A computer-implemented method for use with a system:
wherein the system comprises; an audio capture component; a speech recognition processing component; a first result processing component; a second result processing component; a context sharing component; and speech recognition result provision means; wherein the method comprises; (A) using the audio capture component to capture a first audio signal representing first speech of a user to produce a first captured audio signal; (B) using the speech recognition processing component to perform automatic speech recognition on the first captured audio signal to produce first speech recognition results; (C) using the first result processing component to process the first speech recognition results to produce first result output; (D) using second result processing component to process the first speech recognition results to produce second result output; (E) using the context sharing component to identify a first one of the first and second result processing components as being associated with a first context of the user at a first time, wherein using the context sharing component to identify further comprises; identifying a list of at least one result processing component authorized for use on behalf of the user at the first time; and determining that the at least one result processing component in the list is associated with the context of the user at the first time; and (F) using the speech recognition result provision means to provide the first speech recognition results to the identified first one of the first and second result processing components. - View Dependent Claims (4)
-
5. A system comprising:
-
a first audio capture component comprising first means for capturing a first audio signal representing speech of a user to produce a first captured audio signal; a first speech recognition processing component comprising first means for performing automatic speech recognition on the first captured audio signal to produce first speech recognition results; a first result processing component comprising first means for processing the first speech recognition results to produce first result output; and a context sharing component comprising means for dynamically coupling at least two of the first audio capture component, the first speech recognition processing component, and the first result processing component to each other at a first time, wherein the means for dynamically coupling further comprises; means for identifying a list of at least one result processing component authorized for use on behalf of the user at the first time; and means for determining that the at least one result processing component in the list is associated with the context of the user at the first time. - View Dependent Claims (6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23)
-
-
24. A computer-implemented method for use with a system:
-
wherein the system comprises; a first audio capture component; a first speech recognition processing component; a first result processing component; and a context sharing component; wherein the method comprises; (A) using the first audio capture component to capture a first audio signal representing speech of a user to produce a first captured audio signal; (B) using the first speech recognition processing component to perform automatic speech recognition on the first captured audio signal to produce first speech recognition results; (C) using the first result processing component to process the first speech recognition results to produce first result output; and (D) using the context sharing component to dynamically couple at least two of the first audio capture component, the first speech recognition processing component, and the first result processing component to each other at a first time, wherein using the context sharing component to dynamically couple further comprises; identifying a list of at least one result processing component authorized for use on behalf of the user at the first time; and determining that the at least one result processing component in the list is associated with the context of the user at the first time. - View Dependent Claims (25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42)
-
Specification