Speech Recognition Using Loosely Coupled Components
First Claim
1. A system comprising:
- a first device including an audio capture component, the audio capture component comprising means for capturing an audio signal representing speech of a user to produce a captured audio signal;
a speech recognition processing component comprising means for performing automatic speech recognition on the captured audio signal to produce speech recognition results;
a second device including a result processing component;
a context sharing component comprising;
means for determining that the result processing component is associated with a current context of the user; and
wherein the result processing component comprises means for processing the speech recognition results to produce result output.
10 Assignments
0 Petitions
Accused Products
Abstract
An automatic speech recognition system includes an audio capture component, a speech recognition processing component, and a result processing component which are distributed among two or more logical devices and/or two or more physical devices. In particular, the audio capture component may be located on a different logical device and/or physical device from the result processing component. For example, the audio capture component may be on a computer connected to a microphone into which a user speaks, while the result processing component may be on a terminal server which receives speech recognition results from a speech recognition processing server.
10 Citations
104 Claims
-
1. A system comprising:
-
a first device including an audio capture component, the audio capture component comprising means for capturing an audio signal representing speech of a user to produce a captured audio signal; a speech recognition processing component comprising means for performing automatic speech recognition on the captured audio signal to produce speech recognition results; a second device including a result processing component; a context sharing component comprising; means for determining that the result processing component is associated with a current context of the user; and wherein the result processing component comprises means for processing the speech recognition results to produce result output. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29)
-
-
30. A method, for use with a system, the method performed by at least one processor executing computer program instructions stored on a non-transitory computer-readable medium:
-
wherein the system comprises; a first device including an audio capture component; a speech recognition processing component; and a second device including a result processing component; wherein the method comprises; (A) using the audio capture component to capture an audio signal representing speech of a user to produce a captured audio signal; (B) using the speech recognition processing component to perform automatic speech recognition on the captured audio signal to produce speech recognition results; (C) determining that the result processing component is associated with a current context of the user; (D) in response to the determination that the result processing component is associated with the current context of the user, providing the speech recognition results to the result processing component; and (E) using the result processing component to process the speech recognition results to produce result output. - View Dependent Claims (31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58)
-
-
59. A system comprising:
-
an audio capture component, the audio capture component comprising means for capturing a first audio signal representing first speech of a user to produce a first captured audio signal; a speech recognition processing component comprising means for performing automatic speech recognition on the first captured audio signal to produce first speech recognition results; a first result processing component, the first result processing component comprising first means for processing the first speech recognition results to produce first result output; a second result processing component, the second result processing component comprising second means for processing the first speech recognition results to produce second result output; a context sharing component comprising means for identifying a first one of the first and second result processing components as being associated with a first context of the user at a first time; and speech recognition result provision means for providing the first speech recognition results to the identified first one of the first and second result processing components. - View Dependent Claims (60)
-
-
61. A computer-implemented method for use with a system:
wherein the system comprises; an audio capture component; a speech recognition processing component; a first result processing component; a second result processing component; a context sharing component; and speech recognition result provision means; wherein the method comprises; (A) using the audio capture component to capture a first audio signal representing first speech of a user to produce a first captured audio signal; (B) using the speech recognition processing component to perform automatic speech recognition on the first captured audio signal to produce first speech recognition results; (C) using the first result processing component to process the first speech recognition results to produce first result output; (D) using second result processing component to process the first speech recognition results to produce second result output; (E) using the context sharing component to identify a first one of the first and second result processing components as being associated with a first context of the user at a first time; (F) using the speech recognition result provision means to provide the first speech recognition results to the identified first one of the first and second result processing components. - View Dependent Claims (62)
-
63. A system comprising:
-
a first audio capture component comprising first means for capturing a first audio signal representing speech of a user to produce a first captured audio signal; a first speech recognition processing component comprising first means for performing automatic speech recognition on the first captured audio signal to produce first speech recognition results; a first result processing component comprising first means for processing the first speech recognition results to produce first result output; a context sharing component comprising means for dynamically coupling at least two of the first audio capture component, the first speech recognition processing component, and the first result processing component to each other. - View Dependent Claims (64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81)
-
-
82. A computer-implemented method for us with a system:
-
wherein the system comprises; a first audio capture component; a first speech recognition processing component; a first result processing component; and a context sharing component; wherein the method comprises; (A) using the first audio capture component to capture a first audio signal representing speech of a user to produce a first captured audio signal; (B) using the first speech recognition processing component to perform automatic speech recognition on the first captured audio signal to produce first speech recognition results; (C) using the first result processing component to process the first speech recognition results to produce first result output; (D) using the context sharing component to dynamically couple at least two of the first audio capture component, the first speech recognition processing component, and the first result processing component to each other. - View Dependent Claims (83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100)
-
-
101. A system comprising:
-
a first machine comprising; a target application; and a result processing component comprising; means for processing first speech recognition results to produce result output; means for providing the result output to the target application; an audio capture device, wherein the first machine does not include the audio capture device; and a context sharing component comprising means for logically coupling the result processing component to the audio capture device. - View Dependent Claims (102)
-
-
103. A computer-implemented method for use with a system:
-
the system comprising; a first machine comprising; a target application; and a result processing component comprising; an audio capture device, wherein the first machine does not include the audio capture device; and a context sharing component; wherein the method comprises; (A) using the result processing component to process first speech recognition results to produce result output; (B) using the result processing component to provide the result output to the target application; and (C) using the context sharing component to logically couple the result processing component to the audio capture device. - View Dependent Claims (104)
-
Specification