Hybrid Speech Recognition
First Claim
Patent Images
1. A computer-implemented method performed by a client device, the method comprising:
- (A) receiving a request from a requester to apply automatic speech recognition to an audio signal;
(B) providing the audio signal to a first automatic speech recognition engine in the client device;
(C) receiving first speech recognition results from the first automatic speech recognition engine;
(D) determining whether a second automatic speech recognition engine, in a server device, is accessible to the client device;
(E) if the second automatic speech recognition engine is determined not to be accessible to the client device, then providing the first speech recognition results to the requester in response to the request.
12 Assignments
0 Petitions
Accused Products
Abstract
A hybrid speech recognition system uses a client-side speech recognition engine and a server-side speech recognition engine to produce speech recognition results for the same speech. An arbitration engine produces speech recognition output based on one or both of the client-side and server-side speech recognition results.
-
Citations
21 Claims
-
1. A computer-implemented method performed by a client device, the method comprising:
-
(A) receiving a request from a requester to apply automatic speech recognition to an audio signal; (B) providing the audio signal to a first automatic speech recognition engine in the client device; (C) receiving first speech recognition results from the first automatic speech recognition engine; (D) determining whether a second automatic speech recognition engine, in a server device, is accessible to the client device; (E) if the second automatic speech recognition engine is determined not to be accessible to the client device, then providing the first speech recognition results to the requester in response to the request. - View Dependent Claims (2, 3)
-
-
4. An apparatus comprising:
-
means for receiving a request from a requester to apply automatic speech recognition to an audio signal; means for providing the audio signal to a first automatic speech recognition engine in the client device; means for receiving first speech recognition results from the first automatic speech recognition engine; means for determining whether a second automatic speech recognition engine, in a server device, is accessible to the client device; and means for providing the first speech recognition results to the requester in response to the request if the second automatic speech recognition engine is determined not to be accessible to the client device.
-
-
5. A computer-implemented method performed by a client device, the method comprising:
-
(A) receiving a request from a requester to apply automatic speech recognition to an audio signal; (B) providing the audio signal to a first automatic speech recognition engine in a server device; (C) receiving first speech recognition results from the first automatic speech recognition engine; (D) determining whether a second automatic speech recognition engine, in the client device, is accessible to the client device; (E) if the second automatic speech recognition engine is determined not to be accessible to the client device, then providing the first speech recognition results to the requester in response to the request.
-
-
6. An apparatus comprising:
-
means for receiving a request from a requester to apply automatic speech recognition to an audio signal; means for providing the audio signal to a first automatic speech recognition engine in a server device; means for receiving first speech recognition results from the first automatic speech recognition engine; means for determining whether a second automatic speech recognition engine, in the client device, is accessible to the client device; means for providing the first speech recognition results to the requester in response to the request if the second automatic speech recognition engine is determined not to be accessible to the client device.
-
-
7. A computer-implemented method performed by a client device, the method comprising:
-
(A) receiving a request from a requester to apply automatic speech recognition to an audio signal; (B) providing the audio signal to a first automatic speech recognition engine in the client device; (C) receiving first speech recognition results from the first automatic speech recognition engine at a first time; (D) providing the audio signal to a second automatic speech recognition engine in a server device; (E) determining whether second speech recognition results have been received by the client device from the second automatic speech recognition engine within a predetermined time period after the first time; (F) if the second speech recognition results have been received by the client device within the predetermined time period, then providing the second speech recognition results to the requester in response to the request; and (G) if the second speech recognition results have not been received by the client device within the predetermined time period, then providing the first speech recognition results to the requester in response to the request. - View Dependent Claims (8)
-
-
9. An apparatus comprising:
-
means for receiving a request from a requester to apply automatic speech recognition to an audio signal; means for providing the audio signal to a first automatic speech recognition engine in the client device; means for receiving first speech recognition results from the first automatic speech recognition engine at a first time; means for providing the audio signal to a second automatic speech recognition engine in a server device; means for determining whether second speech recognition results have been received by the client device from the second automatic speech recognition engine within a predetermined time period after the first time; means for providing the second speech recognition results to the requester in response to the request if the second speech recognition results have been received by the client device within the predetermined time period; and means for providing the first speech recognition results to the requester in response to the request if the second speech recognition results have not been received by the client device within the predetermined time period.
-
-
10. A computer-implemented method performed by a client device, the method comprising:
-
(A) receiving a request from a requester to apply automatic speech recognition to an audio signal; (B) providing the audio signal to a first automatic speech recognition engine in the client device; (C) providing the audio signal to a second automatic speech recognition engine in a server device; (D) receiving first speech recognition results from the first automatic speech recognition engine; (E) determining whether a confidence measure associated with the first speech recognition results exceeds a predetermined threshold; and (F) if the confidence measure exceeds the predetermined threshold, then providing the first speech recognition results to the requester in response to the request. - View Dependent Claims (11)
-
-
12. An apparatus comprising:
-
means for receiving a request from a requester to apply automatic speech recognition to an audio signal; means for providing the audio signal to a first automatic speech recognition engine in the client device; means for providing the audio signal to a second automatic speech recognition engine in a server device; means for receiving first speech recognition results from the first automatic speech recognition engine; means for determining whether a confidence measure associated with the first speech recognition results exceeds a predetermined threshold; and means for providing the first speech recognition results to the requester in response to the request if the confidence measure exceeds the predetermined threshold.
-
-
13. A computer-implemented method performed by a client device, the method comprising:
-
(A) receiving a request from a requester to apply automatic speech recognition to an audio signal; (B) providing the audio signal to a first automatic speech recognition engine in the client device; (C) receiving first speech recognition results from the first automatic speech recognition engine; (D) providing the audio signal to a second automatic speech recognition engine in a server device; (E) receiving second speech recognition results from the second automatic speech recognition engine; (F) producing hybrid speech recognition results based on the first speech recognition results and the second speech recognition results; and (G) providing the hybrid speech recognition results to the requester in response to the request. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
-
-
21. An apparatus comprising:
-
means for receiving a request from a requester to apply automatic speech recognition to an audio signal; means for providing the audio signal to a first automatic speech recognition engine in the client device; means for receiving first speech recognition results from the first automatic speech recognition engine; means for providing the audio signal to a second automatic speech recognition engine in a server device; means for receiving second speech recognition results from the second automatic speech recognition engine; means for producing hybrid speech recognition results based on the first speech recognition results and the second speech recognition results; and means for providing the hybrid speech recognition results to the requester in response to the request.
-
Specification