System and method for supplemental speech recognition by identified idle resources
First Claim
1. A method comprising:
- projecting, via a processor, an expected demand for speech recognition resources;
identifying, via the processor and based on the expected demand for speech recognition, a main speech recognizer and a supplemental speech recognizer;
assigning the main speech recognizer to recognize a plurality of speech from a plurality of speakers;
beginning to process the plurality of speech using the main speech recognizer, to yield main results; and
upon determining that first speech recognition results of a first user in the plurality of speakers have a lower accuracy than alternative speech recognition results of a second user;
assigning, via the processor, the supplemental speech recognizer to recognize speech from the first user, wherein the assigning of the supplemental speech recognizer is a reallocation of the supplemental speech recognizer away from the second user;
continuing to process the speech from the first user using the supplemental speech recognizer and the main speech recognizer, to yield supplemental results; and
combining additional speech recognition results, produced by the main speech recognizer, and the supplemental results.
3 Assignments
0 Petitions
Accused Products
Abstract
Disclosed herein are systems, methods, and computer-readable storage media for improving automatic speech recognition performance. A system practicing the method identifies idle speech recognition resources and establishes a supplemental speech recognizer on the idle resources based on overall speech recognition demand. The supplemental speech recognizer can differ from a main speech recognizer, and, along with the main speech recognizer, can be associated with a particular speaker. The system performs speech recognition on speech received from the particular speaker in parallel with the main speech recognizer and the supplemental speech recognizer and combines results from the main and supplemental speech recognizer. The system recognizes the received speech based on the combined results. The system can use beam adjustment in place of or in combination with a supplemental speech recognizer. A scheduling algorithm can tailor a particular combination of speech recognition resources and release the supplemental speech recognizer based on increased demand.
59 Citations
20 Claims
-
1. A method comprising:
-
projecting, via a processor, an expected demand for speech recognition resources; identifying, via the processor and based on the expected demand for speech recognition, a main speech recognizer and a supplemental speech recognizer; assigning the main speech recognizer to recognize a plurality of speech from a plurality of speakers; beginning to process the plurality of speech using the main speech recognizer, to yield main results; and upon determining that first speech recognition results of a first user in the plurality of speakers have a lower accuracy than alternative speech recognition results of a second user; assigning, via the processor, the supplemental speech recognizer to recognize speech from the first user, wherein the assigning of the supplemental speech recognizer is a reallocation of the supplemental speech recognizer away from the second user; continuing to process the speech from the first user using the supplemental speech recognizer and the main speech recognizer, to yield supplemental results; and combining additional speech recognition results, produced by the main speech recognizer, and the supplemental results. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system comprising:
-
a processor; and a computer-readable storage medium having instructions stored which, when executed by the processor, cause the processor to perform a method comprising; projecting an expected demand for speech recognition resources; identifying, based on the expected demand for speech recognition, a main speech recognizer and a supplemental speech recognizer; assigning the main speech recognizer to recognize a plurality of speech from a plurality of speakers; beginning to process the plurality of speech using the main speech recognizer, to yield main results; and upon determining that first speech recognition results of a first user in the plurality of speakers have a lower accuracy than alternative speech recognition results of a second user; assigning the supplemental speech recognizer to recognize speech from the first user, wherein the assigning of the supplemental speech recognizer is a reallocation of the supplemental speech recognizer away from the second user; continuing to process the speech from the first user using the supplemental speech recognizer and the main speech recognizer, to yield supplemental results; and combining additional speech recognition results, produced by the main speech recognizer, and the supplemental results. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A non-transitory computer-readable storage device having instructions stored which, when executed by a computing device, cause the computing device to perform operations comprising:
-
projecting, via the computing device, an expected demand for speech recognition resources; identifying, via the computing device and based on the expected demand for speech recognition, a main speech recognizer and a supplemental speech recognizer; assigning the main speech recognizer to recognize a plurality of speech from a plurality of speakers; beginning to process the plurality of speech using the main speech recognizer, to yield main results; and upon determining that first speech recognition results of a first user in the plurality of speakers have a lower accuracy than alternative speech recognition results of a second user; assigning the supplemental speech recognizer to recognize speech from the first user, wherein the assigning of the supplemental speech recognizer is a reallocation of the supplemental speech recognizer away from the second user; continuing to process the speech from the first user using the supplemental speech recognizer and the main speech recognizer, to yield supplemental results; and combining additional speech recognition results, produced by the main speech recognizer, and the supplemental results. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification