Methods and apparatus for buffering data for use in accordance with a speech recognition system
First Claim
1. A method for processing acoustic data to reduce one or more truncation errors associated with operation of a speech recognition system, the method comprising acts of:
- continuously recording acoustic data in a circular buffer;
when an indication that the speech recognition system is being addressed is detected, starting recording of acoustic data in a second buffer that is separate from the circular buffer;
obtaining combined acoustic data at least in part by prepending first acoustic data recorded in the circular buffer to a beginning of second acoustic data recorded in the second buffer; and
analyzing the combined acoustic data, which comprises data from the circular buffer and data from the second buffer, to identify a likely speech endpoint in the combined acoustic data, wherein the act of analyzing comprises using a boundary between the first and second acoustic data as a reference location wherein the act of analyzing the combined acoustic data comprises an act of identifying, among one or more regions in the combined acoustic data likely to correspond to silence, a region of silence closest to the reference location.
2 Assignments
0 Petitions
Accused Products
Abstract
Techniques are disclosed for overcoming errors in speech recognition systems. For example, a technique for processing acoustic data in accordance with a speech recognition system comprises the following steps/operations. Acoustic data is obtained in association with the speech recognition system. The acoustic data is recorded using a combination of a first buffer area and a second buffer area, such that the recording of the acoustic data using the combination of the two buffer areas at least substantially minimizes one or more truncation errors associated with operation of the speech recognition system.
60 Citations
30 Claims
-
1. A method for processing acoustic data to reduce one or more truncation errors associated with operation of a speech recognition system, the method comprising acts of:
-
continuously recording acoustic data in a circular buffer; when an indication that the speech recognition system is being addressed is detected, starting recording of acoustic data in a second buffer that is separate from the circular buffer; obtaining combined acoustic data at least in part by prepending first acoustic data recorded in the circular buffer to a beginning of second acoustic data recorded in the second buffer; and analyzing the combined acoustic data, which comprises data from the circular buffer and data from the second buffer, to identify a likely speech endpoint in the combined acoustic data, wherein the act of analyzing comprises using a boundary between the first and second acoustic data as a reference location wherein the act of analyzing the combined acoustic data comprises an act of identifying, among one or more regions in the combined acoustic data likely to correspond to silence, a region of silence closest to the reference location. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. Apparatus for processing acoustic data to reduce one or more truncation errors associated with operation of a speech recognition system, comprising:
-
at least one memory comprising a circular buffer and a second buffer that is separate from the circular buffer; and at least one processor coupled to the memory and operative to; continuously record acoustic data in the circular buffer; when an indication that the speech recognition system is being addressed is detected, start recording of acoustic data in a second buffer; obtain combined acoustic data at least in part by prepending first acoustic data recorded in the circular buffer to a beginning of second acoustic data recorded in the second buffer; and analyze the combined acoustic data, which comprises data from the circular buffer and data from the second buffer, to identify a likely speech endpoint in the combined acoustic data, wherein the act of analyzing comprises using a boundary between the first and second acoustic data as a reference location wherein the at least one processor is further operative to analyze the combined acoustic data at least in part by identifying, among one or more regions in the combined acoustic data likely to correspond to silence, a region of silence closest to the reference location. - View Dependent Claims (16, 17, 18)
-
-
19. At least one article of manufacture for use in processing acoustic data to reduce one or more truncation errors associated with operation of a speech recognition system, comprising at least one machine readable medium having encoded thereon one or more programs which when executed implement acts of:
-
continuously recording acoustic data in a circular buffer; when an indication that the speech recognition system is being addressed is detected, starting recording of acoustic data in a second buffer that is separate from the circular buffer; obtaining combined acoustic data at least in part by prepending first acoustic data recorded in the circular buffer to a beginning of second acoustic data recorded in the second buffer; and analyzing the combined acoustic data, which comprises data from the circular buffer and data from the second buffer, to identify a likely speech endpoint in the combined acoustic data, wherein the act of analyzing comprises using a boundary between the first and second acoustic data as a reference location wherein the act of analyzing the combined acoustic data comprises an act of identifying, among one or more regions in the combined acoustic data likely to correspond to silence, a region of silence closest to the reference location. - View Dependent Claims (20, 21)
-
-
22. A method for processing acoustic data in accordance with a speech recognition system, the method comprising acts of:
-
recording acoustic data in at least one recording medium; detecting, at a first time, a user-generated input event instructing the speech recognition system to start speech recognition processing, the first time corresponding to a first location of the recorded acoustic data recorded in the at least one recording medium; searching in the recorded acoustic data to identify a silence region having the shortest distance, among all silence regions in the recorded acoustic data, relative to the first location in the recorded acoustic data corresponding to the first time at which the user-generated input event was detected; and identifying a location in the identified silence region as a start location for speech recognition processing of at least a portion of the recorded acoustic data, wherein; if the recorded acoustic data is such that the identified silence region entirely follows the first location, the start location for speech recognition processing follows the first location; and if the recorded acoustic data is such that the identified silence region entirely precedes the first location, the start location for speech recognition processing precedes the first location. - View Dependent Claims (23, 24)
-
-
25. A system for processing acoustic data in accordance with a speech recognition system, the system comprising:
-
at least one memory for storing executable instructions; at least one processor programmed by the executable instructions to; record acoustic data in at least one recording medium; detect, at a first time, a user-generated input event instructing the speech recognition system to start speech recognition processing, the first time corresponding to a first location of the recorded acoustic data recorded in the at least one recording medium; search in the recorded acoustic data to identify a silence region having the shortest distance, among all silence regions in the recorded acoustic data, relative to the first location in the recorded acoustic data corresponding to the first time at which the user-generated input event was detected; and identify a location in the identified silence region as a start location for speech recognition processing of at least a portion of the recorded acoustic data, wherein; if the recorded acoustic data is such that the identified silence region entirely follows the first location, the start location for speech recognition processing follows the first location; and if the recorded acoustic data is such that the identified silence region entirely precedes the first location, the start location for speech recognition processing precedes the first location. - View Dependent Claims (26, 27)
-
-
28. At least one computer readable memory encoded with instructions that, when executed, perform a method for processing acoustic data in accordance with a speech recognition system, the method comprising acts of:
-
recording acoustic data in at least one recording medium; detecting, at a first time, a user-generated input event instructing the speech recognition system to start speech recognition processing, the first time corresponding to a first location of the recorded acoustic data recorded in the at least one recording medium; searching in the recorded acoustic data to identify a silence region having the shortest distance, among all silence regions in the recorded acoustic data, relative to the first location in the recorded acoustic data corresponding to the first time at which the user-generated input event was detected; and identifying a location in the identified silence region as a start location for speech recognition processing of at least a portion of the recorded acoustic data, wherein; if the recorded acoustic data is such that the identified silence region entirely follows the first location, the start location for speech recognition processing follows the first location; and if the recorded acoustic data is such that the identified silence region entirely precedes the first location, the start location for speech recognition processing precedes the first location. - View Dependent Claims (29, 30)
-
Specification