SYSTEMS AND METHODS OF INTERPRETING SPEECH DATA
Abstract
Methods and systems are provided for interpreting speech data. A method and system for recognizing speech involve a filter module to generate a set of processed audio data based on raw audio data; a translation module to provide a set of translation results for the raw audio data; and a decision module to select the text data that represents the raw audio data. A method for minimizing noise in audio signals received by a microphone array is also described. A method and system for automatic entry of data into one or more data fields involve receiving processed audio data; and operating a processing module to: search in a trigger dictionary for a field identifier that corresponds to the trigger identifier; identify a data field associated with a data field identifier corresponding to the field identifier; and provide content data associated with the trigger identifier to the identified data field.
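The automatic data entry steps summarized in the abstract can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the trigger dictionary contents, the `fill_field` name, the use of a text transcript in place of processed audio data, and the rule that a trigger phrase appears at the start of the utterance.

```python
from typing import Dict, Optional

# Hypothetical trigger dictionary: trigger phrase -> field identifier.
TRIGGER_DICTIONARY: Dict[str, str] = {
    "blood pressure": "bp",
    "heart rate": "hr",
}


def fill_field(transcript: str,
               trigger_dictionary: Dict[str, str],
               form: Dict[str, str]) -> Optional[str]:
    """Route the content that follows a trigger phrase into the matching field.

    Returns the identifier of the field that was filled, or None when no
    trigger matches or the form has no field with that identifier.
    """
    lowered = transcript.lower()
    for trigger, field_id in trigger_dictionary.items():
        # Search the trigger dictionary for a field identifier.
        if lowered.startswith(trigger):
            # Identify the data field whose identifier corresponds to it.
            if field_id not in form:
                return None
            # Provide the content associated with the trigger to that field.
            form[field_id] = transcript[len(trigger):].strip()
            return field_id
    return None
```

In this sketch the form is a plain dictionary keyed by data field identifier; a real form would more likely be a user-interface or database record, and trigger detection would operate on the recognized speech rather than on a prepared string.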
22 Claims
1. An uncontrolled environment-based speech recognition system, the system comprising:
one or more filters to each generate a set of processed audio data based on raw audio data received from one or more computing devices, the one or more filters applying filter processes to the raw audio data to generate the set of processed audio data, the one or more filters comprising at least one filter appropriate for the uncontrolled environment;
a translator, operable by a processor, to provide a set of translation results for the raw audio data based on the set of processed audio data, each translation result being associated with at least one processed audio data and each translation result including a text data and a confidence level associated with that text data; and
in response to receiving the set of translation results, a decision controller is automatically triggered by the processor to select the text data that represents the raw audio data, the decision controller is operable to:
identify at least one translation result from the set of translation results that includes the text data associated with the confidence level that exceeds a confidence threshold;
determine whether the identified at least one translation result comprises more than one translation result;
in response to determining the identified at least one translation result comprises more than one translation result, determine an occurrence frequency for each text data of the identified at least one translation result and select the text data based on the occurrence frequency, the occurrence frequency representing a number of times that the text data appears in the set of translation results; and
generate an output signal associated with the selection by the decision controller.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
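The decision controller steps recited in claim 1 can be sketched in Python as follows. This is an illustrative reading, not the patented implementation: the `TranslationResult` and `select_text` names are hypothetical, choosing the most frequent text is only one way to "select the text data based on the occurrence frequency", and the tie-break on confidence and the `None` return when no result exceeds the threshold are assumptions the claim does not specify.

```python
from collections import Counter
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class TranslationResult:
    text: str          # text data produced by the translator
    confidence: float  # confidence level associated with that text data


def select_text(results: List[TranslationResult],
                threshold: float) -> Optional[str]:
    """Pick the text that represents the raw audio from a set of results."""
    # Identify the results whose confidence exceeds the threshold.
    identified = [r for r in results if r.confidence > threshold]
    if not identified:
        return None  # assumption: the claim does not cover this case

    # Determine whether more than one result was identified.
    if len(identified) == 1:
        return identified[0].text

    # Occurrence frequency: how many times each text appears in the
    # whole set of translation results, as the claim defines it.
    frequency = Counter(r.text for r in results)

    # Assumption: prefer the most frequent text, then the higher confidence.
    best = max(identified, key=lambda r: (frequency[r.text], r.confidence))
    return best.text
```

The frequency is counted over the full result set rather than only over the identified results, because the claim defines it as the number of times the text appears "in the set of translation results".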
10. A computer-implemented method of recognizing speech for an uncontrolled environment, the method comprising:
operating one or more filters to each generate a set of processed audio data based on raw audio data received from one or more computing devices, the one or more filters being operated to apply filter processes to the raw audio data to generate the set of processed audio data, wherein the one or more filters comprise at least one filter appropriate for the uncontrolled environment;
operating a translator to provide a set of translation results for the raw audio data based on the set of processed audio data, each translation result being associated with at least one processed audio data and each translation result including a text data and a confidence level associated with that text data; and
in response to receiving the set of translation results, automatically trigger a decision controller to select the text data that corresponds to the raw audio data, wherein the decision controller is operable to:
identify at least one translation result from the set of translation results that includes the text data associated with the confidence level that exceeds a confidence threshold;
determine whether the identified at least one translation result comprises more than one translation result;
in response to determining the identified at least one translation result comprises more than one translation result, determine an occurrence frequency for each text data of the identified at least one translation result and select the text data based on the occurrence frequency, the occurrence frequency representing a number of times that the text data appears in the set of translation results; and
generate an output signal associated with the selection by the decision controller.
Dependent claims: 11, 12, 13, 14, 15, 16, 17
18. An uncontrolled environment-based speech recognition system comprising:
one or more filters to each generate a set of processed audio data based on raw audio data, the one or more filters comprises:
a first filter to apply a first filter process to the raw audio data to generate a first processed audio data, and
a second filter to apply a second filter process to the raw audio data to generate a second processed audio data, the first filter being different from the second filter, the one or more filters comprising at least one filter appropriate for the uncontrolled environment;
a translator, operable by a processor, to provide a set of translation results for the raw audio data based on the set of processed audio data, each translation result being associated with at least one processed audio data and each translation result including a text data and a confidence level associated with that text data; and
in response to receiving the set of translation results, a decision controller is automatically triggered by the processor to select the text data that represents the raw audio data.
Dependent claims: 19, 20, 21, 22
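The arrangement in claim 18, where two different filters each process the same raw audio and the translator produces one result per processed audio, might look like the Python sketch below. The two filters (`moving_average` and `noise_gate`) are simple stand-ins chosen for illustration; the claim does not name specific filter processes, and the `run_pipeline` function and the translator callable are hypothetical.

```python
from typing import Callable, List, Sequence, Tuple

Audio = List[float]
Filter = Callable[[Sequence[float]], Audio]
# Hypothetical translator: processed audio -> (text data, confidence level).
Translator = Callable[[Sequence[float]], Tuple[str, float]]


def moving_average(samples: Sequence[float], window: int = 3) -> Audio:
    """First stand-in filter: mean over a trailing window of samples."""
    out = []
    for i in range(len(samples)):
        chunk = samples[max(0, i - window + 1): i + 1]
        out.append(sum(chunk) / len(chunk))
    return out


def noise_gate(samples: Sequence[float], floor: float = 0.05) -> Audio:
    """Second stand-in filter: zero out samples quieter than the floor."""
    return [s if abs(s) >= floor else 0.0 for s in samples]


def run_pipeline(raw: Sequence[float],
                 filters: Sequence[Filter],
                 translate: Translator) -> List[Tuple[str, float]]:
    """Apply every filter to the same raw audio, then translate each output."""
    processed = [f(raw) for f in filters]      # set of processed audio data
    return [translate(p) for p in processed]   # one result per processed audio
```

Each filter receives the unmodified raw audio rather than the output of the previous filter, which matches the claim's wording that both filter processes are applied "to the raw audio data". The resulting list of (text, confidence) pairs is what a decision controller would then choose from.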
Specification