Automatic computation streaming partition for voice recognition on multiple processors with limited memory
First Claim
1. A computer speech processing system, comprising:
a memory unit;
a main processing unit coupled to the memory unit;
one or more co-processor elements coupled to the memory unit and the main processing unit, wherein each of the one or more co-processor elements include a co-processor unit and a local memory associated with the co-processor unit;
a first main processor thread stored in the memory unit and configured for execution by the main processor, wherein the first main processor thread is configured to cause the main processor to perform memory maintenance and voice recognition result retrievals upon execution;
a first co-processor thread stored in the main memory or in one or more co-processor local memories and configured for execution by one or more of the co-processor units, wherein the first co-processor thread is configured to cause one or more of the co-processors to receive raw data representing sound detected by a microphone and perform voice detection and initial feature extraction on the raw data;
a second co-processor thread stored in the main memory or in one or more co-processor local memories and configured for execution by one or more of the co-processor units, wherein the second co-processor thread is configured to cause one or more of the co-processor elements to receive feature data derived for one or more features extracted by the first co-processor thread and information for locating probability density functions needed for probability computation by a speech recognition model and compute a probability that one or more features correspond to a known sub-unit of speech using the probability density functions and the feature data; and
a third co-processor thread stored in the main memory or in one or more co-processor local memories and configured for execution by one or more of the co-processor units, wherein the third co-processor thread is configured to cause one or more of the co-processor units to compute at least a portion of a path probability that a sequence of sub-units of speech correspond to a known speech unit.
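The division of work recited above can be pictured as a chain of workers connected by queues. The following is a minimal sketch only, assuming Python's standard `threading` and `queue` modules stand in for the co-processor threads. The names `stage` and `run_pipeline` and the callbacks `extract`, `score`, and `search` are hypothetical and do not appear in the claim.

```python
import queue
import threading

def stage(fn, inbox, outbox):
    """Apply fn to each item from inbox and forward the result; None ends the stream."""
    while True:
        item = inbox.get()
        if item is None:
            outbox.put(None)  # pass the end-of-stream marker downstream
            return
        outbox.put(fn(item))

def run_pipeline(frames, extract, score, search):
    """Chain three worker stages (hypothetical stand-ins for the co-processor threads)."""
    q_raw, q_feat, q_prob, q_out = (queue.Queue() for _ in range(4))
    workers = [
        threading.Thread(target=stage, args=(extract, q_raw, q_feat)),
        threading.Thread(target=stage, args=(score, q_feat, q_prob)),
        threading.Thread(target=stage, args=(search, q_prob, q_out)),
    ]
    for w in workers:
        w.start()
    for frame in frames:
        q_raw.put(frame)
    q_raw.put(None)
    # The calling thread plays the main-processor role here: it only
    # feeds input and retrieves results.
    results = []
    while True:
        r = q_out.get()
        if r is None:
            break
        results.append(r)
    for w in workers:
        w.join()
    return results
```

Each stage has a single worker and a FIFO queue, so results come back in input order. A real system would also have to fit each stage's working set into the co-processor's local memory, which this sketch does not model.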
Abstract
Speech processing is disclosed for an apparatus having a main processing unit, a memory unit, and one or more co-processors. Memory maintenance and voice recognition result retrievals are performed with a first main processor thread. Voice detection and initial feature extraction on raw data are performed with a first co-processor thread. A second co-processor thread receives feature data derived for one or more features extracted by the first co-processor thread and information for locating probability density functions needed for probability computation by a speech recognition model, and computes a probability that the one or more features correspond to a known sub-unit of speech using the probability density functions and the feature data. At least a portion of a path probability that a sequence of sub-units of speech corresponds to a known speech unit is computed with a third co-processor thread.
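As an illustration of the voice detection and initial feature extraction stage, the sketch below uses a fixed log-energy threshold per frame. This is an assumption made for the example: the abstract does not say which detection method or features are used, and the names `frame_signal`, `log_energy`, and `detect_voice`, along with the default frame length, hop, and threshold, are hypothetical.

```python
import math

def frame_signal(samples, frame_len, hop):
    """Split samples into overlapping frames, dropping any partial tail."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]

def log_energy(frame, floor=1e-10):
    """Log of the mean squared amplitude; the floor avoids log(0) on silence."""
    return math.log(max(sum(s * s for s in frame) / len(frame), floor))

def detect_voice(samples, frame_len=4, hop=2, threshold=-2.0):
    """Return (is_voiced, log_energy) per frame using a fixed threshold (assumed method)."""
    feats = [log_energy(f) for f in frame_signal(samples, frame_len, hop)]
    return [(e > threshold, e) for e in feats]
```

Only frames flagged as voiced would need to be passed on to the later stages, which is one way such a stage could reduce downstream work.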
24 Claims
12. A computer implemented method for speech processing in a computer speech apparatus having a main processing unit, a memory unit coupled to the main processing unit and one or more co-processors coupled to the memory unit and the main processing unit, wherein each co-processor element includes a co-processor unit and a local memory associated with the co-processor unit, the method comprising:
a) performing memory maintenance and voice recognition result retrievals upon execution with a first main processor thread executed by the main processor;
b) performing voice detection and initial feature extraction on the raw data with a first co-processor thread running on one or more of the co-processor elements;
c) receiving feature data derived for one or more features extracted by the first co-processor thread and information for locating probability density functions needed for probability computation by a speech recognition model and computing a probability that the one or more features correspond to a known sub-unit of speech using the probability density functions and the feature data with a second co-processor thread configured to run on one or more of the co-processor elements; and
d) computing at least a portion of a path probability that a sequence of sub-units of speech correspond to a known speech unit with a third co-processor thread configured to run on one or more of the co-processor elements.
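Step c) can be illustrated with a small scoring routine. The sketch below assumes the probability density functions are mixtures of diagonal-covariance Gaussians, a common choice in speech recognition that the claim itself does not specify. The table `pdf_table` and the identifiers in `pdf_ids` are hypothetical stand-ins for the "information for locating probability density functions".

```python
import math

def log_gaussian_diag(x, mean, var):
    """Log density of x under a Gaussian with diagonal covariance."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )

def log_mixture(x, weights, means, variances):
    """Log of a weighted sum of diagonal Gaussians, via log-sum-exp for stability."""
    terms = [
        math.log(w) + log_gaussian_diag(x, m, v)
        for w, m, v in zip(weights, means, variances)
    ]
    peak = max(terms)
    return peak + math.log(sum(math.exp(t - peak) for t in terms))

def score_frame(x, pdf_ids, pdf_table):
    """Score one feature vector against only the PDFs named in pdf_ids.

    pdf_table maps an id to a (weights, means, variances) tuple (assumed layout).
    """
    return {pid: log_mixture(x, *pdf_table[pid]) for pid in pdf_ids}
```

Passing only the identifiers of the densities that are currently needed keeps the data fetched for each frame small, which matters when the scoring thread works out of a limited local memory.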
23. A computer apparatus for implementing computer speech processing in a computer speech processing apparatus having a main processing unit, a memory unit coupled to the main processing unit and one or more co-processors coupled to the memory unit and the main processing unit, wherein each co-processor element includes a co-processor unit and a local memory associated with the co-processor unit, the system comprising:
a) means for performing memory maintenance and voice recognition result retrievals upon execution with a first main processor thread executed by the main processor;
b) means for performing voice detection and initial feature extraction on the raw data with a first co-processor thread running on one or more of the co-processors;
c) means for receiving feature data derived for one or more features extracted by the first co-processor thread and information for locating probability density functions needed for probability computation by a speech recognition model and computing a probability that the one or more features correspond to a known sub-unit of speech using the probability density functions and the feature data with a second co-processor thread configured to run on one or more of the co-processors; and
d) means for computing at least a portion of a path probability that a sequence of sub-units of speech correspond to a known speech unit with a third co-processor thread configured to run on one or more of the co-processors.
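The path probability in element d) is often computed with a Viterbi search over a hidden Markov model. The sketch below assumes that approach and works in the log domain; the claim does not name the algorithm, and the function name `viterbi_log_prob` and its argument layout are hypothetical.

```python
import math

def viterbi_log_prob(emission_logp, trans_logp, init_logp):
    """Return (best path log probability, state sequence).

    emission_logp[t][s]: log probability of frame t in state s
    trans_logp[p][s]:    log probability of moving from state p to state s
    init_logp[s]:        log probability of starting in state s
    """
    n_states = len(init_logp)
    score = [init_logp[s] + emission_logp[0][s] for s in range(n_states)]
    back = []
    for frame in emission_logp[1:]:
        prev = score
        pointers, score = [], []
        for s in range(n_states):
            # Best predecessor for state s at this frame.
            best = max(range(n_states), key=lambda p: prev[p] + trans_logp[p][s])
            pointers.append(best)
            score.append(prev[best] + trans_logp[best][s] + frame[s])
        back.append(pointers)
    last = max(range(n_states), key=lambda s: score[s])
    # Trace the back-pointers from the final state to recover the path.
    path = [last]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    path.reverse()
    return score[last], path
```

Because each frame's update depends only on the previous frame's scores, the search can be split by frame or by state range, which is one way "at least a portion" of the path probability could be assigned to a single co-processor.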
24. A computer readable storage medium, having embodied therein computer readable instructions for implementing a computer speech processing method in a computer speech processing apparatus having a main processing unit, a memory unit coupled to the main processing unit and one or more co-processors coupled to the memory unit and the main processing unit, the method comprising:
a) performing memory maintenance and voice recognition result retrievals upon execution with a first main processor thread executed by the main processor;
b) performing voice detection and initial feature extraction on the raw data with a first co-processor thread running on one or more of the co-processors;
c) receiving feature data derived for one or more features extracted by the first co-processor thread and information for locating probability density functions needed for probability computation by a speech recognition model and computing a probability that the one or more features correspond to a known sub-unit of speech using the probability density functions and the feature data with a second co-processor thread configured to run on one or more of the co-processors; and
d) computing at least a portion of a path probability that a sequence of sub-units of speech correspond to a known speech unit with a third co-processor thread configured to run on one or more of the co-processors.
Specification