Hardware-implemented scalable modular engine for low-power speech recognition
First Claim
1. An Application Specific Integration Circuit (ASIC) for use in a hardware-implemented backend search engine for a low-power speech recognition system, said ASIC comprising at least:
- a scoring engine that includes logic circuitry adapted to read a plurality of active acoustic unit models from external memory, update each of the plurality of active acoustic unit models based on one or more corresponding senone scores received from an acoustic scoring engine for a current frame of sampled speech, write the plurality of active acoustic unit models back to the external memory, and enter a low-power state after writing the plurality of active acoustic unit models back to the external memory until processing for a subsequent frame of sampled speech is to begin;
a transition engine that includes logic circuitry adapted to process the plurality of active acoustic unit models after the plurality of active acoustic unit models have been updated and written back to the external memory by the scoring engine in order to prune unlikely active acoustic unit models for the current frame of sampled speech, create or modify active acoustic unit models likely to be transitioned to, and identify any completed words, wherein the transition engine is in a low-power state while the scoring engine is processing the plurality of active acoustic unit models; and
a language model engine that includes logic circuitry adapted to process any completed words identified by the transition engine for the current frame of sampled speech to identify one or more words that are likely to follow in the subsequent frame of sampled speech.
1 Assignment
0 Petitions
Accused Products
Abstract
The present invention relates to a low-power speech recognition system. In one embodiment, the speech recognition system is implemented in hardware and includes a backend search engine that operates to recognize words based on senone scores provided by an acoustic scoring stage. The backend search engine includes a scoring engine, a transition engine, and a language model engine. For a frame of sampled speech, the scoring engine reads active acoustic unit models from external memory, updates the active acoustic unit models based on corresponding senone scores received from an acoustic scoring stage, and writes the active acoustic unit models back to the external memory. The scoring engine enters a low-power state until processing for a next frame of sampled speech is to begin. The transition stage identifies any completed words, and the language model engine processes completed words to identify words that are likely to follow in a subsequent frame.
55 Citations
20 Claims
-
1. An Application Specific Integration Circuit (ASIC) for use in a hardware-implemented backend search engine for a low-power speech recognition system, said ASIC comprising at least:
-
a scoring engine that includes logic circuitry adapted to read a plurality of active acoustic unit models from external memory, update each of the plurality of active acoustic unit models based on one or more corresponding senone scores received from an acoustic scoring engine for a current frame of sampled speech, write the plurality of active acoustic unit models back to the external memory, and enter a low-power state after writing the plurality of active acoustic unit models back to the external memory until processing for a subsequent frame of sampled speech is to begin; a transition engine that includes logic circuitry adapted to process the plurality of active acoustic unit models after the plurality of active acoustic unit models have been updated and written back to the external memory by the scoring engine in order to prune unlikely active acoustic unit models for the current frame of sampled speech, create or modify active acoustic unit models likely to be transitioned to, and identify any completed words, wherein the transition engine is in a low-power state while the scoring engine is processing the plurality of active acoustic unit models; and a language model engine that includes logic circuitry adapted to process any completed words identified by the transition engine for the current frame of sampled speech to identify one or more words that are likely to follow in the subsequent frame of sampled speech. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. An Application Specific Integration Circuit (ASIC) for use in a hardware-implemented backend search engine for a low-power speech recognition system, said ASIC comprising at least:
-
a scoring engine that includes logic circuitry adapted to read a plurality of active acoustic unit models from external memory, update each of the plurality of active acoustic unit models based on one or more corresponding senone scores received from an acoustic scoring engine for a current frame of sampled speech, and write the plurality of active acoustic unit models back to the external memory; a transition engine that includes logic circuitry adapted to process the plurality of active acoustic unit models after the plurality of active acoustic unit models have been updated and written back to the external memory by the scoring engine in order to prune unlikely active acoustic unit models for the current frame of sampled speech, create or modify active acoustic unit models likely to be transitioned to, and identify any completed words, wherein the transition engine is in a low-power state while the scoring engine is processing the plurality of active acoustic unit models; and a language model engine that includes logic circuitry adapted to, for each completed word, identify one or more expected words that are likely to follow the completed word using an n-gram analysis, wherein as part of the n-gram analysis the language model engine performs a lookup for trigrams for the completed word in the external memory using a hashing technique. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
-
Specification