Speech Recognition Circuit and Method
First Claim
1. A speech recognition circuit, comprising:
- an audio front end for calculating a feature vector from an audio signal, wherein the feature vector comprises a plurality of extracted and/or derived quantities from said audio signal during a defined audio time frame;
a calculating circuit for calculating distances indicating the similarity between a feature vector and a plurality of predetermined acoustic states of an acoustic model; and
a search stage for using said calculated distances to identify words within a lexical tree, the lexical tree comprising a model of words;
wherein said audio front end and said search stage are implemented using a first processor, and said calculating circuit is implemented using a second processor, and wherein data is pipelined from the front end to the calculating circuit to the search stage.
1 Assignment
0 Petitions
Accused Products
Abstract
A speech recognition circuit comprising a circuit for providing state identifiers which identify states corresponding to nodes or groups of adjacent nodes in a lexical tree, and for providing scores corresponding to said state identifiers, the lexical tree comprising a model of words; a memory structure for receiving and storing state identifiers identified by a node identifier identifying a node or group of adjacent nodes, said memory structure being adapted to allow lookup to identify particular state identifiers, reading of the scores corresponding to the state identifiers, and writing back of the scores to the memory structure after modification of the scores; an accumulator for receiving score updates corresponding to particular state identifiers from a score update generating circuit which generates the score updates using audio input, for receiving scores from the memory structure, and for modifying said scores by adding said score updates to said scores; and a selector circuit for selecting at least one node or group of adjacent nodes of the lexical tree according to said scores.
75 Citations
86 Claims
-
1. A speech recognition circuit, comprising:
-
an audio front end for calculating a feature vector from an audio signal, wherein the feature vector comprises a plurality of extracted and/or derived quantities from said audio signal during a defined audio time frame; a calculating circuit for calculating distances indicating the similarity between a feature vector and a plurality of predetermined acoustic states of an acoustic model; and a search stage for using said calculated distances to identify words within a lexical tree, the lexical tree comprising a model of words; wherein said audio front end and said search stage are implemented using a first processor, and said calculating circuit is implemented using a second processor, and wherein data is pipelined from the front end to the calculating circuit to the search stage. - View Dependent Claims (2, 3, 4, 5, 36, 37, 38, 39, 40, 42, 44, 46, 86)
-
-
6-9. -9. (canceled)
-
10. An accelerator for calculating distances for a speech recognition circuit, the accelerator comprising:
-
calculating circuit for calculating distances indicating the similarity between a feature vector and a plurality of predetermined acoustic states of an acoustic model, wherein the feature vector comprises a plurality of extracted and/or derived quantities from an audio signal during a defined audio time frame; first and second storage circuit, each for storing calculated distances for at least one said audio time frame, and for making said stored distances available for use by another part of the speech recognition circuit; control circuit for controlling read and write access to the first and second storage circuit, said control means being configured to allow writing to one said storage means while the other said storage means is available for reading, to allow first calculated distances for one audio time frame to be written to one said storage means while second calculated distances for an earlier audio time frame are made available for reading from the other said storage means. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18, 85)
-
-
19. A accelerator for a speech recognition circuit, the accelerator comprising:
-
calculating means for calculating distances indicating the similarity between a feature vector and a plurality of predetermined acoustic states of an acoustic model, wherein the feature vector comprises a plurality of extracted and/or derived quantities from an audio signal during a defined audio time frame; means for receiving or storing compressed data representing said acoustic model; a decompressor for decompressing said compressed data for all states or selected states of the acoustic model, wherein the decompressed data is sent to the calculating means; and output means for outputting calculated distances to another part of the speech recognition circuit. - View Dependent Claims (20, 21, 22, 23, 29)
-
-
24-25. -25. (canceled)
-
26. An accelerator for a speech recognition circuit, the accelerator comprising:
-
calculating means for calculating distances indicating the similarity between a feature vector and a plurality of predetermined acoustic states of an acoustic model, wherein the feature vector comprises a plurality of extracted and/or derived quantities from an audio signal during a defined audio time frame; and a memory for storing said acoustic model, wherein the calculating means and the memory are fabricated as circuits on a single integrated circuit. - View Dependent Claims (28, 30, 32, 33, 34, 35)
-
-
27. (canceled)
-
31. (canceled)
-
41. (canceled)
-
43. A speech recognition circuit, comprising:
-
an audio front end for calculating a feature vector from an audio signal, wherein the feature vector comprises a plurality of extracted and/or derived quantities from said audio signal during a defined audio time frame; calculating means for calculating a distance indicating the similarity between a feature vector and a predetermined acoustic state of an acoustic model; and a search stage for using said calculated distances to identify words within a lexical tree, the lexical tree comprising a model of words; wherein said audio front end, said calculating means, and said search stage are connected to each other to enable pipelined data flow from one to another.
-
-
45. A speech recognition method, comprising:
-
calculating a feature vector from an audio signal, wherein the feature vector comprises a plurality of extracted and/or derived quantities from said audio signal during a defined audio time frame; calculating a distance indicating the similarity between a feature vector and a predetermined acoustic state of an acoustic model; and using said calculated distances to identify words within a lexical tree, the lexical tree comprising a model of words; wherein said audio front end and said search stage are implemented using a first processor, and said calculating means is implemented using a second processor, and wherein data is pipelined from the front end to the calculating means to the search stage.
-
-
47-84. -84. (canceled)
Specification