Method for accelerating the execution of speech recognition neural networks and the related speech recognition device
First Claim
1. Method for accelerating neural network (4) execution in a speech recognition system, for recognising words contained in a subset of a general vocabulary of words that the same system is capable of recognising, said neural network (4) comprising a number of computing units organised in levels, among which at least one hidden level (12) and one output level (14), the computing units (Hj) of said hidden level (12) being connected to the computing units (Ni) of said output level (14) via weighted connections (Wij), said computing units (Ni) of said output level (14) corresponding to acoustic-phonetic units (2) of said general vocabulary, characterised in that it comprises the following steps:
- determining a subset of acoustic-phonetic units necessary for recognising all the words contained in said general vocabulary subset;
eliminating from the neural network (4) all the weighted connections (Wij) afferent to computing units (Ni) of said output level (14) that correspond to acoustic-phonetic units not contained in said previously determined subset of acoustic-phonetic units, thus obtaining a compacted neural network (4′
) optimised for recognition of the words contained in said general vocabulary subset;
executing, at each moment in time, only said compacted neural network (4′
).
3 Assignments
0 Petitions
Accused Products
Abstract
A method for accelerating neural network execution (4) in a speech recognition system, specifically for recognition of words contained in one or more subsets of a general vocabulary, involves the following steps.—at the recognition system initialisation phase, calculating the union of vocabulary subsets and determining the acoustic-phonetic units required for recognising the words contained in that union; re-compacting the neural network eliminating all the weighted connections afferent to computation output units corresponding to unnecessary acoustic-phonetic units;—executing unnecessary acoustic-phonetic units;—executing only the re-compacted network at each instant of time.
-
Citations
8 Claims
-
1. Method for accelerating neural network (4) execution in a speech recognition system, for recognising words contained in a subset of a general vocabulary of words that the same system is capable of recognising, said neural network (4) comprising a number of computing units organised in levels, among which at least one hidden level (12) and one output level (14), the computing units (Hj) of said hidden level (12) being connected to the computing units (Ni) of said output level (14) via weighted connections (Wij), said computing units (Ni) of said output level (14) corresponding to acoustic-phonetic units (2) of said general vocabulary, characterised in that it comprises the following steps:
-
determining a subset of acoustic-phonetic units necessary for recognising all the words contained in said general vocabulary subset;
eliminating from the neural network (4) all the weighted connections (Wij) afferent to computing units (Ni) of said output level (14) that correspond to acoustic-phonetic units not contained in said previously determined subset of acoustic-phonetic units, thus obtaining a compacted neural network (4′
) optimised for recognition of the words contained in said general vocabulary subset;
executing, at each moment in time, only said compacted neural network (4′
). - View Dependent Claims (2, 3, 4)
-
-
5. Speech recognition system, comprising a neural network (4) with a number of computing units organised in levels, including at least one hidden level (12) and one output level (14), the computing units (Hj) of said hidden level (12) being connected to the computing units (Ni) of said output level (14) via weighted connections (Wij), said computing units (Ni) of said output level (14) corresponding to acoustic-phonetic units (2) of a general vocabulary of words to be recognized, characterised in that it comprises means (18, 16) for accelerating neural network (4) execution, for recognising words contained in a subset of said general vocabulary, said means (18, 16) comprising:
-
a first module (18) for determining the subset of acoustic-phonetic units necessary for recognising all the words contained in said general vocabulary subset;
a second module (16) for selecting, from among the weighted connections (Wij) connecting the computing units (Hj) of hidden level (12) with those of output level (14), the weighted connections afferent to computing units (Ni) corresponding to acoustic-phonetic units contained in said subset of acoustic-phonetic units determined by said first module (16), thus obtaining a compacted neural network (4′
) optimised for recognition of the words contained in said general vocabulary subset. - View Dependent Claims (6, 7, 8)
-
Specification