Method for the voice recognition of a speaker using a predictive model, particularly for access control applications
First Claim
1. A method for the voice recognition of a speaker using a q-order predictive model comprising at least one phase of extracting statistical characteristics (P4, P′
-
4) including at least one step of digital acquisition of a voice sample of a particular duration D of the speaker (U) corresponding to at least one utterance of the speaker (U), a step of converting said voice sample into a sequence of vectors of particular size p obtained from a sequence of analysis windows of average duration T and with average spacing I, and a step of determining q+1 correlation matrices of size p×
p from this sequence of vectors, where p and q are non-zero integers, characterized in that said average duration T has a duration less than 10 ms.
1 Assignment
0 Petitions
Accused Products
Abstract
The invention discloses a method for the voice recognition of a speaker using a q-order predictive model, comprising a step for extracting the statistical characteristics including a step of digital acquisition of the speaker'"'"'s voice sample, corresponding to one or several utterances, a step of converting this voice sample into a sequence of vectors of size p, obtained from a series of analysis windows of average size T and an average spacing I, and a step of determining q+1 matrices from this vector sequence. The average size T is of duration of less than 10 ms and the average spacing I of a duration of less than 4.5 ms. The invention is useful in a sound lock including an electroacoustical conversion system (HP,6) and a recorded program implementation system (5) for the method.
19 Citations
20 Claims
-
1. A method for the voice recognition of a speaker using a q-order predictive model comprising at least one phase of extracting statistical characteristics (P4, P′
-
4) including at least one step of digital acquisition of a voice sample of a particular duration D of the speaker (U) corresponding to at least one utterance of the speaker (U), a step of converting said voice sample into a sequence of vectors of particular size p obtained from a sequence of analysis windows of average duration T and with average spacing I, and a step of determining q+1 correlation matrices of size p×
p from this sequence of vectors, where p and q are non-zero integers, characterized in that said average duration T has a duration less than 10 ms. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
4) including at least one step of digital acquisition of a voice sample of a particular duration D of the speaker (U) corresponding to at least one utterance of the speaker (U), a step of converting said voice sample into a sequence of vectors of particular size p obtained from a sequence of analysis windows of average duration T and with average spacing I, and a step of determining q+1 correlation matrices of size p×
Specification