Automatic Text-Independent, Language-Independent Speaker Voice-Print Creation and Speaker Recognition
Abstract
An automatic dual-step, text-independent, language-independent speaker voice-print creation and speaker recognition method, wherein a neural network-based technique is used in a first step and a Markov model-based technique is used in a second step. In particular, the first step uses a neural network-based technique for decoding the content of what is uttered by the speaker in terms of language-independent acoustic-phonetic classes, and the second step uses the sequence of language-independent acoustic-phonetic classes from the first step and employs a Markov model-based technique for creating the speaker voice-print and for recognizing the speaker. The combination of the two steps enables improvement in the accuracy and efficiency of the speaker voice-print creation and of the speaker recognition, without setting any constraints on the lexical content of the speaker utterance or on the language thereof.
107 Citations
52 Claims
1-26. (canceled)
27. A method for creating a voice-print of a speaker based on an input voice signal representing an utterance of said speaker, comprising:
processing said input voice signal to provide a sequence of language-independent acoustic-phonetic classes associated with corresponding temporal segments of said input voice signal, said language-independent acoustic-phonetic classes representing sounds in said utterance and being represented by respective original acoustic models;
adapting the original acoustic model of each of said language-independent acoustic-phonetic classes to the speaker, based on the temporal segment of the input voice signal associated with a language-independent acoustic-phonetic class; and
creating said voice-print based on the adapted acoustic models of said language-independent acoustic-phonetic classes.
Dependent claims: 28-46.
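The three steps recited in claim 27 can be illustrated with a minimal sketch. It is not the patented implementation. It assumes that the first step has already produced one class label per feature frame (the `labels` argument is hypothetical), that each original acoustic model is reduced to a single mean vector, and that adaptation is a MAP-style interpolation of the mean with a relevance factor. The claim does not name an adaptation technique, so that choice, like the names `adapt_means`, `create_voice_print` and `relevance`, is an assumption made for illustration.

```python
from collections import defaultdict


def adapt_means(original_means, frames, labels, relevance=16.0):
    """Adapt each class mean toward the speaker's frames for that class.

    original_means: dict mapping class id -> mean vector (list of floats)
    frames:         list of feature vectors taken from the input voice signal
    labels:         class id decoded for each frame (assumed given)
    relevance:      assumed MAP relevance factor; larger values trust the
                    original model more
    """
    sums = {}
    counts = defaultdict(int)
    for frame, cls in zip(frames, labels):
        if cls not in original_means:
            continue  # ignore frames decoded as a class with no model
        acc = sums.setdefault(cls, [0.0] * len(frame))
        for i, value in enumerate(frame):
            acc[i] += value
        counts[cls] += 1

    adapted = {}
    for cls, mean in original_means.items():
        n = counts[cls]
        if n == 0:
            adapted[cls] = list(mean)  # class not observed: keep the original
            continue
        alpha = n / (n + relevance)    # weight given to the speaker's data
        adapted[cls] = [
            alpha * (s / n) + (1.0 - alpha) * m
            for s, m in zip(sums[cls], mean)
        ]
    return adapted


def create_voice_print(original_means, frames, labels, relevance=16.0):
    """In this sketch the voice-print is simply the set of adapted models."""
    return adapt_means(original_means, frames, labels, relevance)
```

A class that never occurs in the utterance keeps its original model, so in this sketch the voice-print always covers every class regardless of what was said.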
47. A speaker recognition system capable of being configured to implement a method for creating a voice-print of a speaker based on an input voice signal representing an utterance of said speaker, comprising:
processing said input voice signal to provide a sequence of language-independent acoustic-phonetic classes associated with corresponding temporal segments of said input voice signal, said language-independent acoustic-phonetic classes representing sounds in said utterance and being represented by respective original acoustic models;
adapting the original acoustic model of each of said language-independent acoustic-phonetic classes to the speaker, based on the temporal segment of the input voice signal associated with a language-independent acoustic-phonetic class; and
creating said voice-print based on the adapted acoustic models of said language-independent acoustic-phonetic classes.
Dependent claims: 48, 49.
50. A computer program product loadable in a memory of a processing system and comprising software code portions capable of implementing, when the computer program product is run on the processing system, a method for creating a voice-print of a speaker based on an input voice signal representing an utterance of said speaker, comprising:
processing said input voice signal to provide a sequence of language-independent acoustic-phonetic classes associated with corresponding temporal segments of said input voice signal, said language-independent acoustic-phonetic classes representing sounds in said utterance and being represented by respective original acoustic models;
adapting the original acoustic model of each of said language-independent acoustic-phonetic classes to the speaker, based on the temporal segment of the input voice signal associated with a language-independent acoustic-phonetic class; and
creating said voice-print based on the adapted acoustic models of said language-independent acoustic-phonetic classes.
Dependent claims: 51, 52.
Specification