Speaker verification system
First Claim
1. A speaker verification system, comprising:
a) conversion means for:
1) dividing an input speech signal into frames at predetermined time intervals; and
2) converting the input speech signal into an acoustic parameter with a frequency spectrum having a plurality of frequency channels for every frame, thus generating a time-series of spectral patterns;
b) detecting means for detecting, from the time-series of spectral patterns, a speech portion of the input speech signal;
c) primary moment generating means for generating a primary moment of the frequency spectrum for every frame, the primary moment showing a channel position corresponding to a center of the frequency spectrum;
d) segmentation means for segmenting the speech portion into a plurality of blocks, based on the primary moment generated for every frame;
e) feature extracting means for extracting features of the input speech signal for every segmented block;
f) memory means for storing reference features of registered speakers, the reference features including features of input speech signals of the registered speakers extracted by the feature extracting means;
g) distance calculating means for calculating a distance between (1) the extracted features of an unknown speaker, and (2) the reference features stored in the memory means; and
h) decision means for making a decision as to whether or not the unknown speaker is a real speaker by comparing the distance calculated by the distance calculating means with a predetermined threshold value.
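Elements c) and d) of the claim can be illustrated with a minimal sketch. The claim does not give a formula for the primary moment or a rule for segmentation, so the code below assumes the moment is the power-weighted mean channel index of a frame and assumes a hypothetical criterion: a new block starts whenever the moment crosses a fixed threshold. The names `primary_moment`, `segment_by_moment`, and `threshold` are illustrative and do not come from the patent.

```python
from typing import List, Sequence, Tuple


def primary_moment(frame: Sequence[float]) -> float:
    """Power-weighted mean channel index (1-based) of one spectral frame.

    Assumed definition: sum(i * x_i) / sum(x_i) over channels i = 1..N.
    """
    total = sum(frame)
    if total <= 0:
        return 0.0  # silent frame: no meaningful center (assumed convention)
    return sum(ch * p for ch, p in enumerate(frame, start=1)) / total


def segment_by_moment(
    moments: Sequence[float], threshold: float
) -> List[Tuple[int, int]]:
    """Split frames into (start, end) blocks, end exclusive.

    Hypothetical criterion: a block boundary is placed wherever the
    primary moment moves from one side of `threshold` to the other.
    """
    if not moments:
        return []
    blocks: List[Tuple[int, int]] = []
    start = 0
    for i in range(1, len(moments)):
        if (moments[i] >= threshold) != (moments[i - 1] >= threshold):
            blocks.append((start, i))
            start = i
    blocks.append((start, len(moments)))
    return blocks


# Five frames of a 4-channel spectrum (made-up values for illustration).
frames = [
    [1, 0, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 1, 1],
    [0, 0, 0, 1],
    [1, 0, 0, 0],
]
moments = [primary_moment(f) for f in frames]
blocks = segment_by_moment(moments, threshold=2.5)
```

Frames dominated by low channels yield a small moment and frames dominated by high channels a large one, so under this assumed criterion the block boundaries follow shifts in where the spectral energy is concentrated.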
Abstract
In a speaker verification system, a detecting part detects a speech section of an input speech signal by using time-series acoustic parameters thereof. A segmentation part calculates individuality information for segmentation by using the time-series acoustic parameters within the speech section, and segments the input speech section into a plurality of blocks based on the individuality information. A feature extracting part extracts features of an unknown speaker for every segmented block by using the time-series acoustic parameters. A distance calculating part calculates a distance between the features of the speaker extracted by the feature extracting part and reference features stored in a memory. A decision part makes a decision as to whether or not the unknown speaker is a real speaker by comparing the calculated distance with a predetermined threshold value. Segmentation is performed by calculating a primary moment of the spectrum, over a block, and finding successive values which satisfy a predetermined criterion.
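The feature extraction, distance calculation, and decision steps described in the abstract can be sketched as follows. The abstract does not say which features or which distance measure are used, nor which side of the threshold counts as acceptance, so this sketch assumes per-block mean spectra as features, a Euclidean distance averaged over blocks, and acceptance when the distance does not exceed the threshold. Blocks are given as hypothetical (start, end) frame ranges with the end exclusive, and all function names are illustrative.

```python
import math
from typing import List, Sequence, Tuple


def block_features(
    frames: Sequence[Sequence[float]], blocks: Sequence[Tuple[int, int]]
) -> List[List[float]]:
    """Assumed feature: the mean spectrum of the frames in each block."""
    feats: List[List[float]] = []
    for start, end in blocks:
        n = end - start
        channels = len(frames[start])
        feats.append(
            [sum(frames[t][c] for t in range(start, end)) / n
             for c in range(channels)]
        )
    return feats


def distance(
    features: Sequence[Sequence[float]], reference: Sequence[Sequence[float]]
) -> float:
    """Assumed measure: Euclidean distance per block, averaged over blocks."""
    if not features or len(features) != len(reference):
        raise ValueError("feature sets must be non-empty with equal block counts")
    return sum(math.dist(f, r) for f, r in zip(features, reference)) / len(features)


def verify(
    features: Sequence[Sequence[float]],
    reference: Sequence[Sequence[float]],
    threshold: float,
) -> bool:
    """Accept the claimed identity when the distance is within the threshold."""
    return distance(features, reference) <= threshold


# Three frames of a 2-channel spectrum split into two blocks (made-up values).
frames = [[1, 0], [3, 0], [0, 2]]
blocks = [(0, 2), (2, 3)]
features = block_features(frames, blocks)

registered = [[2.0, 0.0], [0.0, 2.0]]  # stored reference features
other = [[0.0, 2.0], [2.0, 0.0]]       # reference of a different speaker

accepted = verify(features, registered, threshold=0.5)
rejected = verify(features, other, threshold=0.5)
```

In practice the threshold trades false acceptances against false rejections; the value used here is arbitrary.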
90 Citations
12 Claims
1. A speaker verification system, as set out in full under First Claim above. Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12.
Specification