Speech classification apparatus, speech classification method, and speech classification program
First Claim
Patent Images
1. A speech classification apparatus that classifies speech signals into clusters based on vocal similarity, comprising:
- a speech classification probability calculation unit that calculates a probability that a latest input one of the speech signals sequentially input belongs to each of the clusters, based on a probability model for probabilistically determining to which cluster a certain speech signal belongs; and
a parameter updating unit that successively estimates values of parameters that define the probability model using each probability calculated by the speech classification probability calculation unit and a sufficient statistic calculated based on a speech signal inputted before the latest input one of the speech signals;
the speech classification probability calculation unit calculating each probability based on the probability modeldefined by latest values of the parameters successively estimated by the parameter updating unit.
1 Assignment
0 Petitions
Accused Products
Abstract
A speech classification apparatus includes a speech classification probability calculation unit that calculates a probability (probability of classification into each cluster) that a latest one of the speech signals (speech data) belongs to each cluster based on a generative model which is a probability model, and a parameter updating unit that successively estimates parameters that define the generative model based on the probability of classification of the speech data into each cluster calculated by the speech classification probability calculation unit.
14 Citations
26 Claims
-
1. A speech classification apparatus that classifies speech signals into clusters based on vocal similarity, comprising:
-
a speech classification probability calculation unit that calculates a probability that a latest input one of the speech signals sequentially input belongs to each of the clusters, based on a probability model for probabilistically determining to which cluster a certain speech signal belongs; and a parameter updating unit that successively estimates values of parameters that define the probability model using each probability calculated by the speech classification probability calculation unit and a sufficient statistic calculated based on a speech signal inputted before the latest input one of the speech signals; the speech classification probability calculation unit calculating each probability based on the probability model defined by latest values of the parameters successively estimated by the parameter updating unit. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A speech classification method that classifies speech signals into clusters based on vocal similarity, comprising:
-
calculating a probability that a latest input one of the speech signals sequentially input belongs to each cluster, based on a probability model for probabilistically determining to which cluster a certain speech signal belongs; successively estimating parameters that define the probability model using the probability and a sufficient statistic calculated based on a speech signal inputted before the latest input one of the speech signals; and calculating a probability that at least next input speech signal belongs to each cluster, based on the probability model defined by the successively estimated parameters. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A non-transitory computer-readable recording medium storing a speech classification program that classifies speech signals into clusters based on vocal similarity, the speech classification program causing a computer to execute:
-
a probability calculation processing that calculates a probability that a latest input one of the speech signals sequentially input belongs to each cluster, based on a probability model for probabilistically determining to which cluster a certain speech signal belongs; and a parameter update processing that successively estimates parameters that define the probability model using each probability calculated by the speech classification probability calculation processing and a sufficient statistic calculated based on a speech signal inputted before the latest input one of the speech signals; the probability calculation processing calculating each probability based on the probability model defined by latest values of the successively estimated parameters. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24)
-
-
25. A speech classification system that performs a clustering process which generates a cluster in response to each of sequentially input speech data on-line, the system comprising:
-
a speech classification probability calculation unit that derives a probability that a latest one of the sequentially input speech data belongs to each cluster, using a generation model which is defined by parameter values stored in parameter storage unit, assuming a speech data distribution, and storing the probability in speech classification probability storage unit; an update target speech selection unit that determines whether or not recalculation of the probability that each of the speech data belongs to each cluster is needed according to a magnitude relation between a predetermined threshold value and an indicator, the indicator being obtained by reversing a sign of an entropy of the probability that each of the speech data belongs to each cluster; a speech classification probability updating unit that derives a probability that speech data, the probability of which is determined to be needed by the update target speech selection unit, out of predetermined items of the sequentially input speech data except the latest speech data belongs to each cluster and that updates the speech classification probability storage unit; and a parameter updating unit that calculates sufficient statistics necessary for calculating the generation model on each of the numbers of clusters and uses each of the probabilities calculated by the speech classification probability calculation unit and a sufficient statistic calculated based on a speech data inputted before the latest one of the sequentially input speech data to estimate parameter values of the generation model, assuming a current number of clusters and some numbers of clusters in the vicinity of the current number of clusters, based on results of calculations by the speech classification probability calculation unit and the speech classification probability updating unit, and that successively updates the parameter values in the parameter storage unit with the estimated parameter values. - View Dependent Claims (26)
-
Specification