STANDARD-MODEL GENERATION FOR SPEECH RECOGNITION USING A REFERENCE MODEL
First Claim
1. A standard model creating apparatus for creating a standard model which shows an acoustic characteristic having a specific attribute and is used for a speech recognition device included in an electronic apparatus used by a user, the standard model creating apparatus using a probability model that expresses a frequency parameter showing an acoustic characteristic as an output probability, the standard model creating apparatus comprising:
- a reference model storing unit configured to store a plurality of reference models which are probability models showing an acoustic characteristic having a specific attribute; and
a standard model creating unit configured to create the standard model by calculating statistics of the standard model using statistics of the plurality of reference models stored in said reference model storing unit,wherein said standard model creating unit includes;
a standard model structure determining unit configured to determine a structure of the standard model which is to be created, based on specification information regarding specifications of the electronic apparatus;
an initial standard model creating unit configured to determine initial values of the statistics specifying the standard model whose structure has been determined; and
a statistics estimating unit configured to estimate and calculate the statistics of the standard model so as to maximize or locally maximize a probability or a likelihood of the standard model, whose initial values have been determined, with respect to the plurality of reference models,wherein the plurality of reference models and the standard model are expressed using at least one Gaussian distribution, andsaid standard model structure determining unit is configured to determine a number of statistics of the standard model including at least a number of Gaussian mixture distributions as the structure of the standard model, based on the specification information indicating which of precision and speed is prioritized in speech recognition by the speech recognition device.
0 Assignments
0 Petitions
Accused Products
Abstract
A standard model creating apparatus which provides a high-precision standard model used for pattern recognition such as speech recognition, character recognition, or image recognition using a probability model based on a hidden Markov model, Bayesian theory, or linear discrimination analysis; intention interpretation using a probability model such as a Bayesian net; data-mining performed using a probability model; and so forth. The standard model creating apparatus includes a reference model preparing unit that prepares at least one reference model; a reference model storing unit that stores the reference model prepared by the reference model preparing unit (; and a standard model creating unit that creates a standard model by calculating statistics of the standard model so as to maximize or locally maximize the probability or likelihood with respect to the reference model stored in the reference model storing unit.
39 Citations
9 Claims
-
1. A standard model creating apparatus for creating a standard model which shows an acoustic characteristic having a specific attribute and is used for a speech recognition device included in an electronic apparatus used by a user, the standard model creating apparatus using a probability model that expresses a frequency parameter showing an acoustic characteristic as an output probability, the standard model creating apparatus comprising:
-
a reference model storing unit configured to store a plurality of reference models which are probability models showing an acoustic characteristic having a specific attribute; and a standard model creating unit configured to create the standard model by calculating statistics of the standard model using statistics of the plurality of reference models stored in said reference model storing unit, wherein said standard model creating unit includes; a standard model structure determining unit configured to determine a structure of the standard model which is to be created, based on specification information regarding specifications of the electronic apparatus; an initial standard model creating unit configured to determine initial values of the statistics specifying the standard model whose structure has been determined; and a statistics estimating unit configured to estimate and calculate the statistics of the standard model so as to maximize or locally maximize a probability or a likelihood of the standard model, whose initial values have been determined, with respect to the plurality of reference models, wherein the plurality of reference models and the standard model are expressed using at least one Gaussian distribution, and said standard model structure determining unit is configured to determine a number of statistics of the standard model including at least a number of Gaussian mixture distributions as the structure of the standard model, based on the specification information indicating which of precision and speed is prioritized in speech recognition by the speech recognition device. - View Dependent Claims (2, 3, 4, 5, 8, 9)
-
-
6. A method of creating a standard model which shows an acoustic characteristic having a specific attribute and is used for a speech recognition device included in an electronic apparatus used by a user, the method using a probability model that expresses a frequency parameter showing an acoustic characteristic as an output probability, the method comprising:
-
a reference model reading step of reading plurality of reference models from a reference model storing unit which is configured to store a plurality of reference models which are probability models showing an acoustic characteristic having a specific attribute; and a standard model creating step of creating the standard model by calculating statistics of the standard model using statistics of the plurality of reference models that has been read, wherein the standard model creating step includes; a standard model structure determining sub-step of determining a structure of the standard model which is to be created, based on specification information regarding specifications of the electronic apparatus; an initial standard model creating sub-step of determining initial values of the statistics specifying the standard model whose structure has been determined; and a statistics estimating sub-step of estimating and calculating the statistics of the standard model so as to maximize or locally maximize a probability or a likelihood of the standard model, whose initial values have been determined, with respect to plurality of reference models, wherein the plurality of reference models and the standard model are expressed using at least one Gaussian distribution, and said standard model structure determining unit is configured to determine a number of statistics of the standard model including at least a number of Gaussian mixture distributions as the structure of the standard model, based on the specification information indicating which of precision and speed is prioritized in speech recognition by the speech recognition device.
-
-
7. A program stored on a computer-readable medium which when executed causes a standard model creating apparatus to perform steps for creating a standard model which shows an acoustic characteristic having a specific attribute and is used for a speech recognition device included in an electronic apparatus used by a user, the program using a probability model that expresses a frequency parameter showing an acoustic characteristic as an output probability, the steps comprising:
-
a reference model reading step of reading plurality of reference models from a reference model storing unit which is configured to store a plurality of reference models which are probability models showing an acoustic characteristic having a specific attribute; and a standard model creating step of creating the standard model by calculating statistics of the standard model using statistics of the plurality of reference models that has been read, wherein the standard model creating step includes; a standard model structure determining sub-step configured to determine a structure of the standard model which is to be created, based on specification information regarding specifications of the electronic apparatus; an initial standard model creating sub-step of determining initial values of the statistics specifying the standard model whose structure has been determined; and a statistics estimating sub-step of estimating and calculating the statistics of the standard model so as to maximize or locally maximize a probability or a likelihood of the standard model, whose initial values have been determined, with respect to the plurality of reference models wherein the plurality of reference models and the standard model are expressed using at least one Gaussian distribution, and said standard model structure determining unit is configured to determine a number of statistics of the standard model including at least a number of Gaussian mixture distributions as the structure of the standard model, based on the specification information indicating which of precision and speed is prioritized in speech recognition by the speech recognition device.
-
Specification