STANDARD-MODEL GENERATION FOR SPEECH RECOGNITION USING A REFERENCE MODEL

  • US 20090271201A1
  • Filed: 07/08/2009
  • Published: 10/29/2009
  • Est. Priority Date: 11/21/2002
  • Status: Abandoned Application
First Claim

1. A standard model creating apparatus for creating a standard model which shows an acoustic characteristic having a specific attribute and is used for a speech recognition device included in an electronic apparatus used by a user, the standard model creating apparatus using a probability model that expresses a frequency parameter showing an acoustic characteristic as an output probability, the standard model creating apparatus comprising:

  • a reference model storing unit configured to store a plurality of reference models which are probability models showing an acoustic characteristic having a specific attribute; and

    a standard model creating unit configured to create the standard model by calculating statistics of the standard model using statistics of the plurality of reference models stored in said reference model storing unit, wherein said standard model creating unit includes:

    a standard model structure determining unit configured to determine a structure of the standard model which is to be created, based on specification information regarding specifications of the electronic apparatus;

    an initial standard model creating unit configured to determine initial values of the statistics specifying the standard model whose structure has been determined; and

    a statistics estimating unit configured to estimate and calculate the statistics of the standard model so as to maximize or locally maximize a probability or a likelihood of the standard model, whose initial values have been determined, with respect to the plurality of reference models,

    wherein the plurality of reference models and the standard model are expressed using at least one Gaussian distribution, and

    said standard model structure determining unit is configured to determine a number of statistics of the standard model including at least a number of Gaussian mixture distributions as the structure of the standard model, based on the specification information indicating which of precision and speed is prioritized in speech recognition by the speech recognition device.
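
The three units recited in the claim (structure determination, initial model creation, statistics estimation) can be illustrated with a minimal sketch. It is not taken from the application; it is one possible reading under stated assumptions: features are one-dimensional, each reference model is a list of (weight, mean, variance) Gaussian components, the reference models are pooled with equal shares, and the mapping from "speed" or "precision" to a mixture count is a hypothetical choice. The estimation step uses an EM-style update over the reference components that locally maximizes a lower bound on the expected log-likelihood of the standard model under the pooled references. All function names, the mixture counts, and the variance floor are illustrative.

```python
import math

# Hypothetical mapping from the prioritized property to a mixture count.
MIXTURES_BY_PRIORITY = {"speed": 1, "precision": 3}
VARIANCE_FLOOR = 1e-6  # assumed floor to keep variances strictly positive


def pool_references(reference_models):
    """Flatten reference mixtures into one list of (weight, mean, variance)."""
    share = 1.0 / len(reference_models)  # assumption: equal share per model
    return [(share * w, m, v) for model in reference_models for (w, m, v) in model]


def initial_model(pooled, k):
    """Seed k components from reference components spread over the sorted means."""
    ordered = sorted(pooled, key=lambda c: c[1])
    n = len(ordered)
    picks = [ordered[(2 * j + 1) * n // (2 * k)] for j in range(k)]
    return [(1.0 / k, m, v) for (_, m, v) in picks]


def expected_log_density(m, v, mu, s):
    """E[log N(x; mu, s)] for x drawn from N(m, v), in closed form."""
    return -0.5 * math.log(2 * math.pi * s) - ((m - mu) ** 2 + v) / (2 * s)


def estimate(pooled, model, iterations=25):
    """EM-style re-estimation of the standard model from reference statistics."""
    for _ in range(iterations):
        # E-step: soft-assign each reference component to standard components.
        resp = []
        for (_, m, v) in pooled:
            logs = [math.log(c) + expected_log_density(m, v, mu, s)
                    for (c, mu, s) in model]
            top = max(logs)
            unnorm = [math.exp(x - top) for x in logs]
            total = sum(unnorm)
            resp.append([u / total for u in unnorm])
        # M-step: update weight, mean, and variance of each standard component.
        updated = []
        for j in range(len(model)):
            mass = max(sum(w * r[j] for (w, _, _), r in zip(pooled, resp)), 1e-12)
            mean = sum(w * r[j] * m for (w, m, _), r in zip(pooled, resp)) / mass
            var = sum(w * r[j] * (v + (m - mean) ** 2)
                      for (w, m, v), r in zip(pooled, resp)) / mass
            updated.append((mass, mean, max(var, VARIANCE_FLOOR)))
        model = updated
    return model


def create_standard_model(reference_models, priority):
    """Determine the structure, initialize, then estimate the statistics."""
    k = MIXTURES_BY_PRIORITY[priority]
    pooled = pool_references(reference_models)
    return estimate(pooled, initial_model(pooled, k))


if __name__ == "__main__":
    references = [
        [(0.5, -2.0, 1.0), (0.5, 2.0, 1.0)],
        [(0.5, -2.2, 1.0), (0.5, 2.2, 1.0)],
    ]
    for priority in ("speed", "precision"):
        print(priority, create_standard_model(references, priority))
```

With a single mixture component, the update reduces to matching the pooled mean and variance of the reference models; with more components, the result depends on the initial values, which is consistent with the claim's wording "maximize or locally maximize".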
