Speech recognition from concurrent visual and audible inputs
First Claim
1. A learning apparatus for carrying out learning an expectation degree at which a vector quantization result is observed and which is used for vector-quantizing an input series and recognizing whether or not the input series corresponds to a recognition target, based on the vector quantization result, comprising:
- vector quantization means for vector-quantizing a time series of learning data pieces comprising image data and noise data and for outputting a series of identifiers each indicating a code vector; and
calculator means for calculating an expectation degree of each of the identifiers, from the series of identifiers obtained from the time series of learning data pieces.
1 Assignment
0 Petitions
Accused Products
Abstract
With respect to each of codes corresponding to code vectors in a code book stored in a code book storage section, an expectation degree storage section stores an expectation degree at which observation is expected when an integrated parameter with respect to a word as a recognition target is inputted. A vector quantization section vector-quantizes the integrated parameter and outputs a series of codes of a code vector which has a shortest distance to the integrated parameter.
-
Citations
20 Claims
-
1. A learning apparatus for carrying out learning an expectation degree at which a vector quantization result is observed and which is used for vector-quantizing an input series and recognizing whether or not the input series corresponds to a recognition target, based on the vector quantization result, comprising:
-
vector quantization means for vector-quantizing a time series of learning data pieces comprising image data and noise data and for outputting a series of identifiers each indicating a code vector; and
calculator means for calculating an expectation degree of each of the identifiers, from the series of identifiers obtained from the time series of learning data pieces. - View Dependent Claims (2, 3)
-
-
4. A learning method for carrying out learning an expectation degree at which a vector quantization result is observed and which is used for vector-quantizing an input series and recognizing whether or not the input series corresponds to a recognition target, based on the vector quantization result, comprising the steps of:
-
vector-quantizing a time series of learning data pieces comprising image data and noise data and of outputting a series of identifiers each indicating a code vector; and
calculating an expectation degree of each of the identifiers, from the series of identifiers obtained from the time series of learning data pieces.
-
-
5. A recording medium which records a program for making a computer execute learning an expectation degree at which a vector quantization result is observed and which is used for vector-quantizing an input series and recognizing whether or not the input series corresponds to a recognition target, based on the vector quantization result, the program comprises:
-
a vector quantization step of vector-quantizing a time series of learning data pieces comprising image data and noise data and of outputting a series of identifiers each indicating a code vector; and
a calculation step of calculating an expectation degree of each of the identifiers, from the series of identifiers obtained from the time series of learning data pieces.
-
-
6. A learning apparatus for obtaining a distance transition model expressing transition of a distance between a standard series and a code vector used for vector quantization, comprising:
-
normalization means for performing normalization of a time axis with respect to a time series of learning data pieces comprising image data and noise data and for outputting the standard series; and
distance calculation means for calculating a distance between the standard series and the code vector and for outputting transition of the distance. - View Dependent Claims (7, 8)
-
-
9. A learning method for obtaining a distance transition model expressing transition of a distance between a standard series and a code vector used for vector quantization, comprising the steps of
performing normalization of a time axis with respect to a time series of learning data pieces comprising image data and noise data and of outputting the standard series; - and
calculating a distance between the standard series and the code vector and of outputting transition of the distance.
- and
-
10. A recording medium which records a program for making a computer execute learning for obtaining a distance transition model expressing transition of a distance between a standard series and a code vector used for vector quantization, characterized by comprising:
-
a normalization step of performing normalization of a time axis with respect to a time series of learning data pieces comprising image data and noise data and of outputting the standard series; and
a distance calculation step of calculating a distance between the standard series and the code vector and of outputting transition of the distance.
-
-
11. A recognition apparatus for recognizing whether or not a time series of input data pieces comprising image data and noise data corresponds to at least one recognition target, comprising:
-
integration means for integrating a time series of first input data pieces and a time series of second input data pieces, thereby to output a time series of integrated data pieces; and
recognition means for recognizing whether or not the time series of first or second input data pieces corresponds to at least one recognition target, based on transition of a distance obtained from a vector based on the time series of integrated data pieces. - View Dependent Claims (12)
-
-
13. A recognition method for recognizing whether or not a time series of input data pieces comprising image data and noise data corresponds to at least one recognition target, comprising the steps of:
-
integrating a time series of first input data pieces and a time series of second input data pieces, thereby to output a time series of integrated data pieces; and
recognizing whether or not the time series of first or second input data pieces corresponds to at least one recognition target, based on transition of a distance obtained from a vector based on the time series of integrated data pieces.
-
-
14. A recording medium which records a program for making a computer execute recognition processing for recognizing whether or not a time series of input data pieces comprising image data and noise data corresponds to at least one recognition target, wherein the program comprises:
-
an integration step of integrating a time series of first input data pieces and a time series of second input data pieces, thereby to output a time series of integrated data pieces; and
a recognition step of recognizing whether or not the time series of first or second input data pieces corresponds to at least one recognition target, based on transition of a distance obtained from a vector based on the time series of integrated data pieces.
-
-
15. A recognition apparatus for recognizing whether or not a time series of input data pieces corresponds to a recognition target, comprising:
-
storage means for storing an expectation degree at which observation is expected when the time series of input data pieces is inputted, with respect to each of identifiers corresponding to code vectors used for vector quantization;
vector quantization means for vector-quantizing the time series of input data pieces and for outputting a series of identifiers indicating code vectors;
properness detection means for obtaining properness as to whether or not the time series of input data pieces corresponds to the recognition target, with use of the series of identifiers obtained from the input data and the expectation degrees of the identifiers; and
recognition means for recognizing whether or not the time series of input data pieces corresponds to the recognition target, based on the properness, wherein the vector quantization means outputs an identifier of a code vector which has a shortest distance to the input data piece with respect to each of the time series of input data pieces.
-
-
16. A recognition apparatus for recognizing whether or not a time series of input data pieces corresponds to a recognition target, comprising:
-
storage means for storing an expectation degree at which observation is expected when the time series of input data pieces is inputted, with respect to each of identifiers corresponding to code vectors used for vector quantization;
vector quantization means for vector quantizing the time series of input data pieces and for outputting a series of identifiers indicating code vectors;
properness detection means for obtaining properness as to whether or not the time series of input data pieces corresponds to the recognition target, with use of the series of identifiers obtained from the input data and the expectation degrees of the identifiers; and
recognition means for recognizing whether or not the time series of input data pieces corresponds to the recognition target, based on the properness, wherein the time series of input data pieces is an integrated parameter which integrates a characteristic parameter of a speech and a characteristic parameter of an image of lips when the speech is spoken.
-
-
17. A recognition apparatus comprising:
-
detection means for detecting a characteristic parameter with respect to each of a plurality of input data pieces;
normalization means for normalizing the characteristic parameter of each of the plurality of input data pieces;
integration means for integrating a plurality of normalized characteristic parameters into an integrated parameter; and
recognition means for recognizing whether or not one or more of the plurality of input data pieces correspond to a recognition target, based on the integrated parameter, wherein the plurality of data pieces include at least data pieces of an image and a speech. - View Dependent Claims (18)
-
-
19. A recognition apparatus comprising:
-
detection means for detecting a characteristic parameter with respect to each of a plurality of input data pieces;
normalization means for normalizing the characteristic parameter of each of the plurality of input data pieces;
integration means for integrating a plurality of normalized characteristic parameters into an integrated parameter;
a recognition means for recognizing whether or not one or more of the plurality of input data pieces correspond to a recognition target, based on the integrated parameter; and
time axis normalization means for normalizing the integrated parameter in a time axis direction.
-
-
20. A recognition apparatus for recognizing whether or not a time series of input data pieces corresponds to at least one recognition target, comprising:
-
integration means for integrating a time series of first input data pieces and a time series of second input data pieces, thereby to output a time series of integrated data pieces;
recognition means for recognizing whether or not the time series of first or second input data pieces corresponds to at least one recognition target, based on transition of a distance obtained from a vector based on the time series of integrated data pieces, wherein a code vector in a code book used for vector quantization and a distance transition model expressing transition of a distance to a standard series, the recognition means accumulates the distance when a vector quantization result obtained by vector-quantizing a vector based on the integrated data pieces, with use of the code book is observed, thereby to recognize whether or not the time series of the first or second input data pieces correspond to at least one recognition target, based on an accumulation result.
-
Specification