Learning a verification model for speech recognition based on extracted recognition and language feature information
First Claim
Patent Images
1. A speech processing apparatus comprising:
- a recognition feature information extracting unit which extracts recognition feature information having a characteristic of recognition result data obtained by performing a speech recognition process on an inputted speech, from said recognition result data, said recognition feature information being speech recognition result data for learning which includes plural recognition hypotheses;
a language feature information extracting unit which extracts language feature information having a characteristic of a pre-registered language resource from said pre-registered language resource, said pre-registered language resource including document data, sentences, text data, word sequences, or dictionaries, the extracted language feature information including linguistic characteristics included in an existing word sequence, or importance of similarity of a document; and
a verification model obtaining unit which obtains a verification model by a learning process based on the extracted recognition feature information and language feature information, the obtained verification model being used to verify a speech recognition result data which is inputted as a verification target to a speech recognition system, wherein said verification model obtaining unit obtains a discriminative model as said verification model, said discriminative model being indicative of a correct and false label or degree of importance according to use.
1 Assignment
0 Petitions
Accused Products
Abstract
A speech processing apparatus 101 includes a recognition feature extracting unit 12 that extracts recognition feature information which is a characteristic of a speech recognition result 15 obtained by performing a speech recognition process on an inputted speech from the speech recognition result 15; a language feature extracting unit 11 that extracts language feature information which is a characteristic of a pre-registered language resource 14 from the language resource 14; and a model learning unit 13 that obtains a verification model 16 by a learning process based on the extracted recognition feature information and language feature information.
13 Citations
26 Claims
-
1. A speech processing apparatus comprising:
-
a recognition feature information extracting unit which extracts recognition feature information having a characteristic of recognition result data obtained by performing a speech recognition process on an inputted speech, from said recognition result data, said recognition feature information being speech recognition result data for learning which includes plural recognition hypotheses; a language feature information extracting unit which extracts language feature information having a characteristic of a pre-registered language resource from said pre-registered language resource, said pre-registered language resource including document data, sentences, text data, word sequences, or dictionaries, the extracted language feature information including linguistic characteristics included in an existing word sequence, or importance of similarity of a document; and a verification model obtaining unit which obtains a verification model by a learning process based on the extracted recognition feature information and language feature information, the obtained verification model being used to verify a speech recognition result data which is inputted as a verification target to a speech recognition system, wherein said verification model obtaining unit obtains a discriminative model as said verification model, said discriminative model being indicative of a correct and false label or degree of importance according to use. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A speech processing method comprising:
-
extracting recognition feature information having a characteristic of recognition result data obtained by performing a speech recognition process on an inputted speech, from said recognition result data by a speech processing apparatus, said recognition feature information being speech recognition result data for learning which includes plural recognition hypotheses; extracting language feature information having a characteristic of a pre-registered language resource from said pre-registered language resource by said speech processing apparatus, said pre-registered language resource including document data, sentences, text data, word sequences, or dictionaries, the extracted language feature information including linguistic characteristics included in an existing word sequence, or importance or similarity of a document; and obtaining a verification model by a learning process based on the extracted recognition feature information and language feature information by said speech processing apparatus, the obtained verification model being used to verify a speech recognition result data which is inputted as a verification target to a speech recognition system, wherein said obtained verification model is a discriminative model, said discriminative model being indicative of a correct and false label or degree of importance according to use. - View Dependent Claims (17, 18, 19, 20, 21, 22, 23, 24, 25)
-
-
26. A speech processing apparatus comprising:
-
means for extracting recognition feature information having a characteristic of recognition result data obtained by performing a speech recognition process on an inputted speech, from said recognition result data, said recognition feature information being speech recognition result data for learning which includes plural recognition hypotheses; means for extracting language feature information having a characteristic of a pre-registered language resource from said pre-registered language resource, said pre-registered language resource including document data, sentences, text data, word sequences, or dictionaries, the extracted language feature information including linguistic characteristics included in an existing word sequence, or importance or similarity of a document; and means for obtaining a verification model by a learning process based on the extracted recognition feature information and language feature information, the obtained verification model being used to verify a speech recognition result data which is inputted as a verification target to a speech recognition system, wherein said obtained verification model is a discriminative model, said discriminative model being indicative of a correct and false label or degree of importance according to use.
-
Specification