SPEECH SEARCH DEVICE AND SPEECH SEARCH METHOD
First Claim
1. A speech search device comprising:
- a recognizer to refer to an acoustic model and a plurality of language models having different learning data and perform voice recognition on an input speech, to acquire an acoustic likelihood and a language likelihood of a recognized character string for each of said plurality of language models;
a character string dictionary storage to store a character string dictionary in which pieces of information showing character strings of search target words each serving as a target for speech search are stored;
a character string comparator to compare the recognized character string for each of said plurality of language models, the recognized character string being acquired by said recognizer, with the character strings of the search target words which are stored in said character string dictionary and calculate a character string matching score showing a degree of matching of said recognized character string with respect to each of the character strings of said search target words, to acquire both a character string of a search target word having a highest character string matching score and this character string matching score for each of said recognized character strings; and
a search result determinator to calculate a total score as a weighted sum of two or more of said character string matching score acquired by said character string comparator, and the acoustic likelihood and the language likelihood acquired by said recognizer, and output, as a search result, one or more search target words in descending order of calculated total scores.
1 Assignment
0 Petitions
Accused Products
Abstract
Disclosed is a speech search device including a recognizer 2 that refers to an acoustic model and language models having different learning data and performs voice recognition on an input speech, to acquire a recognized character string for each language model, a character string comparator 6 that compares the recognized character string for each language models with the character strings of search target words stored in a character string dictionary, and calculates a character string matching score showing the degree of matching of the recognized character string with respect to each of the character strings of the search target words, to acquire both a character string having the highest character string matching score and this character string matching score for each recognized character strings, and a search result determinator 8 that refers to the acquired score and outputs one or more search target words in descending order of the scores.
-
Citations
8 Claims
-
1. A speech search device comprising:
-
a recognizer to refer to an acoustic model and a plurality of language models having different learning data and perform voice recognition on an input speech, to acquire an acoustic likelihood and a language likelihood of a recognized character string for each of said plurality of language models; a character string dictionary storage to store a character string dictionary in which pieces of information showing character strings of search target words each serving as a target for speech search are stored; a character string comparator to compare the recognized character string for each of said plurality of language models, the recognized character string being acquired by said recognizer, with the character strings of the search target words which are stored in said character string dictionary and calculate a character string matching score showing a degree of matching of said recognized character string with respect to each of the character strings of said search target words, to acquire both a character string of a search target word having a highest character string matching score and this character string matching score for each of said recognized character strings; and a search result determinator to calculate a total score as a weighted sum of two or more of said character string matching score acquired by said character string comparator, and the acoustic likelihood and the language likelihood acquired by said recognizer, and output, as a search result, one or more search target words in descending order of calculated total scores. - View Dependent Claims (3, 4)
-
-
2. (canceled)
-
5. A speech search device comprising:
-
a recognizer to refer to an acoustic model and at least one language model and perform voice recognition on an input speech, to acquire an acoustic likelihood and a language likelihood of a recognized character string for each of said one or more language models; a character string dictionary storage to store a character string dictionary in which pieces of information showing character strings of search target words each serving as a target for speech search are stored; a character string comparator to acquire an external recognized character string which is acquired by, in an external device, referring to an acoustic model and a language model having learning data different from that of the one or more language models which are referred to by said recognizer, and performing voice recognition on said input speech, compare the external recognized character string acquired thereby and the recognized character string acquired by said recognizer with the character strings of the search target words stored in said character string dictionary, and calculate character string matching scores showing degrees of matching of said external recognized character string and said recognized character string with respect to each of the character strings of said search target words, to acquire both a character string of a search target word having a highest character string matching score and this character string matching score for each of said external recognized character string and said recognized character string; and a search result determinator to calculate a total score as a weighted sum of two or more of said character string matching score acquired by said character string comparator, and the acoustic likelihood and the language likelihood of said recognized character string which are acquired by said recognizer, and an acoustic likelihood and a language likelihood of said external recognized character string which are acquired from said external device, and output, as a search result, one or more search target words in descending order of calculated total scores. - View Dependent Claims (7)
-
-
6. (canceled)
-
8. A speech search method comprising the steps of:
-
in a recognizer, referring to an acoustic model and a plurality of language models having different learning data and performing voice recognition on an input speech, to acquire an acoustic likelihood and a language likelihood of a recognized character string for each of said plurality of language models; in a character string comparator, comparing the recognized character string for each of said plurality of language models with character strings of search target words each serving as a target for speech search, the character strings being stored in a character string dictionary, and calculating a character string matching score showing a degree of matching of said recognized character string with respect to each of the character strings of said search target words, to acquire both a character string of a search target word having a highest character string matching score and this character string matching score for each of said recognized character strings; and in a search result determinator, calculating a total score as a weighted sum of two or more of said character string matching score, and said acoustic likelihood and said language likelihood, and outputting, as a search result, one or more search target words in descending order of calculated total scores.
-
Specification