Method and apparatus for performing speech keyword retrieval
First Claim
1. A method for retrieving keyword from speech, comprising:
- configuring, by an apparatus comprising a processor circuitry, language models, wherein at least two of the language models each comprises a recognition model and a decoding model that corresponds to the recognition model;
extracting a speech feature, by the apparatus, from to-be-processed speech data;
determining, by the apparatus, which of the recognition models has a highest language matching rate by performing, using the recognition models, language matching on the extracted speech feature;
identifying, by the apparatus, the decoding model which corresponds to the determined recognition model;
decoding, by the apparatus, the extracted speech feature by using the identified decoding model, and obtaining a word recognition result; and
matching, by the apparatus, a keyword in a keyword dictionary and the word recognition result with each other, and outputting a matched keyword on a display of the apparatus.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and an apparatus are provided for retrieving keyword. The apparatus configures at least two types of language models in a model file, where each type of language model includes a recognition model and a corresponding decoding model; the apparatus extracts a speech feature from the to-be-processed speech data; performs language matching on the extracted speech feature by using recognition models in the model file one by one, and determines a recognition model based on a language matching rate; and determines a decoding model corresponding to the recognition model; decoding the extracted speech feature by using the determined decoding model, and obtains a word recognition result after the decoding; and matches a keyword in a keyword dictionary and the word recognition result, and outputs a matched keyword.
-
Citations
15 Claims
-
1. A method for retrieving keyword from speech, comprising:
-
configuring, by an apparatus comprising a processor circuitry, language models, wherein at least two of the language models each comprises a recognition model and a decoding model that corresponds to the recognition model; extracting a speech feature, by the apparatus, from to-be-processed speech data; determining, by the apparatus, which of the recognition models has a highest language matching rate by performing, using the recognition models, language matching on the extracted speech feature; identifying, by the apparatus, the decoding model which corresponds to the determined recognition model; decoding, by the apparatus, the extracted speech feature by using the identified decoding model, and obtaining a word recognition result; and matching, by the apparatus, a keyword in a keyword dictionary and the word recognition result with each other, and outputting a matched keyword on a display of the apparatus. - View Dependent Claims (2, 3, 4, 5)
-
-
6. An apparatus for performing speech keyword retrieval, comprising a processor and a non-transitory storage medium accessible to the processor, the non-transitory storage medium configured to store units comprising:
-
a file configuring unit that configures language models, and at least two of the language models each comprises a recognition model and a decoding model that corresponds to the recognition model; a feature extracting unit that extracts a speech feature from to-be-processed speech data, and sends the extracted speech feature to a language recognition unit; the language recognition unit that determines which of the recognition models has a highest language matching rate by performing, using the recognition models, language matching on the extracted speech feature;
identifies the decoding model that corresponds to the determined recognition model, and sends the extracted speech feature to a decoding unit;the decoding unit that decodes the extracted speech feature by using the identified decoding model, obtains a word recognition result, and sends the word recognition result to a keyword search unit; and the keyword search unit that matches a keyword in a keyword dictionary and the word recognition result with each other, and outputs a matched keyword on a display of the apparatus. - View Dependent Claims (7, 8, 9, 10)
-
-
11. A phone for performing speech keyword retrieval, comprising a processor and a non-transitory storage medium accessible to the processor, the phone configured to perform acts comprising:
-
configuring language models, and at least two of the language models each comprises a recognition model and a decoding model that corresponds to the recognition model; extracting a speech feature from to-be-processed speech data, and sending the extracted speech feature; determining which of the recognition models has a highest language matching rate by performing, using the recognition models, language matching on the extracted speech feature;
identifying the decoding model that corresponds to the determined recognition model, and sending the extracted speech feature;decoding the extracted speech feature by using the identified decoding model, obtaining a word recognition result, and sending the word recognition result; and matching a keyword in a keyword dictionary and the word recognition result with each other, and outputting a matched keyword on a display of the phone. - View Dependent Claims (12, 13, 14, 15)
-
Specification