Domain-based dialog speech recognition method and apparatus
First Claim
1. A domain-based dialog speech recognition method comprising:
- performing speech recognition by using a first language model and generating a first recognition result including a plurality of first recognition sentences;
selecting a plurality of candidate domains, by using a word included in each of the first recognition sentences and having a confidence score equal to or higher than a predetermined threshold, as a domain keyword;
performing the speech recognition with the first recognition result, by using an acoustic model specific to each of the candidate domains and a second language model and generating a plurality of second recognition sentences; and
selecting one or more final recognition sentences from the first recognition sentences and the second recognition sentences.
1 Assignment
0 Petitions
Accused Products
Abstract
A domain-based speech recognition method and apparatus, the method including: performing speech recognition by using a first language model and generating a first recognition result including a plurality of first recognition sentences; selecting a plurality of candidate domains, by using a word included in each of the first recognition sentences and having a confidence score equal to or higher than a predetermined threshold, as a domain keyword; performing speech recognition with the first recognition result, by using an acoustic model specific to each of the candidate domains and a second language model and generating a plurality of second recognition sentences; and selecting at least one or more final recognition sentence from the first recognition sentences and the second recognition sentences. According to this method and apparatus, the effect of a domain extraction error by misrecognition of a word on selection of a final recognition result can be minimized.
322 Citations
25 Claims
-
1. A domain-based dialog speech recognition method comprising:
-
performing speech recognition by using a first language model and generating a first recognition result including a plurality of first recognition sentences;
selecting a plurality of candidate domains, by using a word included in each of the first recognition sentences and having a confidence score equal to or higher than a predetermined threshold, as a domain keyword;
performing the speech recognition with the first recognition result, by using an acoustic model specific to each of the candidate domains and a second language model and generating a plurality of second recognition sentences; and
selecting one or more final recognition sentences from the first recognition sentences and the second recognition sentences. - View Dependent Claims (2, 3, 4, 5, 6, 24, 25)
-
-
7. A computer-readable recording medium having embodied thereon a computer program sequence for a domain-based dialog speech recognition method comprising:
-
performing speech recognition by using a first language model and generating a first recognition result including a plurality of first recognition sentences;
selecting a plurality of candidate domains, by using a word included in each of the first recognition sentences and having a confidence score equal to or higher than a predetermined threshold, as a domain keyword;
performing the speech recognition with the first recognition result, by using an acoustic model specific to each of the candidate domains and a second language model, and generating a plurality of second recognition sentences; and
selecting one or more final recognition sentences from the first recognition sentences and the second recognition sentences.
-
-
8. A domain-based dialog speech recognition apparatus comprising:
-
a first speech recognition unit which performs speech recognition of input speech by using a first language model and generates a first recognition result including a plurality of first recognition sentences;
a domain extraction unit which selects a plurality of candidate domains by using the plurality of first recognition sentences provided by the first speech recognition unit;
a second speech recognition unit which performs the speech recognition with the first recognition result of the first speech recognition unit, by using an acoustic model specific to each of the candidate domains selected in the domain extraction unit and a second language model and generates a plurality of second recognition sentences; and
a selection unit which selects a plurality of final recognition sentences from the first recognition sentences provided by the first speech recognition unit and the second recognition sentences provided by the second speech recognition unit. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23)
-
Specification