Error-tolerant language understanding system and method
First Claim
1. An error-tolerant language understanding method comprising the following steps:
- (a) Inputting at least one word sequence and its corresponding acoustic score;
(b) Parsing said word sequence to obtain a corresponding concept sequence set;
(c) Attach at least one confidence measure sequence to each concept sequence in the said concept sequence set and compare the concept sequences together with their associated confidence measure sequences against at least one exemplary concept sequence to obtain at least one edit operation sequence;
(d) According to said acoustic score of said word sequence, the corresponding grammar score of a concept sequence in said concept sequence set, the corresponding example score of said exemplary concept sequence and the corresponding edit operation score of said edit operation sequence to determine the most possible concept sequence; and
(e) Translating said most possible concept sequence into a semantic frame,wherein the step (d) further comprising;
Using a probabilistic scoring function to determine said the most possible concept sequence, and said probabilistic scoring function is formulated as follows;
wherein is the most possible word sequence in the sentence list that outputs from the speech recognition module, is the most possible concept parse forest, is the corresponding concept sequence, is the corresponding confidence measure sequences, is the most possible exemplary concept sequence and is the most possible edit operation sequence, SW is the acoustic score, SF is the grammar score, SK is the example score and SE is the edit operation score, wherein U represents a utterance signal, W represents said possible word sequence in the sentence list that outputs from the speech recognition module, F represents a possible concept parses forest of W T is a concept parse tree of said concept parse forest F, A→
α
is a concept grammar that generates said T, A is a left-hand-side symbol and α
is right-hand-side symbols, m is the number of concept in exemplary concept sequence K, k1m is a brief note of k1 . . . km, ki is the ith concept, e is an edit operation in edit operation sequence E, said utterance signal U is processed with X number of confidence measure modules and X number of confidence measure sequences are generated, one of said confidence measure sequences Mh corresponding to the r number of c1 . . . cr concepts is expressed as
1 Assignment
0 Petitions
Accused Products
Abstract
The present invention relates to an error-tolerant language understanding, system and method. The system and the method is using example sentences to provide the clues for detecting and recovering errors. The procedure of detection and recovery is guided by a probabilistic scoring function which integrated the scores from the speech recognizer, concept parser, the scores of concept-bigram and edit operations, such as deleting, inserting and substituting concepts. Meanwhile, the score of edit operations refers the confidence measure achieving more precise detection and recovery of the speech recognition errors. That said, a concept with lower confidence measure tends to be deleted or substituted, while a concept with higher one tends to be retained.
-
Citations
1 Claim
-
1. An error-tolerant language understanding method comprising the following steps:
-
(a) Inputting at least one word sequence and its corresponding acoustic score; (b) Parsing said word sequence to obtain a corresponding concept sequence set; (c) Attach at least one confidence measure sequence to each concept sequence in the said concept sequence set and compare the concept sequences together with their associated confidence measure sequences against at least one exemplary concept sequence to obtain at least one edit operation sequence; (d) According to said acoustic score of said word sequence, the corresponding grammar score of a concept sequence in said concept sequence set, the corresponding example score of said exemplary concept sequence and the corresponding edit operation score of said edit operation sequence to determine the most possible concept sequence; and (e) Translating said most possible concept sequence into a semantic frame, wherein the step (d) further comprising; Using a probabilistic scoring function to determine said the most possible concept sequence, and said probabilistic scoring function is formulated as follows; wherein is the most possible word sequence in the sentence list that outputs from the speech recognition module, is the most possible concept parse forest, is the corresponding concept sequence, is the corresponding confidence measure sequences, is the most possible exemplary concept sequence and is the most possible edit operation sequence, SW is the acoustic score, SF is the grammar score, SK is the example score and SE is the edit operation score, wherein U represents a utterance signal, W represents said possible word sequence in the sentence list that outputs from the speech recognition module, F represents a possible concept parses forest of W T is a concept parse tree of said concept parse forest F, A→
α
is a concept grammar that generates said T, A is a left-hand-side symbol and α
is right-hand-side symbols,m is the number of concept in exemplary concept sequence K, k1m is a brief note of k1 . . . km, ki is the ith concept, e is an edit operation in edit operation sequence E, said utterance signal U is processed with X number of confidence measure modules and X number of confidence measure sequences are generated, one of said confidence measure sequences Mh corresponding to the r number of c1 . . . cr concepts is expressed as
-
Specification