Detecting speech input phrase confusion risk
First Claim
1. On a computing system comprising a processor, a method of detecting phrase confusion risk in a proposed speech grammar for a computer program, the method comprising:
- providing, by downloading to a remote computing device, via the processor a speech grammar development tool executable by the remote computing device toreceive input of a text representation of each of a plurality of proposed speech grammar terms,for each proposed speech grammar term, convert the text representation to a phonetic representation of the speech grammar term,determine whether a portion of the proposed speech grammar term has a spoken duration below a threshold duration, and if the portion of the proposed speech grammar term has a spoken duration below the threshold duration, then omit the portion from the phonetic representation of the proposed speech grammar term,compare via a speech recognition engine the phonetic representation of the speech grammar term to the phonetic representations of other speech grammar terms using a weighted similarity matrix, andprovide an output regarding risk of confusion between two proposed speech grammar terms based upon a comparison by the speech recognition engine of the phonetic representations of the two proposed speech grammar terms;
receiving via the processor data regarding incorrect speech grammar term identification; and
modifying via the processor speech grammar used by the speech recognition engine, wherein modifying the speech grammar comprises modifying one or more weights in the weighted similarity matrix based upon the data.
3 Assignments
0 Petitions
Accused Products
Abstract
Embodiments are disclosed that relate to identifying phonetically similar speech grammar terms during computer program development. For example, one disclosed embodiment provides a method including providing a speech grammar development tool configured to receive input of a text representation of each of a plurality of proposed speech grammar terms, convert each text representation to a phonetic representation of the speech grammar term, compare the phonetic representation of the speech grammar term to the phonetic representations of other speech grammar terms using a weighted similarity matrix, and provide an output regarding risk of confusion between two proposed speech grammar terms based upon a comparison of the phonetic representations of the two proposed speech grammar terms. The method further includes receiving data regarding incorrect speech grammar term identification, and modifying one or more weights in the weighted similarity matrix based upon the data.
-
Citations
18 Claims
-
1. On a computing system comprising a processor, a method of detecting phrase confusion risk in a proposed speech grammar for a computer program, the method comprising:
-
providing, by downloading to a remote computing device, via the processor a speech grammar development tool executable by the remote computing device to receive input of a text representation of each of a plurality of proposed speech grammar terms, for each proposed speech grammar term, convert the text representation to a phonetic representation of the speech grammar term, determine whether a portion of the proposed speech grammar term has a spoken duration below a threshold duration, and if the portion of the proposed speech grammar term has a spoken duration below the threshold duration, then omit the portion from the phonetic representation of the proposed speech grammar term, compare via a speech recognition engine the phonetic representation of the speech grammar term to the phonetic representations of other speech grammar terms using a weighted similarity matrix, and provide an output regarding risk of confusion between two proposed speech grammar terms based upon a comparison by the speech recognition engine of the phonetic representations of the two proposed speech grammar terms; receiving via the processor data regarding incorrect speech grammar term identification; and modifying via the processor speech grammar used by the speech recognition engine, wherein modifying the speech grammar comprises modifying one or more weights in the weighted similarity matrix based upon the data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. On a computing device comprising a processor, a method of detecting phrase confusion risk in proposed speech grammar for a computer program, the method comprising:
-
obtaining via the processor a speech grammar development tool by downloading the speech grammar development tool from a server computing system; receiving via the processor an input of a text representation of a speech grammar term; determining via the processor a phonetic representation of the speech grammar term from the text representation; if a portion of the speech grammar term comprises a duration below a threshold duration, then omitting via the processor the portion of the phonetic representation of the speech grammar term; comparing via a speech recognition engine executed by the processor the phonetic representation of the speech grammar term to a phonetic representation of another speech grammar term, wherein comparing comprises using a weighted similarity matrix; outputting to a display a representation of a risk of confusion between the speech grammar term and the other speech grammar term based upon a comparison by the speech recognition engine of the speech grammar term and the other speech grammar term; determining that the speech grammar term was incorrectly identified; sending data regarding the incorrect determination to the server computing system; receiving a modified weighted similarity matrix from the server computing system; and modifying via the processor the speech grammar used by the speech recognition engine based upon the modified weighted similarity matrix. - View Dependent Claims (10, 11, 12, 13, 14)
-
-
15. A computing system, comprising:
-
a processor; and a memory storing instructions executable by the processor to; provide, by downloading to a remote computing device, a speech grammar development tool configured to receive input of a text representation of each of a plurality of proposed speech grammar terms; for each proposed speech grammar term, convert the text representation to a phonetic representation of the speech grammar term, determine whether the proposed speech grammar term comprises a portion that has a spoken duration below a threshold duration, and when the portion of the proposed speech grammar term has a spoken duration below the threshold duration, then omit the portion from the phonetic representation of the proposed speech grammar term, and compare via a speech recognition engine the phonetic representation of the speech grammar term to the phonetic representations of other speech grammar terms using a weighted similarity matrix, the weighted similarity matrix comprising, for each phoneme of a plurality of phonemes, a weight for each phonetic feature of a plurality of phonetic features, and provide an output regarding risk of confusion between two proposed speech grammar terms based upon a comparison via a speech recognition engine of the phonetic representations of the two proposed speech grammar terms; receive data regarding one or more incorrect speech grammar term identifications, the data comprising an actual pronunciation for a selected speech grammar term that is different from the expected pronunciation of the selected speech grammar term; store a phonetic representation of the actual pronunciation of the selected speech grammar term; and modify speech grammar to be used by the speech recognition engine, wherein modifying the speech grammar comprises modifying one or more weights in the weighted similarity matrix based upon the data. - View Dependent Claims (16, 17, 18)
-
Specification