Apparatus and methods for speech recognition including individual or speaker class dependent decoding history caches for fast word acceptance or rejection
Abstract
A method and an apparatus are provided for performing speech recognition on speech segments frequently input by a user. The method and the apparatus include use of keyword scoring in connection with a speech recognition vocabulary, a temporary score, and a predetermined margin to determine an appropriate output as being representative of the input speech segment.
25 Claims
1. A method for performing speech recognition on speech segments frequently input by a user, the method comprising the steps of:
(a) inputting at least one keyword spoken by the user;
(b) decoding the at least one keyword by scoring the at least one keyword against a speech recognition vocabulary to generate a decoded keyword and at least one score for the decoded keyword;
(c) storing the decoded keyword and the at least one score;
(d) inputting a speech segment spoken by the user;
(e) comparing the input speech segment to the decoded keyword in order to generate a temporary score; and
(f) comparing the temporary score against the at least one stored score and if the temporary score is one of within a predetermined margin of, equivalent to, and larger than the at least one stored score, then the decoded keyword is output as being representative of the input speech segment, else the input speech segment is scored against the speech recognition vocabulary to generate a second decoded keyword and at least one score for the second decoded keyword.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
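Steps (a) through (f) of claim 1 can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the patent: speech segments are represented as plain strings, a string-similarity ratio from Python's `difflib` stands in for an acoustic score, higher scores are assumed to be better, and the names `CacheEntry`, `toy_score`, `full_decode`, `enroll`, and `recognize` are hypothetical.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher
from typing import List, Sequence, Tuple


@dataclass
class CacheEntry:
    keyword: str  # decoded keyword produced in step (b)
    score: float  # score stored in step (c)


def toy_score(segment: str, word: str) -> float:
    # Assumption: string similarity stands in for an acoustic match score.
    return SequenceMatcher(None, segment, word).ratio()


def full_decode(segment: str, vocabulary: Sequence[str]) -> Tuple[str, float]:
    # Score the segment against the whole vocabulary and keep the best word.
    best = max(vocabulary, key=lambda w: toy_score(segment, w))
    return best, toy_score(segment, best)


def enroll(segment: str, cache: List[CacheEntry], vocabulary: Sequence[str]) -> None:
    # Steps (a)-(c): decode a spoken keyword and store it with its score.
    keyword, score = full_decode(segment, vocabulary)
    cache.append(CacheEntry(keyword, score))


def recognize(segment: str, cache: List[CacheEntry],
              vocabulary: Sequence[str], margin: float) -> Tuple[str, float, bool]:
    # Steps (d)-(f): try the cached keywords first, then fall back.
    for entry in cache:
        temp = toy_score(segment, entry.keyword)  # temporary score, step (e)
        # Within the margin of, equal to, or larger than the stored score.
        if temp >= entry.score - margin:
            return entry.keyword, temp, True
    # Otherwise score against the full vocabulary (the "else" branch of (f)).
    keyword, score = full_decode(segment, vocabulary)
    return keyword, score, False
```

In this sketch the cache check costs one comparison per cached keyword, while the fallback costs one comparison per vocabulary word, which is the motivation for checking the cache first when a user repeats the same inputs.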
11. Apparatus for performing speech recognition on speech segments frequently input by a user, the apparatus comprising:
means for inputting at least one keyword spoken by the user;
means for decoding the at least one keyword by scoring the at least one keyword against a speech recognition vocabulary to generate a decoded keyword and at least one score for the decoded keyword;
means for storing the decoded keyword and the at least one score;
means for inputting a speech segment spoken by the user;
means for comparing the input speech segment to the decoded keyword in order to generate a temporary score; and
means for comparing the temporary score against the at least one stored score and if the temporary score is one of within a predetermined margin of, equivalent to, and larger than the at least one stored score, then the decoded keyword is output as being representative of the input speech segment, else the input speech segment is scored against the speech recognition vocabulary to generate a second decoded keyword and at least one score for the second decoded keyword.
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19, 20
21. A system for recognizing keywords frequently input by a speaker, the system comprising:
a speech recognition engine for decoding at least one keyword uttered by the speaker by scoring the at least one keyword against a speech recognition vocabulary to generate a decoded keyword and at least one score for the decoded keyword;
a cache database for storing the decoded keyword and the at least one score associated therewith in accordance with a predetermined identity of the speaker;
means for performing a Viterbi alignment process on an input speech segment uttered by the speaker wherein the input speech segment is compared to the decoded keyword to generate a temporary score; and
a comparator for comparing the temporary score against the at least one stored score and if the temporary score is one of within a predetermined margin of, equivalent to, and larger than the at least one stored score, then the decoded keyword is output as being representative of the input speech segment, else the input speech segment is scored against the speech recognition vocabulary to generate a second decoded keyword and at least one score for the second decoded keyword.
Dependent claims: 22, 23, 24, 25
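The speaker-keyed cache and Viterbi alignment of claim 21 can be sketched in the same spirit. This is a toy under stated assumptions, not the patented implementation: frames are single floats, each keyword model is a hypothetical left-to-right chain of state means with unit-variance Gaussian emissions and no transition penalties, the alignment score is a per-frame log-likelihood (higher is better), and the names `viterbi_align_score` and `SpeakerCache` are invented for illustration.

```python
import math
from collections import defaultdict
from typing import Dict, List, Optional, Sequence, Tuple


def viterbi_align_score(frames: Sequence[float], state_means: Sequence[float]) -> float:
    """Per-frame log-score of the best left-to-right alignment of frames to states."""
    n_states = len(state_means)
    if n_states == 0 or len(frames) < n_states:
        return -math.inf  # every state needs at least one frame

    def emit(x: float, mu: float) -> float:
        # Unit-variance Gaussian log-likelihood, constant term dropped.
        return -0.5 * (x - mu) ** 2

    best = [-math.inf] * n_states
    best[0] = emit(frames[0], state_means[0])  # the path must start in state 0
    for x in frames[1:]:
        new = [-math.inf] * n_states
        for s in range(n_states):
            # Either stay in state s or advance from state s - 1.
            prev = max(best[s], best[s - 1] if s > 0 else -math.inf)
            if prev > -math.inf:
                new[s] = prev + emit(x, state_means[s])
        best = new
    return best[-1] / len(frames)  # the path must end in the last state


class SpeakerCache:
    """Decoded keywords and stored scores, keyed by speaker identity."""

    def __init__(self, margin: float) -> None:
        self.margin = margin
        self.entries: Dict[str, List[Tuple[str, float]]] = defaultdict(list)

    def store(self, speaker_id: str, keyword: str, score: float) -> None:
        self.entries[speaker_id].append((keyword, score))

    def lookup(self, speaker_id: str, frames: Sequence[float],
               models: Dict[str, Sequence[float]]) -> Optional[str]:
        # Align the input only against this speaker's cached keywords.
        for keyword, stored in self.entries.get(speaker_id, []):
            temp = viterbi_align_score(frames, models[keyword])
            if temp >= stored - self.margin:  # comparator test
                return keyword
        return None  # caller falls back to full-vocabulary decoding
```

A `None` result corresponds to the "else" branch of the comparator, where the segment would be scored against the full speech recognition vocabulary.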
Specification