System and method for utterance verification of chinese long and short keywords
First Claim
Patent Images
1. In an information processing system, a method for speech processing, the method comprising:
- receiving an utterance;
computing a score based on the utterance, including evaluating states of a model of a keyword; and
indicating based on the score that the utterance appears to contain the keyword;
wherein, in the computing step, the score is computed without requiring that a model, of speech other than the keyword, be evaluated only at states corresponding to the evaluated states of the model of the keyword.
4 Assignments
0 Petitions
Accused Products
Abstract
An utterance verification system and method includes: a new formulation of log-likelihood ratio (LLR) that discriminates between true and mis-recognition scores; a new dynamic threshold setting that permits each keyword to have its own individual threshold; and/or use of higher resolution subword units for HMM based (Hidden Markov Model-based) utterance verification. The system and method are especially suited for automated processing of speech of syllable-based languages, for example, Chinese (for example, Mandarin or Cantonese).
58 Citations
18 Claims
-
1. In an information processing system, a method for speech processing, the method comprising:
-
receiving an utterance;
computing a score based on the utterance, including evaluating states of a model of a keyword; and
indicating based on the score that the utterance appears to contain the keyword;
wherein, in the computing step, the score is computed without requiring that a model, of speech other than the keyword, be evaluated only at states corresponding to the evaluated states of the model of the keyword.
-
-
2. The method of claim I wherein the computing step includes:
-
evaluating a state j of the model of the keyword for each timeslice t of multiple timeslices of the utterance;
evaluating a state k of a model, of speech other than the keyword, at the timeslice t, wherein the state k is chosen to maximize or minimize a value without requiring that the state k equal the state j. - View Dependent Claims (3)
-
-
4. A system for speech processing, comprising:
-
a processor;
a memory;
a model of a keyword;
a model of words other than the keyword; and
logic that directs the processor to read an utterance;
compute a score based on the utterance and on the model of the keyword and the model of words other than the keyword, and indicate that the utterance appears to include the keyword;
wherein the score is based on portions, of the model of words other than the keyword, that do not necessarily correspond to portions, of the model of the keyword, that were used to compute the score. - View Dependent Claims (5, 6)
-
-
7. In an information processing system, a method for speech processing comprising:
-
receiving an utterance;
for each of multiple keywords, computing a score based on the utterance for each of multiple keywords, comparing the score to a threshold, wherein the threshold for one of the multiple keywords need not be the same as the threshold for another of the multiple keywords; and
indicating based on result of the comparison that the utterance appears to contain the keyword. - View Dependent Claims (8)
-
-
9. A speech processing system, comprising:
-
a processor;
a memory;
logic that directs the processor to;
read an utterance;
for each of multiple keywords, compute a score based on the utterance and compare the score to a threshold;
wherein the threshold for one of the multiple keywords need not be the same as the threshold for another of the multiple keywords; and
indicating based on result of the compare that the utterance appears to contain a keyword. - View Dependent Claims (10)
-
-
11. In an information processing system, a method for processing speech of a language having a syllabic character set, comprising:
-
maintaining models of syllables of the language, wherein syllables corresponding to some characters of the character set are modeled using at least three subword units;
receiving an utterance;
computing scores based on the utterance and the models; and
indicating the detected existence of a word in the utterance based on the scores. - View Dependent Claims (12, 13, 14)
-
-
15. A speech processing system for performing recognition on speech of a language having a syllabic character set, the system comprising:
-
a processor;
a memory;
models of syllables of the language, wherein syllables corresponding to some characters of the character set are modeled using at least three subword units; and
logic that directs the processor to;
receive an utterance;
computing scores based on the utterance and the models; and
detecting existence of a word in the utterance based on the scores. - View Dependent Claims (16, 17, 18)
-
Specification