Keyword/non-keyword classification in isolated word speech recognition

US 5,440,662 A
Filed: 12/11/1992
Issued: 08/08/1995
Est. Priority Date: 12/11/1992
Status: Expired due to Term

Abstract

A two-pass classification system and method that post-processes HMM scores with additional confidence scores to derive a value that may be applied to a threshold on which a keyword versus non-keyword determination may be based. The first stage comprises Generalized Probabilistic Descent (GPD) analysis which uses feature vectors of the spoken words and the HMM segmentation information (developed by the HMM detector during processing) as inputs to develop a first set of confidence scores through a linear combination (a weighted sum) of the feature vectors of the speech. The second stage comprises a linear discrimination method that combines the HMM scores and the confidence scores from the GPD stage with a weighted sum to derive a second confidence score. The output of the second stage may then be compared to a predetermined threshold to determine whether the spoken word or words include a keyword.
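The two-pass scoring described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the function names, the weight values, and the choice of a single first-stage score are all assumptions made for the example, and the weights here are hand-picked rather than trained.

```python
from typing import Sequence


def weighted_sum(weights: Sequence[float], values: Sequence[float]) -> float:
    """Linear combination (dot product) of two equally sized sequences."""
    if len(weights) != len(values):
        raise ValueError("weights and values must have the same length")
    return sum(w * v for w, v in zip(weights, values))


def first_stage_score(gpd_weights: Sequence[float],
                      discriminating_vector: Sequence[float]) -> float:
    """Stage 1: a confidence score formed as a weighted sum of the input vector."""
    return weighted_sum(gpd_weights, discriminating_vector)


def second_stage_score(stage2_weights: Sequence[float],
                       hmm_score: float,
                       first_stage_scores: Sequence[float]) -> float:
    """Stage 2: weighted sum of the HMM score and the stage-1 confidence scores."""
    return weighted_sum(stage2_weights, [hmm_score, *first_stage_scores])


def is_keyword(score: float, threshold: float) -> bool:
    """Final decision: accept the keyword when the score exceeds the threshold."""
    return score > threshold


# Hypothetical numbers, chosen only to make the arithmetic easy to follow.
stage1 = first_stage_score([0.5, -0.25, 1.0], [2.0, 4.0, 1.0])  # 1.0 - 1.0 + 1.0
stage2 = second_stage_score([0.5, 2.0], -4.0, [stage1])         # -2.0 + 2.0
decision = is_keyword(stage2, threshold=-0.5)
```

With these illustrative values the first stage yields 1.0, the second stage yields 0.0, and the utterance is accepted because 0.0 exceeds the assumed threshold of -0.5.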

7 Claims

1. A method to establish whether a speech signal comprising digitized speech represents a keyword, said method comprising the steps of:
- transforming said digitized speech signal into feature vectors;
  
  processing said feature vectors in a Hidden Markov Model (HMM) keyword detector, said (HMM) keyword detector having output signals representing speech segmentation information and signals representing scores of a set of keywords compared to said digitized speech signal;
  
  forming a discriminating vector by deriving mean vectors from said feature vectors and concatenating said mean vectors with said segmentation information;
  
  non-linearly processing said discriminating vector to derive a first set of weighting factors, and linearly combining said feature vectors and said discriminating vector using said first set of weighting factors to develop a first set of confidence scores;
  
  processing said first set of confidence scores and said signals representing keyword scores from said HMM keyword detector with a second weighting factor to develop a second confidence score; and
  
  comparing said second confidence score to a threshold to determine whether a keyword has been detected.

2. A method in accordance with claim 1 wherein said first weighting factors are developed using general probabilistic descent training.

3. A method in accordance with claim 2 wherein said second weighting factor is developed using Fisher's linear discrimination training.
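The step of forming a discriminating vector in claim 1 can be illustrated with a short sketch. The claim does not say how the segmentation information is encoded, so this example assumes it arrives as hypothetical (start, end) frame ranges, one per HMM segment, and that the value concatenated with the mean vectors is each segment's duration in frames. Both choices are assumptions for illustration only.

```python
from typing import List, Sequence, Tuple


def mean_vector(frames: Sequence[Sequence[float]]) -> List[float]:
    """Component-wise mean of a non-empty run of feature vectors."""
    count = len(frames)
    if count == 0:
        raise ValueError("a segment must contain at least one frame")
    return [sum(column) / count for column in zip(*frames)]


def discriminating_vector(features: Sequence[Sequence[float]],
                          segments: Sequence[Tuple[int, int]]) -> List[float]:
    """Concatenate per-segment mean vectors with the segment durations.

    `segments` holds assumed (start, end) frame indices, end exclusive.
    """
    means: List[float] = []
    durations: List[float] = []
    for start, end in segments:
        means.extend(mean_vector(features[start:end]))
        durations.append(float(end - start))
    return means + durations


# Four 2-dimensional feature vectors split into two segments of two frames each.
features = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [7.0, 10.0]]
vector = discriminating_vector(features, [(0, 2), (2, 4)])
# vector == [2.0, 3.0, 6.0, 8.0, 2.0, 2.0]
```

The first four entries are the two segment means and the last two are the assumed duration features.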

4. A keyword detection apparatus that determines whether a digitized speech signal includes one of a preselected plurality of keywords, said apparatus comprising:
- means for receiving input signals representing digitized speech and developing a plurality of signals representing feature vectors of said digitized speech;
  
  means responsive to said input signals and said signals representing feature vectors of said digitized speech for developing segmentation information regarding said speech signals and a plurality of HMM keyword scores by comparing said speech signals to each of said preselected plurality of keywords;
  
  means for receiving said feature vectors and said segmentation information and combining them to determine a first set of confidence scores;
  
  means for receiving said HMM keyword scores and said first confidence scores and combining them to determine a second confidence score; and
  
  means for comparing said second confidence score against a threshold value for determining whether the keyword having the highest score is present in said input signals.

5. A keyword detection apparatus in accordance with claim 4 wherein said combining to determine a first set of confidence scores is based on general probabilistic descent training.

6. A keyword detection apparatus in accordance with claim 4 wherein said combining to determine a second confidence score is based on Fisher's linear discrimination training.
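Claim 6 refers to Fisher's linear discrimination training for the second-stage weights. As a rough sketch of the standard two-class Fisher discriminant, the weights are the inverse within-class scatter matrix applied to the difference of the class means. The example below assumes each training sample is a hypothetical pair (HMM score, first-stage confidence score); the two-dimensional input and the sample values are assumptions for illustration, not details taken from the patent.

```python
from typing import List, Sequence, Tuple

Sample = Tuple[float, float]


def class_mean(samples: Sequence[Sample]) -> List[float]:
    """Mean of a set of 2-D samples."""
    n = len(samples)
    return [sum(s[0] for s in samples) / n, sum(s[1] for s in samples) / n]


def scatter(samples: Sequence[Sample], mean: Sequence[float]) -> Tuple[float, float, float]:
    """Entries (sxx, sxy, syy) of the symmetric 2x2 scatter matrix."""
    sxx = sxy = syy = 0.0
    for x, y in samples:
        dx, dy = x - mean[0], y - mean[1]
        sxx += dx * dx
        sxy += dx * dy
        syy += dy * dy
    return sxx, sxy, syy


def fisher_weights(keyword: Sequence[Sample], non_keyword: Sequence[Sample]) -> List[float]:
    """Two-class Fisher discriminant: w = Sw^-1 (m_keyword - m_non_keyword)."""
    m1, m0 = class_mean(keyword), class_mean(non_keyword)
    a1, b1, c1 = scatter(keyword, m1)
    a0, b0, c0 = scatter(non_keyword, m0)
    a, b, c = a1 + a0, b1 + b0, c1 + c0  # Sw = [[a, b], [b, c]]
    det = a * c - b * b
    if det == 0:
        raise ValueError("within-class scatter matrix is singular")
    dx, dy = m1[0] - m0[0], m1[1] - m0[1]
    # Inverse of [[a, b], [b, c]] is (1 / det) * [[c, -b], [-b, a]].
    return [(c * dx - b * dy) / det, (a * dy - b * dx) / det]


# Hypothetical training pairs for each class.
keyword_samples = [(2.0, 2.0), (4.0, 2.0), (2.0, 4.0), (4.0, 4.0)]
non_keyword_samples = [(0.0, 0.0), (2.0, 0.0), (0.0, 2.0), (2.0, 2.0)]
weights = fisher_weights(keyword_samples, non_keyword_samples)
# weights == [0.25, 0.25]
```

Projecting the two class means onto these weights gives 1.5 for the keyword class and 0.5 for the non-keyword class, so a threshold between those values would separate the class means in this toy data.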

7. A method to establish whether a speech signal comprising digitized speech represents a keyword, said method comprising the steps of:
- processing said speech signal into a plurality of feature vectors;
  
  processing said speech signal by a Hidden Markov Model (HMM) keyword detector, said HMM keyword detector developing signals representing speech segmentation information and signals representing keyword scores of a set of keywords compared to said speech signal;
  
  forming a discriminating vector by deriving mean vectors from said feature vectors and concatenating said mean vectors with said segmentation information;
  
  non-linearly processing said discriminating vectors to develop a first plurality of weighting factors using general probabilistic descent training and linearly combining said feature vectors and said discriminating vector using said first plurality of weighting factors to develop a first set of confidence scores;
  
  processing said first confidence scores and said signals representing keyword scores from said HMM keyword detector with second weighting factors to develop a second confidence score, said second weighting factors being derived using Fisher's linear discrimination training; and
  
  comparing said second confidence score to a threshold to determine whether a keyword has been detected.
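Claim 7 names general probabilistic descent training as the way the first weighting factors are developed. The claims do not spell out the update rule, so the sketch below only shows the general idea usually associated with GPD: gradient descent on a sigmoid-smoothed misclassification measure of a linear discriminant. The single weight vector, the step size, the sigmoid slope, and the toy samples are all assumptions, and the patent's actual formulation may differ.

```python
import math
from typing import List, Sequence, Tuple


def sigmoid(z: float) -> float:
    return 1.0 / (1.0 + math.exp(-z))


def gpd_step(weights: Sequence[float], x: Sequence[float], is_keyword: bool,
             step: float = 0.5, slope: float = 1.0) -> List[float]:
    """One gradient step on a sigmoid-smoothed misclassification loss."""
    d = sum(w * v for w, v in zip(weights, x))   # linear discriminant value
    sign = -1.0 if is_keyword else 1.0           # sign * d > 0 means misclassified
    loss = sigmoid(slope * sign * d)             # smooth (non-linear) error count
    grad_scale = slope * loss * (1.0 - loss) * sign
    return [w - step * grad_scale * v for w, v in zip(weights, x)]


def train(samples: Sequence[Tuple[Sequence[float], bool]], dim: int,
          epochs: int = 50) -> List[float]:
    """Repeatedly apply gpd_step over the labeled samples, starting from zeros."""
    weights = [0.0] * dim
    for _ in range(epochs):
        for x, is_kw in samples:
            weights = gpd_step(weights, x, is_kw)
    return weights


# Toy data: first component is a constant bias term, second is one feature.
samples = [([1.0, 2.0], True), ([1.0, -2.0], False)]
trained = train(samples, dim=2)
keyword_score = sum(w * v for w, v in zip(trained, samples[0][0]))
non_keyword_score = sum(w * v for w, v in zip(trained, samples[1][0]))
```

After training on this toy data the keyword sample scores above zero and the non-keyword sample scores below zero. The sigmoid is what makes the training non-linear even though the resulting score is a linear combination.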

Current Assignee: Google LLC (Alphabet Inc.)
Original Assignee: AT&T Corporation (AT&T, Inc.)
Inventors: Sukkar, Rafid A.
Primary Examiner(s): MacDonald, Allen R.
Assistant Examiner(s): ONKA, THOMAS
Application Number: US07/989,299
Time in Patent Office: 970 Days
Field of Search: 395/2.45, 395/2.47, 395/2.49, 395/2.65
US Class Current: 704/236
CPC Class Codes: G10L 15/142 Hidden Markov Models [HMMs]; G10L 2015/088 Word spotting