Speech recognition training method

US 4,718,088 A
Filed: 03/27/1984
Issued: 01/05/1988
Est. Priority Date: 03/27/1984
Status: Expired due to Term

First Claim

Patent Images

1. In a speech recognition apparatus wherein speech units are each characterized by a sequence of template patterns, and havingmeans for processing a speech input signal for repetitively deriving therefrom, at a frame repetition rate, a plurality of speech recognition acoustic parameters, andmeans responsive to said acoustic parametersfor generating likelihood costs between said acoustic parameters and said speech template patterns, andfor processing said likelihood costs for determining the speech units in said speech input signal,a method for generating said template patterns comprising the steps offinding the beginning and end of an input speech unit surrounded by silence for which template patterns are to be generated, andgenerating in accordance with a known procedure, template patterns representing said speech unit,said finding step comprisingmodelling silence as a template pattern,for each frame, comparing said silence template pattern likelihood cost with a fixed reference threshold value, anddeclaring the beginning of said speech unit when the score for the silence template pattern crosses the threshold value.

View all claims

6 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A speech recognition method and apparatus employ a speech processing circuitry for repetitively deriving from a speech input, at a frame repetition rate, a plurality of acoustic parameters. The acoustic parameters represent the speech input signal for a frame time. A plurality of template matching and cost processing circuitries are connected to a system bus, along with the speech processing circuitry, for determining, or identifying, the speech units in the input speech, by comparing the acoustic parameters with stored template patterns. The apparatus can be expanded by adding more template matching and cost processing circuitry to the bus thereby increasing the speech recognition capacity of the apparatus. Template pattern generation is advantageously aided by using a "joker" word to specify the time boundaries of utterances spoken in isolation, by finding the beginning and ending of an utterance surrounded by silence.

Citations

7 Claims

1. In a speech recognition apparatus wherein speech units are each characterized by a sequence of template patterns, and havingmeans for processing a speech input signal for repetitively deriving therefrom, at a frame repetition rate, a plurality of speech recognition acoustic parameters, andmeans responsive to said acoustic parametersfor generating likelihood costs between said acoustic parameters and said speech template patterns, andfor processing said likelihood costs for determining the speech units in said speech input signal,a method for generating said template patterns comprising the steps offinding the beginning and end of an input speech unit surrounded by silence for which template patterns are to be generated, andgenerating in accordance with a known procedure, template patterns representing said speech unit,said finding step comprisingmodelling silence as a template pattern,for each frame, comparing said silence template pattern likelihood cost with a fixed reference threshold value, anddeclaring the beginning of said speech unit when the score for the silence template pattern crosses the threshold value.
- View Dependent Claims (2, 3, 6, 7)
- - 2. The speech recognition template pattern generating method of claim 1 further comprising the step ofdeclaring the end of said speech unit when the score for the silence template improves sufficiently to cross a second threshold value.
  - 3. The speech recognition template pattern generating method of claim 2 wherein said second threshold value is less than said first threshold value.
  - 6. The speech recognition template pattern generating method of claim 1 further comprising the step ofassociating with an intermediate node to which said first arc leads, a second arc having a second fixed reference threshold value, andassociating with a second intermediate node to which said second arc leads, a third arc corresponding to the likelihood score of silence, anddetermining when said score at a node to which said third arc leads matches a predetermined condition.
  - 7. The speech recognition template pattern generating method of claim 6 wherein said second threshold value is less than said first threshold value.

4. In a speech recognition apparatus wherein speech units are each characterized by a sequence of template patterns, and havingmeans for processing a speech input signal for repetitively deriving therefrom, at a frame repetition rate, a plurality of speech recognition acoustic parameters, andmeans responsive to said acoustic parametersfor generating likelihood costs between said acoustic parameters and said speech template patterns, andfor processing said likelihood costs for determining the speech units in said speech input signal,a method for generating said template patterns comprising the steps offinding the beginning and end of an input speech unit surrounded by silence for which template patterns are to be generated, andgenerating in accordance with a known procedure, template patterns representing said speech unit,said finding step comprisingmodelling silence as a template pattern,for each frame, comparing said silence template pattern likelihood cost with a fixed reference threshold value, anddeclaring the beginning of said speech unit when the score for the silence template pattern crosses the threshold value,declaring the end of said speech unit when the score for the silence template improves sufficiently to cross a second threshold value, andwherein said second threshold value is less than said first threshold value.

5. In a speech recognition apparatus wherein speech units are each characterized by a sequence of template patterns, and havingmeans for processing a speech input signal for repetitively deriving therefrom, at a frame repetition rate, a plurality of speech recognition acoustic parameters, andmeans responsive to said acoustic parametersfor generating likelihood costs between said acoustic parameters and said speech template patterns, andfor processing said likelihood costs for determining the speech units in said speech input signal,a method for generating said template patterns comprising the steps offinding, using a dynamic programming and a grammar graph, the beginning and end of an input speech unit surrounded by silence for which template patterns are to be generated, andgenerating in accordance with a known procedure, template patterns representing said speech unit,said finding step comprisingmodelling silence as a template pattern,associating from a beginning node of said grammar graph a first arc having a fixed reference threshold value, andassociating with said beginning node a silence self loop, and following said dynamic programming for determining the beginning of said speech unit.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Voice Industries Corporation
Original Assignee
Exxon Research and Engineering Company (Exxon Mobil Corporation)
Inventors
Lee, Chin-Hui, Ganesan, Kalyan, Klovstad, John W., Baker, James K.
Primary Examiner(s)
Kemeny, E. S. Matt

Application Number

US06/593,891
Time in Patent Office

1,379 Days
Field of Search

381/41-43, 364/513.5
US Class Current

704/241
CPC Class Codes

G10L 15/02   Feature extraction for spee...

G10L 15/063   Training

G10L 15/083   Recognition networks G10L15...

G10L 15/12   using dynamic programming t...

G10L 2015/0638   Interactive procedures

G10L 25/27   characterised by the analys...

G10L 25/87   Detection of discrete point...

Speech recognition training method

First Claim

6 Assignments

0 Petitions

Accused Products

Abstract

Citations

7 Claims

Specification

Solutions

Use Cases

Quick Links

Speech recognition training method

First Claim

6 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

7 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links