Speech recognition system

US 4,513,436 A
Filed: 02/23/1984
Issued: 04/23/1985
Est. Priority Date: 09/16/1980
Status: Expired due to Term

First Claim

Patent Images

1. A method of recognizing speech wherein a reference feature vector system is partitioned in memory into a reference first portion of feature vectors, which has a constant time duration independent of a speaker, and a reference second portion of feature vectors, which has a time duration dependent on a speaker, and said reference feature vector system is compared to unknown speech having an unknown first portion of feature vectors and an unknown second portion of feature vectors, comprising the steps of:

(a) locating a first portion of feature vectors in said reference feature vector system,(b) locating unwarped candidate first portions in said unknown speech by shifting said reference first portion through said unknown speech and comparing said reference first portion with said unknown speech,(c) matching said reference first portion with one of said candidate first portions in said unknown speech; and

(d) matching said reference second portion with said unknown second portion by linearly designating each feature vector of said unknown second portion to a feature vector in said reference second portion.

View all claims

0 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

Speech recognition with time warp is simplified by finding a certain portion of a word whose time duration is the same for all speakers. In comparing an unknown speech with a reference speech, the time duration of an unknown speech is coincided with the time length of a reference speech with the two processes. According to the invention, an element vector of a speech is classified to the first portion and the second portion. The former is a consonant and co-articulation which couples the two sounds, and the latter is a vowel. The length of the first portion is almost independent from a speaker, and the length of the second portion depends upon a speaker. Therefore, the present invention matches the first portion of an unknown speech with that of the reference speech directly without changing the time length. Next, the sample elements in the second portion of the unknown speech is linearly matched with that of a reference speech. Thus, excellent recognition is obtained using a simple calculation.

14 Citations

3 Claims

1. A method of recognizing speech wherein a reference feature vector system is partitioned in memory into a reference first portion of feature vectors, which has a constant time duration independent of a speaker, and a reference second portion of feature vectors, which has a time duration dependent on a speaker, and said reference feature vector system is compared to unknown speech having an unknown first portion of feature vectors and an unknown second portion of feature vectors, comprising the steps of:
- (a) locating a first portion of feature vectors in said reference feature vector system,(b) locating unwarped candidate first portions in said unknown speech by shifting said reference first portion through said unknown speech and comparing said reference first portion with said unknown speech,(c) matching said reference first portion with one of said candidate first portions in said unknown speech; and
  
  (d) matching said reference second portion with said unknown second portion by linearly designating each feature vector of said unknown second portion to a feature vector in said reference second portion.
- View Dependent Claims (2, 3)
- - 2. The method of claim 1 wherein the comparing of step (b) includes the step of computing for each candidate first portion a summed length which is the sum of the lengths between each feature vector in the candidate first portion and each feature vector in said reference first portion.
  - 3. The method of claim 2 wherein the matching of step (c) includes the step of selecting the candidate first portion having the minimum summed length among all of said summed lengths, thereby creating a match between the selected candidate in the unknown speech and the reference first portion.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
OKI Electric Industry Company Limited
Original Assignee
OKI Electric Industry Company Limited
Inventors
Umehara, Akihiko, Nose, Isamu
Primary Examiner(s)
Kemeny, E. S. Matt

Application Number

US06/582,134
Time in Patent Office

425 Days
Field of Search

364/513, 364/513.5, 381/41-45
US Class Current

704/243
CPC Class Codes

G10L 15/12 using dynamic programming t...

Speech recognition system

First Claim

0 Assignments

0 Petitions

Accused Products

Abstract

14 Citations

3 Claims

Specification

Solutions

Use Cases

Quick Links

Speech recognition system

First Claim

0 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

14 Citations

3 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links