System and methods for acoustic and language modeling for automatic speech recognition with large vocabularies

US 6,928,404 B1
Filed: 03/17/1999
Issued: 08/09/2005
Est. Priority Date: 03/17/1999
Status: Expired due to Term

First Claim

Patent Images

1. A method for splitting words in a language vocabulary V in an automatic speech recognition system to provide vocabulary compression, wherein the vocabulary V has a fixed size, the method comprising the steps of:

(a) providing a fixed set of allowable endings, including an empty ending;

(b) providing a fixed set of constraints for splitting words into stems;

(c) initializing a split map of words and the corresponding stems and endings by setting a variable t to a predetermined value, and selecting a first word from the fixed vocabulary;

(d) randomly splitting the first word to generate an ending from the fixed list of allowable endings and a stem;

(e) defining and storing a stem set containing the stem generated at said splitting step (d) and a word set containing the first word;

(f) determining whether t is less than the size of the vocabulary V;

(g) obtaining a new word from the vocabulary V, when t is less than the size of the vocabulary V;

(h) determining possible splits for the new word to generate stems and endings therefrom, using the fixed set of allowable endings and the fixed set of constraints;

(i) determining whether there is a split for the new word that generates a previously stored stem of the stem set;

(j) splitting the current word into the previously stored stem and an ending of the set of allowable endings, when there is a split for the new word that generates the previously stored stem of the stem set;

(k) determining whether another previously stored stem in the stem set can be replaced by a new stem generated at step (h), when there is no split for the current word that generates the previously stored stem of the stem set;

(l) redefining the stem set and the split map to include the new stem generated at step (h) in place of the other previously stored stem, when the other previously stored stem can be replaced by the new stem, when the other previously stored stem can be replaced by the new stem generated at step (h);

(m) redefining the stem set to include any new stem into which the current word may be split and extending the split map to include the current word by splitting the new word into the new stem, when the other previously stored stem in the stem set cannot be replaced by the new stem generated at step (h); and

(n) incrementing t and returning to step (f) if t is less than the size of the vocabulary V.

View all claims

2 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

Systems and methods are provided for generating a language component vocabulary VC for a speech recognition system having a language vocabulary V of a plurality of word forms. One method for generating a language component vocabulary VC for a speech recognition system having a language vocabulary V of a plurality of word forms includes partitioning the language vocabulary V into subsets of word forms based on frequencies of occurrence of the respective word forms, in at least one the subsets, splitting word forms having frequencies less than a threshold to thereby generate word form components and generating a language component vocabulary VC including word forms and word form components. The resulting language component vocabulary, which includes word forms and word components, is used to generate a language model that can be efficiently implemented for real-time automatic speech recognition applications for languages with large vocabularies.

46 Citations

View as Search Results

7 Claims

1. A method for splitting words in a language vocabulary V in an automatic speech recognition system to provide vocabulary compression, wherein the vocabulary V has a fixed size, the method comprising the steps of:
- (a) providing a fixed set of allowable endings, including an empty ending;
  
  (b) providing a fixed set of constraints for splitting words into stems;
  
  (c) initializing a split map of words and the corresponding stems and endings by setting a variable t to a predetermined value, and selecting a first word from the fixed vocabulary;
  
  (d) randomly splitting the first word to generate an ending from the fixed list of allowable endings and a stem;
  
  (e) defining and storing a stem set containing the stem generated at said splitting step (d) and a word set containing the first word;
  
  (f) determining whether t is less than the size of the vocabulary V;
  
  (g) obtaining a new word from the vocabulary V, when t is less than the size of the vocabulary V;
  
  (h) determining possible splits for the new word to generate stems and endings therefrom, using the fixed set of allowable endings and the fixed set of constraints;
  
  (i) determining whether there is a split for the new word that generates a previously stored stem of the stem set;
  
  (j) splitting the current word into the previously stored stem and an ending of the set of allowable endings, when there is a split for the new word that generates the previously stored stem of the stem set;
  
  (k) determining whether another previously stored stem in the stem set can be replaced by a new stem generated at step (h), when there is no split for the current word that generates the previously stored stem of the stem set;
  
  (l) redefining the stem set and the split map to include the new stem generated at step (h) in place of the other previously stored stem, when the other previously stored stem can be replaced by the new stem, when the other previously stored stem can be replaced by the new stem generated at step (h);
  
  (m) redefining the stem set to include any new stem into which the current word may be split and extending the split map to include the current word by splitting the new word into the new stem, when the other previously stored stem in the stem set cannot be replaced by the new stem generated at step (h); and
  
  (n) incrementing t and returning to step (f) if t is less than the size of the vocabulary V.
- View Dependent Claims (2, 3, 4, 5, 6)
- - 2. The method of claim 1, further comprising the step of terminating the method if t is not less than the size of the fixed vocabulary.
  - 3. The method of claim 1, wherein said determining step (k) comprises the step of determining whether other words stored in the word set during previous iterations will remain split after such substitution.
  - 4. The method of claim 1, wherein the vocabulary is sorted such that the words in the language vocabulary V are numerated in descending order based on frequencies associated with each of the words.
  - 5. The method of claim 1, wherein step (j) further comprises the step of extending the split map to the new word.
  - 6. The method of claim 1, wherein step (i) generates all possible splits for the new word.

7. A program storage device readable by machine, tangibly embodying a program of instructions executable by the machine to perform method steps for splitting words in a language vocabulary V in an automatic speech recognition system to provide vocabulary compression, wherein the vocabulary V has a fixed size, the method comprising the steps of:
- (a) providing a fixed set of allowable endings, including an empty ending;
  
  (b) providing a fixed set of constraints for splitting words into stems;
  
  (c) initializing a split map of words and the corresponding stems and endings by setting a variable t to a predetermined value, and selecting a first word from the fixed vocabulary;
  
  (d) randomly splitting the first word to generate an ending from the fixed list of allowable endings and a stem;
  
  (e) defining and storing a stem set containing the stem generated at said splitting step (d) and a word set containing the first word;
  
  (f) determining whether t is less than the size of the vocabulary V;
  
  (g) obtaining a new word from the vocabulary V, when t is less than the size of the vocabulary V;
  
  (h) determining possible splits for the new word to generate stems and endings therefrom, using the fixed set of allowable endings and the fixed set of constraints;
  
  (i) determining whether there is a split for the new word that generates a previously stored stem of the stem set;
  
  (j) splitting the current word into the previously stored stem and an ending of the set of allowable endings, when there is a split for the new word that generates the previously stored stem of the stem set;
  
  (k) determining whether another previously stored stem in the stem set can be replaced by a new stem generated at step (h), when there is no split for the current word that generates the previously stored stem of the stem set;
  
  (l) redefining the stem set and the split map to include the new stem generated at step (h) in place of the other previously stored stem, when the other previously stored stem can be replaced by the new stem, when the other previously stored stem can be replaced by the new stem generated at step (h);
  
  (m) redefining the stem set to include any new stem into which the current word may be split and extending the split map to include the current word by splitting the new word into the new stem, when the other previously stored stem in the stem set cannot be replaced by the new stem generated at step (h); and
  
  (n) incrementing t and returning to step (f) if t is less than the size of the vocabulary V.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Nuance Communications, Inc. (Microsoft Corporation)
Original Assignee
International Business Machines Corporation
Inventors
Kanevsky, Dimitri, Gopalakrishnan, Ponani, Monkowski, Michael Daniel, Sedivy, Jan
Primary Examiner(s)
Dorvil, Richemond
Assistant Examiner(s)
SPOONER, LAMONT M

Application Number

US09/271,469
Time in Patent Office

2,337 Days
Field of Search

704/1, 704/4, 704/7, 704/9, 704/10, 704/231, 704/251
US Class Current

704/10
CPC Class Codes

G06F 40/237   Lexical tools

G10L 15/183   using context dependencies,...

G10L 15/197   Probabilistic grammars, e.g...

Y10S 707/99942   Manipulating data structure...

System and methods for acoustic and language modeling for automatic speech recognition with large vocabularies

First Claim

2 Assignments

0 Petitions

Accused Products

Abstract

46 Citations

7 Claims

Specification

Use Cases

Quick Links

Others

System and methods for acoustic and language modeling for automatic speech recognition with large vocabularies

First Claim

2 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

46 Citations

7 Claims

Specification

Subscription Required

Use Cases

Quick Links

Others