Recognition of a speech utterance available in spelled form

US 7,006,971 B1
Filed: 09/18/2000
Issued: 02/28/2006
Est. Priority Date: 09/17/1999
Status: Expired due to Fees

First Claim

1. A method of recognizing a speech utterance (s) available in spelled form, comprising:

a first processing stage in which a corresponding letter sequence (r) is estimated by means of a letter speech recognition unit (2) based on hidden Markov Models, said letter speech recognition unit not using a letter grammar which denotes probabilities of the occurrence of different possible letter combinations; and

a second processing stage (3) in which the estimated result (r) produced by the first processing stage utilizing a statistical letter sequence model (4) and a statistical model (5) for the speech recognition unit (2) is post-processed, wherein a dynamic programming method is used during the post-processing wherein a grid structure on which the dynamic programming is based and whose node points are provided for the assignment to accumulated probability values, is converted into a tree structure and an A* algorithm is used for finding an optimum tree path.

Abstract

This invention relates to a method of recognizing a speech utterance (s) available in spelled form, comprising a first processing stage in which a corresponding letter sequence (r) is estimated by means of a letter speech recognition unit (2) based on hidden Markov models, and a second processing stage (3) in which the estimated result (r) produced by the first processing stage is post-processed utilizing a statistical letter sequence model (4) and a statistical model (5) for the speech recognition unit (2), a dynamic programming method being used during the post-processing. To provide robust and efficient speech recognition procedures for system control by means of speech signals, a grid structure on which the dynamic programming is based, and whose node points are provided for the assignment to accumulated probability values, is converted into a tree structure, and an A* algorithm is used for finding an optimum tree path. The invention also relates to a speech control device in which a complete word is input as a control signal and at least part of this word is also input in spelled form, the result of the letter speech recognition being used within the scope of the word speech recognition.
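The grid mentioned in the abstract can be pictured with a small sketch. This is an illustration only, not the patented implementation: the function names, the toy probabilities, and the choice of a best-alignment (maximum) score are all assumptions. Each node point (i, j) holds the accumulated log score of explaining the first j recognized letters by the first i intended letters, allowing substitutions, deletions and insertions.

```python
import math

# Toy values, assumed for illustration only (not a normalized model).
MATCH, MISMATCH = 0.9, 0.004   # recognizer outputs the right / a wrong letter
P_INS, P_DEL = 0.01, 0.01      # spurious extra letter / dropped letter


def confusion(spoken, heard):
    """Hypothetical stand-in for the statistical model of the recognizer."""
    return MATCH if spoken == heard else MISMATCH


def alignment_log_score(intended, recognized):
    """Best-alignment log score of `recognized` given `intended`.

    grid[i][j] is the accumulated log score at node point (i, j).
    """
    rows, cols = len(intended) + 1, len(recognized) + 1
    grid = [[-math.inf] * cols for _ in range(rows)]
    grid[0][0] = 0.0
    for i in range(rows):
        for j in range(cols):
            if i > 0 and j > 0:  # substitution or correct letter
                step = math.log(confusion(intended[i - 1], recognized[j - 1]))
                grid[i][j] = max(grid[i][j], grid[i - 1][j - 1] + step)
            if i > 0:            # intended letter missing from the output
                grid[i][j] = max(grid[i][j], grid[i - 1][j] + math.log(P_DEL))
            if j > 0:            # extra letter in the output
                grid[i][j] = max(grid[i][j], grid[i][j - 1] + math.log(P_INS))
    return grid[-1][-1]
```

Under these toy numbers, `alignment_log_score("ABC", "AC")` scores the alignment in which the middle letter was dropped. A full post-processor would combine such a score with a letter sequence model and search over many candidate letter sequences.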

5 Claims

1. A method of recognizing a speech utterance (s) available in spelled form, comprising:
- a first processing stage in which a corresponding letter sequence (r) is estimated by means of a letter speech recognition unit (2) based on hidden Markov Models, said letter speech recognition unit not using a letter grammar which denotes probabilities of the occurrence of different possible letter combinations; and
  
  a second processing stage (3) in which the estimated result (r) produced by the first processing stage utilizing a statistical letter sequence model (4) and a statistical model (5) for the speech recognition unit (2) is post-processed, wherein a dynamic programming method is used during the post-processing wherein a grid structure on which the dynamic programming is based and whose node points are provided for the assignment to accumulated probability values, is converted into a tree structure and an A* algorithm is used for finding an optimum tree path.
  - 2. The method as claimed in claim 1, wherein sub-optimum tree paths corresponding to N best estimates are determined for a speech utterance input with N>1.
  - 3. The method as claimed in claim 1, wherein during the search for an optimum tree path those tree paths that at the beginning of the search have a small probability are not searched.
  - 4. The method as claimed in claim 3, wherein the first processing stage is executed by means of a first IC and the second processing stage is executed by means of a second IC.
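
The tree search with N best estimates recited in claims 1 and 2 can be sketched as follows. This is a minimal sketch under loud assumptions, not the claimed implementation: the three-letter alphabet, the toy bigram letter model, the toy confusion model, and all names are hypothetical, and the channel is simplified to substitutions only (no insertions or deletions) so that each tree level corresponds to one recognized letter. Costs are negative log probabilities. The heuristic sums the cheapest possible cost of every remaining position, so it never overestimates.

```python
import heapq
import math

ALPHABET = "ABC"   # toy alphabet (assumption)
START = "^"        # hypothetical start-of-sequence symbol


def lm_cost(prev, cur):
    """Toy letter sequence model: doubled letters are less likely."""
    if prev == START:
        return -math.log(1.0 / len(ALPHABET))
    return -math.log(0.2 if prev == cur else 0.4)


def channel_cost(spoken, heard):
    """Toy recognizer model: 0.8 for the right letter, 0.1 for each wrong one."""
    return -math.log(0.8 if spoken == heard else 0.1)


def sequence_cost(candidate, recognized):
    """Total cost of one complete candidate (same length as `recognized`)."""
    cost, prev = 0.0, START
    for letter, heard in zip(candidate, recognized):
        cost = cost + lm_cost(prev, letter) + channel_cost(letter, heard)
        prev = letter
    return cost


def a_star_n_best(recognized, n_best=1):
    """Return up to n_best (cost, letters) pairs in order of increasing cost."""
    n = len(recognized)
    best_lm = min(lm_cost(p, c) for p in START + ALPHABET for c in ALPHABET)
    # remaining[d]: optimistic cost of positions d..n-1 (admissible heuristic)
    remaining = [0.0] * (n + 1)
    for d in range(n - 1, -1, -1):
        best_ch = min(channel_cost(l, recognized[d]) for l in ALPHABET)
        remaining[d] = remaining[d + 1] + best_ch + best_lm
    frontier = [(remaining[0], 0.0, "")]   # entries: (g + h, g, prefix)
    results = []
    while frontier and len(results) < n_best:
        _, g, prefix = heapq.heappop(frontier)
        d = len(prefix)
        if d == n:                          # complete tree path reached
            results.append((g, prefix))
            continue
        prev = prefix[-1] if prefix else START
        for letter in ALPHABET:             # expand child nodes of the tree
            g2 = g + lm_cost(prev, letter) + channel_cost(letter, recognized[d])
            heapq.heappush(frontier, (g2 + remaining[d + 1], g2, prefix + letter))
    return results
```

Because every prefix is a distinct tree node, no closed list is needed. The pruning of claim 3 could be approximated by discarding prefixes whose cost exceeds a threshold, at the price of no longer guaranteeing the optimum.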

5. A method of system control by means of speech signals (w,s) comprising the steps of;
- inputting a whole word (w) serving as a control signal and at least part of this word is input in spelled form (s), recognizing the whole word (w) that is input using word speech recognition (7) and letter speech recognition (1) for recognizing the spelled part (s), the letter speech recognition comprising;
  
  a first processing stage in which a corresponding letter sequence (r) is estimated by means of a letter speech recognition unit (2) based on hidden Markov Models, said letter speech recognition unit not using a letter grammar which denotes probabilities of the occurrence of different possible letter combinations; and
  
  a second processing stage (3) in which the estimated result (r) produced by the first processing stage utilizing a statistical letter sequence model (4) and a statistical model (5) for the speech recognition unit (2) is post-processed, wherein a dynamic programming method is used during the post-processing, wherein a grid structure on which the dynamic programming is based and whose node points are provided for the assignment to accumulated probability values, is converted into a tree structure and an A* algorithm is used for finding an optimum tree path; and
  
  restricting a vocabulary assigned to the word speech recognition (7) to the recognition results of the letter speech recognition (1).
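
The vocabulary restriction in the last step of claim 5 might look like the sketch below. It is illustrative only: the function names, the example word scores, and the assumption that the spelled part consists of the initial letters of the word are hypothetical and not taken from the patent.

```python
def restrict_vocabulary(vocabulary, letter_hypotheses):
    """Keep only words whose spelling starts with one of the letter hypotheses.

    Assumption: the spelled part (s) is a prefix of the whole word (w).
    """
    prefixes = tuple(h.upper() for h in letter_hypotheses)
    return [word for word in vocabulary if word.upper().startswith(prefixes)]


def recognize_word(word_scores, letter_hypotheses):
    """Pick the best-scoring word among those the letter recognition allows.

    `word_scores` stands in for the output of a word speech recognizer.
    """
    allowed = restrict_vocabulary(word_scores, letter_hypotheses)
    return max(allowed, key=word_scores.get) if allowed else None


# Made-up scores: without the restriction, "BONN" would be chosen.
example_scores = {"BONN": 0.40, "AACHEN": 0.25, "AALEN": 0.30, "BERLIN": 0.05}
example_result = recognize_word(example_scores, ["AA", "AH"])
```

Here the N best letter sequences from the letter recognition act as a filter, so the word recognizer only has to discriminate between the few remaining entries.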


Current Assignee: Koninklijke Philips Electronics N.V. (Koninklijke Philips N.V.)
Original Assignee: Koninklijke Philips Electronics N.V. (Koninklijke Philips N.V.)
Inventors: Fischer, Alexander; Stahl, Volker
Primary Examiner(s): Dorvil, Richemond
Assistant Examiner(s): Storm, Donald L.
Application Number: US09/663,585
Time in Patent Office: 1,989 Days
Field of Search: 704/242, 704/241, 704/251, 704/252, 704/254, 704/257
US Class Current: 704/242
CPC Class Codes:
G10L 15/08   Speech classification or se...
G10L 15/12   using dynamic programming t...
G10L 15/197   Probabilistic grammars, e.g...
G10L 2015/086   Recognition of spelled words
