Context sharing of similarities in context dependent word models

US 6,285,980 B1
Filed: 11/02/1998
Issued: 09/04/2001
Est. Priority Date: 11/02/1998
Status: Expired due to Term

First Claim

Patent Images

1. A method for automatic speech recognition comprising the steps of:

building a model for a vocabulary of sounds wherein at least two sounds share a common head or a common tail;

wherein said vocabulary of sounds includes subwords;

receiving an utterance containing at least one word;

processing the utterance into cepstral coefficients; and

recognizing at least one word in the utterance using said model.

View all claims

7 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A natural number recognition method and system that uses minimum classification error trained inter-word context dependent models of the head-body-tail type over a specific vocabulary. One part of the method and system allows recognition of spoken monetary amounts in financial transactions. A second part of the method and system allows recognition of numbers such as credit card or U.S. telephone numbers. A third part of the method and system allows recognition of natural language expressions of time, such as time of day, day of the week and date of the month for applications such as scheduling or schedule inquires. Even though limited natural language expressions are allowed, context sharing between similar sounds in the vocabulary within a head-body-tail model keeps storage and processing time requirements to manageable levels.

Citations

22 Claims

1. A method for automatic speech recognition comprising the steps of:
- building a model for a vocabulary of sounds wherein at least two sounds share a common head or a common tail;
  
  wherein said vocabulary of sounds includes subwords;
  
  receiving an utterance containing at least one word;
  
  processing the utterance into cepstral coefficients; and
  
  recognizing at least one word in the utterance using said model.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
- - 2. The method of claim 1, wherein said vocabulary of sounds includes models of words, each model consisting of a head, body, and tail portion.
  - 3. The method of claim 1, wherein said vocabulary includes time of day words.
  - 4. The method of claim 1, wherein said model includes digit words ‘
    - zero’
      
      , ‘
      
      oh’
      
      , ‘
      
      one’
      
      , ‘
      
      two’
      
      , ‘
      
      three’
      
      , ‘
      
      four’
      
      , ‘
      
      five’
      
      , ‘
      
      six’
      
      , ‘
      
      seven’
      
      , ‘
      
      eight’
      
      , and ‘
      
      nine’
      
      .
  - 5. The method of claim 4, wherein said model also includes numbers ‘
    - ten,’ and
      
      larger.
  - 6. The method of claim 1, wherein said model includes months of the year, ‘
    - January’
      
      through ‘
      
      December.’
  - 7. The method of claim 6, where months with similar endings share common tail models.
  - 8. The method of claim 1, wherein said model of sounds includes numbers.
  - 9. The method of claim 8, wherein numbers with similar endings share tail models.
  - 10. The method of claim 8, wherein numbers with similar beginnings share head models.
  - 11. The method of claim 1, wherein said model includes days of the week, ‘
    - Monday’
      
      through ‘
      
      Friday.’
  - 12. The method of claim 11, wherein said models for days of the week share contexts for the ‘
    - day’
      
      portion of the word.

13. A method for automatic speech recognition comprising the steps of:
- receiving an utterance containing at least one word of a vocabulary of words;
  
  processing the utterance into cepstral coefficients;
  
  separating the utterance into a plurality of words;
  
  separating at least one of said plurality of words into a head portion, a body portion and a tail portion;
  
  recognizing at least one word from the vocabulary using said head portion, said body portion and said tail portion.
- View Dependent Claims (14, 15, 16)
- - 14. The method of claim 13, wherein said vocabulary includes group of time of day words.
  - 15. The method of claim 14, wherein said vocabulary includes digit words ‘
    - zero’
      
      , ‘
      
      oh’
      
      , ‘
      
      one’
      
      , ‘
      
      two’
      
      , ‘
      
      three’
      
      , ‘
      
      four’
      
      , ‘
      
      five’
      
      , ‘
      
      six’
      
      , ‘
      
      seven’
      
      , ‘
      
      eight’
      
      , and ‘
      
      nine’
      
      .
  - 16. The method of claim 13, wherein said vocabulary includes digit words ‘
    - zero’
      
      , ‘
      
      oh’
      
      , ‘
      
      one’
      
      , ‘
      
      two’
      
      , ‘
      
      three’
      
      , ‘
      
      four’
      
      , ‘
      
      five’
      
      , ‘
      
      six’
      
      , ‘
      
      seven’
      
      , ‘
      
      eight’
      
      , and ‘
      
      nine’
      
      .

17. A method for automatic speech recognition comprising the steps of:
- receiving an utterance containing at least one digit word and at least one non-digit word;
  
  processing the utterance into cepstral coefficients;
  
  separating the utterance into a plurality of words;
  
  separating at least one of said plurality of words into a head portion, a body portion and a tail portion;
  
  recognizing said at least one word using a vocabulary for numbers, dates and times of day.
- View Dependent Claims (18, 19, 20, 21, 22)
- - 18. The method of claim 17, wherein for the pronunciation of numbers less than one million, (unless the word million is included in the vocabulary) said plurality of contexts includes:
19. The method of claim 17, wherein for scheduling said plurality of contexts includes:
- a shared context of months ending in letters ‘
  
  ber’
  
  a shared context of days of the week ending in the letters ‘
  
  day’
  
  a shared context of numbers ending in letters ‘
  
  teen’
  
  a shared context of numbers ending in letters ‘
  
  ty’
  
  , and a shared context of numbers beginning in letters ‘
  
  seven’
  
  .
20. The method of claim 17, wherein said vocabulary words comprise spoken numbers.
21. The method of claim 17, wherein said vocabulary words comprise spoken dates.
22. The method of claim 17, wherein said vocabulary words comprise spoken times.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
WSOU Investments, LLC (WSOU Holdings, LLC)
Original Assignee
Lucent Technologies, Inc. (Nokia Corporation)
Inventors
Jacob, John, Gandhi, Malan Bhatki
Primary Examiner(s)
Dorvil, Richemond
Assistant Examiner(s)
NOLAN, DANIEL A

Application Number

US09/184,620
Time in Patent Office

1,037 Days
Field of Search

704/251, 704/252, 704/253-256, 704/9, 704/10
US Class Current

704/256
CPC Class Codes

G10L 15/142 Hidden Markov Models [HMMs]

G10L 15/18 using natural language mode...

Context sharing of similarities in context dependent word models

First Claim

7 Assignments

0 Petitions

Accused Products

Abstract

Citations

22 Claims

Specification

Solutions

Use Cases

Quick Links

Context sharing of similarities in context dependent word models

First Claim

7 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

22 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links