Homonym processing in the context of voice-activated command systems

US 7,181,387 B2
Filed: 09/07/2004
Issued: 02/20/2007
Est. Priority Date: 06/30/2004
Status: Active Grant

1 Assignment

0 Petitions

Abstract

A method is disclosed for constructing a grammar. The grammar is configured to be processed by a speech recognition engine in the context of a voice-activated command system. The method includes receiving a database containing a plurality of terms. From the plurality of terms, first and second terms are identified. The first and second terms are spelled differently but have a first pronunciation in common. One of the first and second terms also has a second pronunciation that is not inherent to the other of the first and second terms. The first and second pronunciations are placed within the grammar.

62 Citations

15 Claims

1. A method for constructing a grammar to be processed by a speech recognition engine in the context of a voice-activated command system, the method comprising:
- receiving a database containing a plurality of terms;
- identifying from said plurality a first term and a second term that are spelled differently but have a first pronunciation in common;
- wherein identifying that one of the first and second terms also has a second pronunciation that is not inherent to the other of the first and second terms; and
- placing the first and second pronunciations within the grammar.
- Dependent claims (2, 3, 4, 5, 6):
  - 2. The method of claim 1, wherein identifying a first term and a second term comprises obtaining a pronunciation for each of the plurality of terms in the database.
  - 3. The method of claim 2, wherein obtaining a pronunciation comprises obtaining a pronunciation from a speech recognition dictionary.
  - 4. The method of claim 2, wherein obtaining a pronunciation comprises obtaining a pronunciation from an application dictionary.
  - 5. The method of claim 2, wherein identifying a first and second term comprises organizing the plurality of terms into a plurality of pronunciation classes, wherein each pronunciation class corresponds to a distinct pronunciation, wherein both of the first and second terms are included in a same pronunciation class, and wherein one of the first and second terms is included in a class that the other is not in.
  - 6. The method of claim 2, wherein identifying a first term and a second term comprises identifying a first term and second term having quasi-homonym characteristics.
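
The grammar construction recited in claims 1 to 6 can be pictured with a short sketch. This is an illustration only, not the patented implementation: the names `PRONUNCIATIONS`, `build_pronunciation_classes`, and `find_partial_homonyms` are hypothetical, the phoneme strings are made-up placeholders standing in for a speech recognition or application dictionary, and the grammar is modeled as a plain mapping from pronunciation to spellings.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical dictionary lookup: term -> pronunciations (illustrative
# phoneme strings, assumed for this sketch; not data from the patent).
PRONUNCIATIONS = {
    "Stephen": ["s t iy v ah n", "s t eh f ah n"],
    "Steven": ["s t iy v ah n"],
    "Alice": ["ae l ah s"],
}


def build_pronunciation_classes(terms, lookup):
    """Organize terms into classes, one class per distinct pronunciation."""
    classes = defaultdict(set)
    for term in terms:
        for pron in lookup.get(term, []):
            classes[pron].add(term)
    return dict(classes)


def find_partial_homonyms(classes):
    """Find term pairs that share a class while one term is also in a
    class the other is not in."""
    prons_of = defaultdict(set)
    for pron, members in classes.items():
        for term in members:
            prons_of[term].add(pron)
    found = []
    for a, b in combinations(sorted(prons_of), 2):
        shared = prons_of[a] & prons_of[b]
        unshared = prons_of[a] ^ prons_of[b]
        if shared and unshared:
            found.append((a, b, sorted(shared), sorted(unshared)))
    return found


classes = build_pronunciation_classes(PRONUNCIATIONS, PRONUNCIATIONS)
pairs = find_partial_homonyms(classes)

# Place every pronunciation in the grammar, keeping track of which
# spellings each one can stand for.
grammar = {pron: sorted(members) for pron, members in classes.items()}
```

In this toy data the shared pronunciation maps to both spellings while the second pronunciation maps to one spelling only, which is the distinction the later disambiguation claims rely on.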

7. A computer-implemented method for accomplishing disambiguation in the context of a voice-dialing system, the method comprising:
- providing an input to a speech recognition engine for processing relative to a grammar that corresponds to a database containing a plurality of terms;
- including in the grammar a pair of pronunciations that correspond to a pair of terms from said plurality that are spelled differently, one of said pronunciations being a pronunciation shared by the pair of terms and the other being unique to one term in the pair of terms;
- receiving from the speech recognition engine an output corresponding to one pronunciation from the pair of pronunciations;
- determining, based at least in part on the output, whether or not to use both in the pair of terms as a basis for disambiguation; and
- utilizing both or one of the pair of terms as a basis for disambiguation.
- Dependent claims (8, 9, 10, 11, 12, 13, 14):
  - 8. The method of claim 7, wherein utilizing comprises utilizing both terms in the pair of terms when the output corresponds to a pronunciation shared by the pair.
  - 9. The method of claim 7, wherein utilizing comprises utilizing one term from the pair of terms when the output corresponds to one that is unique to one term from the pair of terms.
  - 10. The method of claim 7, wherein providing an input to a speech recognition engine comprises receiving a speech input and providing a representation of the speech input to the speech recognition engine.
  - 11. The method of claim 10, wherein receiving a speech input comprises receiving a speech input from a caller.
  - 12. The method of claim 11, wherein receiving a speech input comprises receiving a spoken name.
  - 13. The method of claim 7, wherein receiving an output comprises receiving a textual representation of a person's name.
  - 14. The method of claim 7, wherein utilizing both of the pair of terms as a basis for disambiguation comprises:
    - presenting a spelling of each of the pair of terms to a caller;
    - receiving an input from the caller corresponding to one of the pair of terms; and
    - selecting for subsequent processing the one of the pair of terms corresponding to the input.
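
The disambiguation flow of claims 7 to 14 can be sketched in the same spirit. Everything here is an assumption made for illustration: `GRAMMAR`, `disambiguate`, and the `ask_caller` callback are hypothetical names, the phoneme strings are placeholders, and the recognizer output is modeled as a pronunciation string rather than the output of any real engine.

```python
# Hypothetical grammar: recognized pronunciation -> candidate spellings.
# The phoneme strings are illustrative placeholders, not patent data.
GRAMMAR = {
    "s t iy v ah n": ["Stephen", "Steven"],  # shared by both terms
    "s t eh f ah n": ["Stephen"],            # unique to one term
}


def disambiguate(recognized_pronunciation, grammar, ask_caller):
    """Pick one term for a recognized pronunciation.

    ask_caller is a hypothetical callback taking (prompt, candidates)
    and returning the caller's choice.
    """
    candidates = list(grammar.get(recognized_pronunciation, []))
    if not candidates:
        return None
    if len(candidates) == 1:
        # Pronunciation unique to one term: no question is needed.
        return candidates[0]
    # Shared pronunciation: present a spelling of each term to the caller.
    options = [f"{name}, spelled {'-'.join(name.upper())}" for name in candidates]
    prompt = "Did you mean " + " or ".join(options) + "?"
    choice = ask_caller(prompt, candidates)
    # Select the chosen term only if it is one of the presented terms.
    return choice if choice in candidates else None


prompts = []


def scripted_caller(prompt, candidates):
    """Stand-in for a real caller: records the prompt, picks 'Steven'."""
    prompts.append(prompt)
    return "Steven"


unique_result = disambiguate("s t eh f ah n", GRAMMAR, scripted_caller)
shared_result = disambiguate("s t iy v ah n", GRAMMAR, scripted_caller)
```

Only the shared pronunciation triggers a question to the caller; the unique pronunciation resolves directly to a single term.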

15. A speech recognition system comprising:
- a context free grammar that includes a representation of a plurality of database terms including a pair of pronunciations that correspond to a pair of terms from said plurality that are spelled differently, one of said pronunciations being a pronunciation shared by the pair of terms and the other being unique to one of the pair of terms; and
- a speech recognition engine that utilizes the context free grammar as a basis for identifying a voice-activated command.

Current Assignee: Microsoft Technology Licensing LLC (Microsoft Corporation)
Original Assignee: Microsoft Corporation
Inventors: Ju, Yun-Cheng, Bhatia, Siddharth, Ollason, David
Primary Examiner(s): Hudspeth; David
Assistant Examiner(s): Sked; Matthew J
Application Number: US10/935,679
Publication Number: US 20060004572A1
Time in Patent Office: 896 Days
Field of Search: None
US Class Current: 704/9
CPC Class Codes:
G10L 15/06 Creation of reference templ...
G10L 15/187 Phonemic context, e.g. pron...