System and method for generating a phrase pronunciation
9 Assignments
0 Petitions
Abstract
A system and method for speech recognition that allow language models to be customized by adding special pronunciations for components of phrases. These pronunciations are added to the factory language models during customization. Components of a phrase can thereby have different pronunciations inside customer-added phrases than those specified for the same components in isolation in the factory language models.
86 Citations
17 Claims
1. A method of adding phrase pronunciations to a language model, the method comprising the steps of:
  generating a list of pron components whose pronunciations differ when they occur in a phrase;
  assigning at least one pron to each pron component;
  determining the pronunciation of a first phrase by the steps of:
    tokenizing the first phrase by generating a list of tokens corresponding to the first phrase;
    determining a pron for each of the list of tokens;
    assembling the pronunciation of the first phrase based upon a combination of each said pron; and
  adding the first phrase and the pronunciation of the first phrase to the language model.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
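The steps of claim 1 can be illustrated with a minimal Python sketch. Everything named here is hypothetical and not taken from the patent: the `PRON_COMPONENTS` table, the `LanguageModel` class, the whitespace tokenizer, and the ARPAbet-style phoneme strings are illustrative only. The sketch also assumes that "a combination of each said pron" means concatenating one pron per token, and that every combination is kept when a component has more than one pron.

```python
from dataclasses import dataclass, field
from itertools import product

# Hypothetical pron components: tokens whose pronunciation inside a phrase
# differs from their isolated pronunciation. Each has at least one pron.
PRON_COMPONENTS = {
    "dr": ["D AA K T ER"],
    "st": ["S EY N T", "S T R IY T"],
}


@dataclass
class LanguageModel:
    """Toy language model (an assumption): isolated-word prons plus added phrases."""
    word_prons: dict = field(default_factory=dict)
    phrase_prons: dict = field(default_factory=dict)


def tokenize(phrase):
    """Split on whitespace, drop surrounding periods/commas, lowercase."""
    tokens = (raw.strip(".,").lower() for raw in phrase.split())
    return [t for t in tokens if t]


def prons_for_token(token, model):
    """Prefer the in-phrase pron; fall back to the model's isolated-word pron."""
    if token in PRON_COMPONENTS:
        return PRON_COMPONENTS[token]
    if token in model.word_prons:
        return model.word_prons[token]
    raise KeyError(f"no pron available for token {token!r}")


def phrase_pronunciations(phrase, model):
    """Assemble one phrase pronunciation per combination of token prons."""
    per_token = [prons_for_token(t, model) for t in tokenize(phrase)]
    return [" ".join(combo) for combo in product(*per_token)]


def add_phrase(phrase, model):
    """Add the phrase and its assembled pronunciation(s) to the model."""
    prons = phrase_pronunciations(phrase, model)
    model.phrase_prons[phrase] = prons
    return prons


# The isolated "dr" entry stays untouched; only the phrase gets the new pron.
model = LanguageModel(word_prons={"dr": ["D R AY V"], "smith": ["S M IH TH"]})
added = add_phrase("Dr. Smith", model)
```

In this sketch the isolated entry for "dr" keeps its original pron, while the added phrase "Dr. Smith" is stored with the in-phrase pron, which mirrors the distinction the abstract draws between isolated components and components inside customer-added phrases.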
13. A system for adding phrase pronunciations to a language model, the system comprising:
  a computer with a computer code mechanism for processing a list of pron components whose pronunciations differ when they occur in a phrase, assigning at least one pron to each pron component, determining the pronunciation of a first phrase by first tokenizing the first phrase by generating a list of tokens corresponding to the first phrase, then determining a pron for each of the list of tokens, then assembling the pronunciation of the first phrase based on a combination of each said pron, and adding the first phrase and the pronunciation of the first phrase to a language model;
  a language model electronically accessible by said computer code mechanism; and
  a tokenizer for generating a list of tokens corresponding to the said first phrase, said tokenizer in electronic communication with said computer code mechanism.
Dependent claims: 14, 15, 16, 17
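The three elements of claim 13 (code mechanism, language model, tokenizer) can be pictured as separately supplied components. The following Python sketch is only one possible arrangement under stated assumptions: the class name `PhrasePronunciationSystem`, the dictionary-based language model, the `whitespace_tokenizer` function, and the phoneme strings are all hypothetical, and each token is given a single pron for brevity.

```python
from typing import Callable, Dict, List

# Assumption: a tokenizer is any callable mapping a phrase to its tokens.
Tokenizer = Callable[[str], List[str]]


class PhrasePronunciationSystem:
    """Hypothetical stand-in for the claimed computer code mechanism."""

    def __init__(self, tokenizer: Tokenizer,
                 language_model: Dict[str, str],
                 pron_components: Dict[str, str]) -> None:
        self.tokenizer = tokenizer              # separate, swappable component
        self.language_model = language_model    # accessed and updated in place
        self.pron_components = pron_components  # in-phrase prons per component

    def add_phrase(self, phrase: str) -> str:
        prons = []
        for token in self.tokenizer(phrase):
            # In-phrase prons take priority over the model's own entries.
            pron = self.pron_components.get(token) or self.language_model.get(token)
            if pron is None:
                raise KeyError(f"no pron available for token {token!r}")
            prons.append(pron)
        pronunciation = " ".join(prons)
        self.language_model[phrase] = pronunciation
        return pronunciation


def whitespace_tokenizer(phrase: str) -> List[str]:
    """Illustrative tokenizer: lowercase, then split on whitespace."""
    return phrase.lower().split()


model = {"main": "M EY N", "st": "S T"}
system = PhrasePronunciationSystem(whitespace_tokenizer, model, {"st": "S T R IY T"})
result = system.add_phrase("Main St")
```

Passing the tokenizer and language model in as arguments keeps them independent of the code mechanism, which loosely matches the claim's wording that both are in communication with, rather than part of, that mechanism.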
Specification