System and method for generating a phrase pronunciation
Abstract
A system and method for a speech recognition technology that allows language models to be customized through the addition of special pronunciations for components of phrases, which are added to the factory language models during customization. It allows components of a phrase to have different pronunciations inside customer-added phrases than are specified for those isolated components in the factory language models.
18 Claims
1. A method in a computer system for adding phrase pronunciations to a language model, the method comprising steps of:
receiving at least one phrase to be added to the language model, the at least one phrase comprising a first phrase, the first phrase comprising a plurality of tokens including a first token;
generating, using the computer system, a phrase pronunciation for the first phrase comprising a token pronunciation for the first token in the first phrase, wherein generating the phrase pronunciation for the first phrase comprises determining if the first token is represented in a pron component list, and, if so, selecting as the token pronunciation for the first token in the first phrase a component pronunciation from the pron component list, wherein the pron component list comprises a list of one or more component pronunciations for at least the first token as pronounced in one or more phrases, wherein the list of one or more component pronunciations is different from any list of one or more language model pronunciations in the language model for the first token; and
adding the phrase pronunciation for the first phrase to the language model;
wherein the step of generating the phrase pronunciation for the first phrase further comprises:
if the first token is not represented in the pron component list, determining if the first token is represented in the language model, and, if so, selecting a language model pronunciation from the language model as the token pronunciation for the first token in the first phrase.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
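The two-tier lookup recited in claim 1 can be sketched in Python as follows. This is a minimal illustration and not the patented implementation: the dictionary-based data structures, the function names `generate_phrase_pronunciation` and `add_phrase`, the choice of the first listed pronunciation, the space-separated phone strings, and the handling of a token found in neither source are all assumptions made for the example.

```python
from typing import Dict, List, Sequence

# Hypothetical representation: token (or phrase) -> candidate pronunciations.
Lexicon = Dict[str, List[str]]


def generate_phrase_pronunciation(
    tokens: Sequence[str],
    pron_components: Lexicon,
    language_model: Lexicon,
) -> str:
    """Build a phrase pronunciation token by token.

    The pron component list is consulted first; the language model is
    used only when the token is not represented in that list.
    """
    parts = []
    for token in tokens:
        component_prons = pron_components.get(token)
        model_prons = language_model.get(token)
        if component_prons:
            # Assumption: take the first component pronunciation listed.
            parts.append(component_prons[0])
        elif model_prons:
            # Fallback: reuse the language model's pronunciation.
            parts.append(model_prons[0])
        else:
            # Assumption: claim 1 does not address this case, so fail loudly.
            raise KeyError(f"no pronunciation available for token {token!r}")
    return " ".join(parts)


def add_phrase(
    tokens: Sequence[str],
    pron_components: Lexicon,
    language_model: Lexicon,
) -> str:
    """Generate the phrase pronunciation and store it in the language model."""
    pronunciation = generate_phrase_pronunciation(
        tokens, pron_components, language_model
    )
    phrase = " ".join(tokens)
    language_model.setdefault(phrase, []).append(pronunciation)
    return pronunciation


# Illustrative data only: "read" is pronounced differently inside a phrase
# than it is as an isolated language model entry.
example_model: Lexicon = {"well": ["w eh l"], "read": ["r iy d"]}
example_components: Lexicon = {"read": ["r eh d"]}
example_result = add_phrase(["well", "read"], example_components, example_model)
```

In this sketch the isolated entry for "read" in the language model is left unchanged; only the newly added phrase entry uses the component pronunciation.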
10. A computer system comprising:
a tokenizer that parses a phrase to be added to a language model into a plurality of tokens including a first token; and
a computer code mechanism that:
generates a phrase pronunciation for the phrase comprising a token pronunciation for the first token in the phrase, wherein generating the phrase pronunciation for the phrase comprises determining if the first token is represented in a pron component list, and, if so, selecting as the token pronunciation for the first token in the phrase a component pronunciation from the pron component list, wherein the pron component list comprises a list of one or more component pronunciations for at least the first token as pronounced in one or more phrases, wherein the list of one or more component pronunciations is different from any list of one or more language model pronunciations in the language model for the first token; and
adds the phrase pronunciation for the phrase to the language model;
wherein the computer code mechanism generates the phrase pronunciation for the phrase at least in part by, if the first token is not represented in the pron component list, determining if the first token is represented in the language model, and, if so, selecting a language model pronunciation from the language model as the token pronunciation for the first token in the phrase; and
wherein the tokenizer and/or the computer code mechanism is implemented by a computer.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18
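A system along the lines of claim 10 pairs a tokenizer with the pronunciation-generating mechanism. The sketch below shows one hypothetical arrangement: the whitespace tokenizer, the `PhrasePronunciationSystem` class, its method names, and the sample phone strings are assumptions for illustration. The claim itself does not prescribe how the tokenizer splits a phrase or how pronunciations are stored.

```python
from dataclasses import dataclass, field
from typing import Dict, List


def tokenize(phrase: str) -> List[str]:
    """Assumed tokenizer: lowercase the phrase and split on whitespace."""
    return phrase.lower().split()


@dataclass
class PhrasePronunciationSystem:
    # Token -> pronunciations used inside phrases (the pron component list).
    pron_components: Dict[str, List[str]] = field(default_factory=dict)
    # Token or phrase -> pronunciations (a stand-in for the language model).
    language_model: Dict[str, List[str]] = field(default_factory=dict)

    def token_pronunciation(self, token: str) -> str:
        """Prefer the pron component list, then fall back to the language model."""
        for source in (self.pron_components, self.language_model):
            pronunciations = source.get(token)
            if pronunciations:
                # Assumption: the first listed pronunciation is selected.
                return pronunciations[0]
        # Assumption: the claim is silent on tokens found in neither source.
        raise KeyError(f"no pronunciation available for token {token!r}")

    def add_phrase(self, phrase: str) -> str:
        """Tokenize the phrase, build its pronunciation, and store it."""
        tokens = tokenize(phrase)
        pronunciation = " ".join(self.token_pronunciation(t) for t in tokens)
        self.language_model.setdefault(" ".join(tokens), []).append(pronunciation)
        return pronunciation


# Illustrative data only: "live" inside the phrase differs from its
# isolated language model pronunciation.
system = PhrasePronunciationSystem(
    pron_components={"live": ["l ay v"]},
    language_model={"live": ["l ih v"], "wire": ["w ay er"]},
)
phrase_pronunciation = system.add_phrase("Live wire")
```

Keeping the two lookup sources as separate fields mirrors the claim's requirement that the component pronunciations be distinct from the language model's own pronunciations for the same token.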
Specification