Method and system for automatically determining phonetic transcriptions associated with spelled words
Abstract
New entries are added to the lexicon by entering them as spelled words. A transcription generator, such as a decision-tree-based phoneme or morpheme transcription generator, converts each spelled word into a set of n-best transcriptions or sequences. Meanwhile, user input or automatically generated speech corresponding to the spelled word is processed by an automatic speech recognizer and the recognizer rescores the transcriptions or sequences produced by the transcription generator. One or more of the highest scored (highest confidence) transcriptions may be added to the lexicon to update it. If desired, the spelled word-pronunciation pairs generated by the system can be used to retrain the transcription generator, making the system adaptive or self-learning.
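The rescoring and lexicon update described in the abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the function name `rescore_and_update`, the `acoustic_score` callable standing in for the automatic speech recognizer, and the dictionary-based lexicon are all assumptions introduced here. Higher scores are assumed to mean higher confidence.

```python
from typing import Callable, Dict, List, Tuple

# A transcription is modeled as a tuple of sound-unit symbols (assumption).
Transcription = Tuple[str, ...]


def rescore_and_update(
    word: str,
    nbest: List[Transcription],
    acoustic_score: Callable[[Transcription], float],
    lexicon: Dict[str, List[Transcription]],
    keep: int = 1,
) -> List[Tuple[Transcription, float]]:
    """Rescore n-best candidates and add the top `keep` to the lexicon.

    `acoustic_score` is a hypothetical stand-in for the recognizer: it
    returns a confidence for one candidate given the speech data.
    """
    # Rank candidates by the recognizer's score, best first.
    rescored = sorted(
        ((t, acoustic_score(t)) for t in nbest),
        key=lambda pair: pair[1],
        reverse=True,
    )
    # Add the highest-scored transcription(s) to the entry for this word.
    lexicon.setdefault(word, []).extend(t for t, _ in rescored[:keep])
    return rescored
```

The returned word-pronunciation pairs are the kind of data the abstract says could be fed back to retrain the transcription generator; that retraining step is not shown here.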
15 Claims
1. A method for automatically generating the phonetic transcription associated with a spelled word, comprising:
transcribing said spelled word into sound units to generate a plurality of transcriptions each corresponding to said spelled word without using a pre-existing dictionary;
associating a score with each transcription;
supplying said plurality of transcriptions to an automatic speech recognizer;
supplying speech data corresponding to said spelled word to said automatic speech recognizer when none of the scores is above a predetermined threshold; and
using said automatic speech recognizer to rescore said transcriptions based on said speech data.
(Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9)
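The threshold condition in claim 1 can be read as a gate: speech data is requested, and the recognizer rescores, only when no generated transcription already scores above the threshold. The sketch below illustrates that control flow under stated assumptions. The names `transcribe_with_fallback`, `get_speech`, and `rescore` are hypothetical, transcriptions are represented as plain strings, and scores are assumed to be comparable floats where larger is better.

```python
from typing import Callable, List, Tuple

# (transcription, score) pair; the string form is an assumption.
Scored = Tuple[str, float]


def transcribe_with_fallback(
    scored: List[Scored],
    threshold: float,
    get_speech: Callable[[], bytes],
    rescore: Callable[[List[Scored], bytes], List[Scored]],
) -> List[Scored]:
    """Return candidates ranked best first, rescoring only when needed."""
    # If any candidate is already confident enough, skip the speech step.
    if any(score > threshold for _, score in scored):
        return sorted(scored, key=lambda pair: pair[1], reverse=True)
    # Otherwise obtain speech data and let the recognizer rescore.
    speech = get_speech()
    return sorted(rescore(scored, speech), key=lambda pair: pair[1], reverse=True)
```

Passing `get_speech` as a callable keeps the speech request lazy, so a user is prompted (or speech is synthesized) only on the low-confidence path.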
10. A system for updating a lexicon based on spelled word input comprising:
transcription generator receptive of said spelled word input for generating a plurality of scored transcriptions without using a pre-existing dictionary; and
a confidence checking mechanism for determining whether any of the scored transcriptions has a confidence level above a predetermined threshold;
an automatic speech recognizer receptive of speech data corresponding to said spelled word input for rescoring said plurality of scored transcriptions to generate a plurality of rescored transcriptions when none of the scored transcriptions has a confidence level above the predetermined threshold; and
selection mechanism for selecting and using at least one of said rescored transcriptions to update said lexicon.
(Dependent claims: 11, 12, 13, 14, 15)
Specification