Method and system for improving pronunciation in a voice control system
First Claim
Patent Images
1. A voice enunciation system in a data processing system comprising:
- a. a processor comprising a central processing unit and memory;
b. an audio signal output device;
c. the processor memory further comprisingi. a work queue for receiving text words for processing;
ii. a playback queue for receiving text words from the work queue for audibly pronouncing the text words on the audio signal output device, andiii. a dictionary for storing preferred pronunciations of words; and
d. the processor further providing means fori. storing text words in a memory;
ii. sequentially extracting text words from the memory;
iii. attempting to look up each of the sequentially extracted words in a dictionary and if a word is found in the dictionary, placing that word on a work queue as a wave file entry, and if the word is not found in the dictionary, placing that word on the work queue as a word string entry;
iv. continuing to place words on the work queue until a predetermined threshold number of words have been placed on the work queue;
v. when the predetermined threshold number of words have been placed on the work queues starting an asynchronous play thread, the asynchronous play thread comprising(a) extracting an entry from the work queue;
(b) determining if the entry is a wave file entry or a word string entry;
(c) if the entry is a wave file entry, audibly playing the wave file, and(d) if the entry is a word string audibly playing the word string phonetically;
vi. once an entry has been audibly played, placing that entry on a playback queue until the playback queue is full; and
vii. once the playback queue is full, deleting the oldest entry from the playback queue.
1 Assignment
0 Petitions
Accused Products
Abstract
A voice enunciation system and method provides a user with the capability to sound out text files. As the files are audibly played, if the user is not satisfied with the pronunciation of a particular word, the system provides the user with the means of replacing the word with his own particular pronunciation. The preferred pronunciation is also stored in an override dictionary so that any subsequent encounter with that particular word is pronounced correctly.
52 Citations
26 Claims
-
1. A voice enunciation system in a data processing system comprising:
-
a. a processor comprising a central processing unit and memory; b. an audio signal output device; c. the processor memory further comprising i. a work queue for receiving text words for processing; ii. a playback queue for receiving text words from the work queue for audibly pronouncing the text words on the audio signal output device, and iii. a dictionary for storing preferred pronunciations of words; and d. the processor further providing means for i. storing text words in a memory; ii. sequentially extracting text words from the memory; iii. attempting to look up each of the sequentially extracted words in a dictionary and if a word is found in the dictionary, placing that word on a work queue as a wave file entry, and if the word is not found in the dictionary, placing that word on the work queue as a word string entry; iv. continuing to place words on the work queue until a predetermined threshold number of words have been placed on the work queue; v. when the predetermined threshold number of words have been placed on the work queues starting an asynchronous play thread, the asynchronous play thread comprising (a) extracting an entry from the work queue; (b) determining if the entry is a wave file entry or a word string entry; (c) if the entry is a wave file entry, audibly playing the wave file, and (d) if the entry is a word string audibly playing the word string phonetically; vi. once an entry has been audibly played, placing that entry on a playback queue until the playback queue is full; and vii. once the playback queue is full, deleting the oldest entry from the playback queue. - View Dependent Claims (2, 3, 4)
-
-
5. A voice enunciation method comprising the steps of:
-
a. storing text words in a memory; b. sequentially extracting text words from the memory; c. attempting to look up each of the sequentially extracted words in a dictionary and if a word is found in the dictionary, placing that word on a work queue as a wave file entry, and if the word is not found in the dictionary, placing that word on the work queue as a word string entry; d. continuing to place words on the work queue until a predetermined threshold number of words have been placed on the work queue; e. when the predetermined threshold number of words have been placed on the work queue, starting an asynchronous play thread, the asynchronous play thread comprising i. extracting an entry from the work queue; ii. determining if the entry is a wave file entry or a word string entry; iii. if the entry is a wave file entry, audibly playing the wave file; and iv. if the entry is a word string audibly playing(l the word string phonetically; f. once an entry has been audibly played, placing that entry on a playback queue until the playback queue is full; and g. once the playback queue is full, deleting the oldest entry from the playback queue. - View Dependent Claims (6, 7, 8, 9, 10)
-
-
11. A method in a data processing system for enhancing voice pronunciation of a textual input stream comprising the steps of:
-
receiving text from the textual input stream; customizing a customizable pronunciation dictionary by a user immediately upon recognition by the user that one or more textual portions from the textual input stream was mispronounced the customizing step further comprising invoking a process interruption by a user during processing of the textual input stream, automatically suspending the process before completing processing of the textual input stream, and presenting an appropriate interface for selecting and editing the textual portions for proper pronunciations; comparing the text with the customizable pronunciation dictionary; determining a sound interface input in accordance with one of a plurality of playing methods for playing sound associated with the text; and routing the sound interface input to an appropriate device interface in accordance with the one of a plurality of playing methods. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18)
-
-
19. Apparatus for enhancing voice pronunciation of a textual input stream in a data processing system comprising:
-
means for receiving text from the textual input stream; means for comparing the text with a customizable pronunciation dictionary, the customizable pronunciation dictionary including means for customizing the pronunciation dictionary by a user immediately upon recognition by the user that one or more textual portions from the textual input stream was mispronounced, wherein the means for customizing further comprises means for invoking a process interruption by a user during processing of the textual input stream. means for automatically suspending the process before completing processing of the textual input stream, and means for presenting an appropriate interface for selecting and editing the textual portions for proper pronunciations; means for determining a sound interface input in accordance with one of a plurality of playing methods for playing sound associated with the text; and means for routing the sound interface input to an appropriate device interface in accordance with the one of a plurality of playing methods. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26)
-
Specification