Domain-specific concatenative audio
First Claim
Patent Images
1. A method for generating speech output of a text string, comprising:
- receiving the text string comprised of more than one word and storing the text string in a remaining string buffer holding an unmatched portion of the string;
recursively repeating the following operations until the length of the text string in the remaining string buffer is not greater than zero;
initializing a working copy of the text string from the remaining string buffer,comparing the working copy of the text string with strings in a speech library,removing a rightmost word from the working copy until the working copy of the text string matches a string in the speech library or until the length of the working copy of the text string is one,determining whether the length of the working copy of the text string is one and whether it matches a string in the speech library,when the length of the working copy of the text string is one and does not match a string in the speech library, removing the leftmost word from the text string in the remaining string buffer and converting the leftmost word into a sound file using a text to speech converter,when the working copy of the text string matches a string in the speech library, looking up an associated speech file for the text string in the speech library, andremoving the matched text from the text string in the remaining string buffer;
subsequent to the recursive operations, concatenating the speech files and any sound files together to produce a speech output for a user.
1 Assignment
0 Petitions
Accused Products
Abstract
One embodiment of the present invention provides a system for generating speech output from a text string. During operation, the system first receives the text string and then examines the text string to locate one or more substrings within the text string that are found in a speech library. Next, the system looks up speech files associated with the one or more substrings in the speech library. The system then concatenates these speech files to produce a speech output for a user.
-
Citations
24 Claims
-
1. A method for generating speech output of a text string, comprising:
-
receiving the text string comprised of more than one word and storing the text string in a remaining string buffer holding an unmatched portion of the string; recursively repeating the following operations until the length of the text string in the remaining string buffer is not greater than zero; initializing a working copy of the text string from the remaining string buffer, comparing the working copy of the text string with strings in a speech library, removing a rightmost word from the working copy until the working copy of the text string matches a string in the speech library or until the length of the working copy of the text string is one, determining whether the length of the working copy of the text string is one and whether it matches a string in the speech library, when the length of the working copy of the text string is one and does not match a string in the speech library, removing the leftmost word from the text string in the remaining string buffer and converting the leftmost word into a sound file using a text to speech converter, when the working copy of the text string matches a string in the speech library, looking up an associated speech file for the text string in the speech library, and removing the matched text from the text string in the remaining string buffer; subsequent to the recursive operations, concatenating the speech files and any sound files together to produce a speech output for a user. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A computer-readable storage device storing instructions that when executed by a computer cause the computer to perform a method for generating speech output of a text string, the method comprising:
-
receiving the text string comprised of more than one word and storing the text string in a remaining string buffer holding an unmatched portion of the string; recursively repeating the following operations until the length of the text string in the remaining string buffer is not greater than zero; initializing a working copy of the text string from the remaining string buffer, comparing the working copy of the text string with strings in a speech library, removing a rightmost word from the working copy until the working copy of the text string matches a string in the speech library or until the length of the working copy of the text string is one, determining whether the length of the working copy of the text string is one and whether it matches a string in the speech library, when the length of the working copy of the text string is one and does not match a string in the speech library, removing the leftmost word from the text string in the remaining string buffer and converting the leftmost word into a sound file using a text to speech converter, when the working copy of the text string matches a string in the speech library, looking up an associated speech file for the text string in the speech library, and removing the matched text from the text string in the remaining string buffer; subsequent to the recursive operations, concatenating the speech files and any sound files together to produce a speech output for a user. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. An apparatus for generating speech output of a text string, comprising:
-
a processor; a receiving mechanism configured to; receive the text string comprised of more than one word and store the text string in a remaining string buffer holding an unmatched portion of the string; recursively repeating the following operations until the length of the text string in the remaining string buffer is not greater than zero; the receiving mechanism configured to initialize a working copy of the text string from the remaining string buffer, an examining mechanism configured to compare the working copy of the text string with strings in a speech library, an examining mechanism configured to remove a rightmost word from the working cony until the working copy of the text string matches a string in the speech library or until the length of the working copy of the text string is one, an examining mechanism configured to determine whether the length of the working copy of the text string is one and whether it matches a string in the speech library, an examining mechanism configured to remove the leftmost word from the text string in the remaining string buffer and convert the leftmost word into a sound file using a text to speech converter when the length of the working copy of the text string is one and does not match a string in the speech library, a lookup mechanism configured to look up an associated speech file for the text string in the speech library when the examining mechanism determines the working copy of the text string matches a string in the speech library, and removing the matched text from the text string in the remaining string buffer; subsequent to the recursive operations, a concatenating mechanism is configured to concatenate the speech files and any sound files together to produce a speech output for the user. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24)
-
Specification