SYSTEM AND METHOD FOR PERFORMING SPEECH SYNTHESIS WITH A CACHE OF PHONEME SEQUENCES
First Claim
Patent Images
1. A method of performing speech synthesis, the method comprising:
- applying a first part of a speech synthesizer to a text corpus to obtain a plurality of phoneme sequences, the first part of the speech synthesizer only identifying possible phoneme sequences;
for each of the obtained plurality of phoneme sequences, identifying joins that would be calculated to synthesize each of the plurality of respective phoneme sequences; and
adding the identified joins to a cache for use in speech synthesis.
4 Assignments
0 Petitions
Accused Products
Abstract
Disclosed are systems, methods, and computer readable media for performing speech synthesis. The method embodiment comprises applying a first part of a speech synthesizer to a text corpus to obtain a plurality of phoneme sequences, the first part of the speech synthesizer only identifying possible phoneme sequences, for each of the obtained plurality of phoneme sequences, identifying joins that would be calculated to synthesize each of the plurality of respective phoneme sequences, and adding the identified joins to a cache for use in speech synthesis.
-
Citations
18 Claims
-
1. A method of performing speech synthesis, the method comprising:
-
applying a first part of a speech synthesizer to a text corpus to obtain a plurality of phoneme sequences, the first part of the speech synthesizer only identifying possible phoneme sequences; for each of the obtained plurality of phoneme sequences, identifying joins that would be calculated to synthesize each of the plurality of respective phoneme sequences; and adding the identified joins to a cache for use in speech synthesis. - View Dependent Claims (2, 3, 4)
-
-
5. A method of synthesizing a speech signal, the method comprising:
-
(1) selecting one or more acoustic units from an acoustic unit database; (2) determining whether a join cost of an acoustic unit sequential pair resides in a cache created by steps comprising; (a) applying a first part of a speech synthesizer to a text corpus to obtain a plurality of phoneme sequences, the first part of the speech synthesizer only identifying possible phoneme sequences; (b) for each of the obtained plurality of phoneme sequences, identifying joins that would be calculated to synthesize each of the plurality of respective phoneme sequences; and (c) adding the identified joins to a cache for use in speech synthesis (3) if the cache contains the join, extracting the join from the cache for use in speech synthesis; and (4) if the cache does not contain the join, calculating a value of the join for use in speech synthesis. - View Dependent Claims (6)
-
-
7. A system for performing speech synthesis, the system comprising:
-
a module configured to apply a first part of a speech synthesizer to a text corpus to obtain a plurality of phoneme sequences, the first part of the speech synthesizer only identifying possible phoneme sequences; for each of the obtained plurality of phoneme sequences, a module configured to identify joins that would be calculated to synthesize each of the plurality of respective phoneme sequences; and a module configured to add the identified joins to a cache for use in speech synthesis. - View Dependent Claims (8, 9, 10)
-
-
11. A system for synthesizing a speech signal, the system comprising:
-
(1) a module configured to select one or more acoustic units from an acoustic unit database; (2) a module configured to determine whether a join cost of an acoustic unit sequential pair resides in a cache created by steps comprising; (a) applying a first part of a speech synthesizer to a text corpus to obtain a plurality of phoneme sequences, the first part of the speech synthesizer only identifying possible phoneme sequences; (b) for each of the obtained plurality of phoneme sequences, identifying joins that would be calculated to synthesize each of the plurality of respective phoneme sequences; and (c) adding the identified joins to a cache for use in speech synthesis (3) if the cache contains the join, a module configured to extract the join from the cache for use in speech synthesis; and (4) if the cache does not contain the join, a module configured to calculate a value of the join for use in speech synthesis. - View Dependent Claims (12)
-
-
13. A computer readable medium storing a computer program having instructions for performing speech synthesis, the instructions comprising:
-
applying a first part of a speech synthesizer to a text corpus to obtain a plurality of phoneme sequences, the first part of the speech synthesizer only identifying possible phoneme sequences; for each of the obtained plurality of phoneme sequences, identifying joins that would be calculated to synthesize each of the plurality of respective phoneme sequences; and adding the identified joins to a cache for use in speech synthesis. - View Dependent Claims (14, 15, 16)
-
-
17. A computer readable medium storing a computer program having instructions for synthesizing a speech signal, the instructions comprising:
-
(1) selecting one or more acoustic units from an acoustic unit database; (2) determining whether a join cost of an acoustic unit sequential pair resides in a cache created by steps comprising; (a) applying a first part of a speech synthesizer to a text corpus to obtain a plurality of phoneme sequences, the first part of the speech synthesizer only identifying possible phoneme sequences; (b) for each of the obtained plurality of phoneme sequences, identifying joins that would be calculated to synthesize each of the plurality of respective phoneme sequences; and (c) adding the identified joins to a cache for use in speech synthesis (3) if the cache contains the join, extracting the join from the cache for use in speech synthesis; and (4) if the cache does not contain the join, calculating a value of the join for use in speech synthesis. - View Dependent Claims (18)
-
Specification