Method and system for enhancing a speech database
First Claim
Patent Images
1. A method comprising:
- receiving, on a device having a processor, text from a user for conversion to speech via a text-to-speech process;
identifying, via the processor, a primary speech segment in a primary speech database which does not meet a need of the text-to-speech process;
identifying, via the processor, a replacement speech segment which satisfies the need in a secondary speech database; and
adding replacement speech segment to the primary database such that the primary database meets the need of the text-to-speech process.
4 Assignments
0 Petitions
Accused Products
Abstract
A system, method and computer readable medium that enhances a speech database for speech synthesis is disclosed. The method may include labeling audio files in a primary speech database, identifying segments in the labeled audio files that have varying pronunciations based on language differences, identifying replacement segments in a secondary speech database, enhancing the primary speech database by substituting the identified secondary speech database segments for the corresponding identified segments in the primary speech database, and storing the enhanced primary speech database for use in speech synthesis.
-
Citations
20 Claims
-
1. A method comprising:
-
receiving, on a device having a processor, text from a user for conversion to speech via a text-to-speech process; identifying, via the processor, a primary speech segment in a primary speech database which does not meet a need of the text-to-speech process; identifying, via the processor, a replacement speech segment which satisfies the need in a secondary speech database; and adding replacement speech segment to the primary database such that the primary database meets the need of the text-to-speech process. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system comprising:
-
a processor; and a computer-readable storage medium having instructions stored which, when executed by the processor, result in the processor performing operations comprising; receiving, on the system, text from a user for conversion to speech via a text-to-speech process; identifying a primary speech segment in a primary speech database which does not meet a need of the text-to-speech process; identifying a replacement speech segment which satisfies the need in a secondary speech database; and adding replacement speech segment to the primary database such that the primary database meets the need of the text-to-speech process. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A computer-readable storage device having instructions stored which, when executed by the computing device, result in the computing device performing operations comprising:
-
receiving, on the computing device, text from a user for conversion to speech via a text-to-speech process; identifying a primary speech segment in a primary speech database which does not meet a need of a text-to-speech process; identifying a replacement speech segment which satisfies the need in a secondary speech database; and adding replacement speech segment to the primary database such that the primary database meets the need of the text-to-speech process. - View Dependent Claims (18, 19, 20)
-
Specification