Cost efficient distributed text-to-speech processing
First Claim
1. A method for performing text-to-speech (TTS) processing, comprising:
receiving, at a server, a TTS request for TTS processing of text data into speech, wherein the TTS request is sent by a local device remote from the server and includes text data originating from the local device;
receiving a user preference for TTS processing performance factors, the TTS processing performance factors including at least one of a cost of TTS processing, a quality of TTS processing or a length of time until delivery of TTS results;
determining a plurality of processing options for completion of the TTS request based at least in part on the user preference, wherein the plurality of processing options vary over at least one of cost, quality and delivery time;
providing the plurality of processing options to the local device;
receiving a user selection of a processing option from the plurality of processing options;
scheduling TTS resources for processing the TTS request based at least in part on the user selection;
synthesizing the text data into speech based at least in part on the TTS resources; and
providing audio data to the local device, the audio data including the synthesized speech.
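The steps recited in claim 1 can be illustrated with a short sketch. This is not the patented implementation; the option catalogue, the field names, the `choose` and `synthesize` callbacks, and the shape of the `resources` dictionary are all hypothetical, introduced only to show how a stated preference could order a set of options that vary over cost, quality and delivery time.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class ProcessingOption:
    name: str
    cost_cents: int        # price charged for the job (assumed unit)
    quality: int           # 1 = basic ... 3 = premium (hypothetical scale)
    delivery_minutes: int  # promised time until audio is returned


# Hypothetical catalogue; a real system would derive it from server load and pricing.
CATALOGUE = [
    ProcessingOption("overnight-basic", 5, 1, 480),
    ProcessingOption("standard", 20, 2, 60),
    ProcessingOption("express-premium", 75, 3, 5),
]


def determine_options(preference: str,
                      catalogue: List[ProcessingOption] = CATALOGUE) -> List[ProcessingOption]:
    """Rank options so the one best matching the stated preference comes first."""
    sort_keys = {
        "cost": lambda o: o.cost_cents,        # cheapest first
        "quality": lambda o: -o.quality,       # highest quality first
        "time": lambda o: o.delivery_minutes,  # fastest first
    }
    if preference not in sort_keys:
        raise ValueError(f"unknown preference: {preference!r}")
    return sorted(catalogue, key=sort_keys[preference])


def handle_tts_request(text, preference, choose, synthesize):
    """Walk through the claimed steps with caller-supplied stand-ins."""
    options = determine_options(preference)   # determine the processing options
    selection = choose(options)               # provide options, receive the user selection
    resources = {                             # schedule resources (assumed representation)
        "tier": selection.quality,
        "finish_within_minutes": selection.delivery_minutes,
    }
    return synthesize(text, resources)        # synthesize and return audio data
```

A caller might pass `choose=lambda opts: opts[0]` to accept the top-ranked option and a stub `synthesize` that returns placeholder bytes; in a deployed system the selection would come back from the local device and synthesis would run on the scheduled TTS servers.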
Abstract
Text-to-speech (TTS) processing may be distributed among remote TTS servers that local user devices access over a network connection. The cost of processing on these servers may vary over time. To improve the efficiency of TTS processing, certain requests may be scheduled during low-cost server times. A user may indicate a preference for such low-cost delivery. A user may instead indicate a preference for quick turnaround, in which case TTS processing may be scheduled during higher-cost server times. A TTS processing system may also consider the quality of TTS results when scheduling server processing time for a particular TTS request, and may allocate more server time when higher-quality results are desired.
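The scheduling idea in the abstract can be sketched as a search for the cheapest block of server time that still meets a delivery deadline. The sketch below is only one assumed way to do this: the per-hour cost table, the function name, and the representation of time as whole hours counted from the moment of the request are hypothetical and do not come from the patent.

```python
def cheapest_start_hour(hourly_cost, hours_needed, deadline_hour):
    """Return (start_hour, total_cost) for the cheapest contiguous window.

    hourly_cost[i] is the assumed server cost for hour i, counted from now.
    The job must finish by deadline_hour, so the window has to fit inside
    hours 0 .. deadline_hour - 1.
    """
    if hours_needed <= 0:
        raise ValueError("hours_needed must be positive")
    if deadline_hour > len(hourly_cost) or hours_needed > deadline_hour:
        raise ValueError("job cannot finish before the deadline")

    best = None
    for start in range(deadline_hour - hours_needed + 1):
        total = sum(hourly_cost[start:start + hours_needed])
        # Strict comparison keeps the earliest window when costs tie.
        if best is None or total < best[1]:
            best = (start, total)
    return best


# A user who accepts delivery within six hours gets the cheap window;
# a user who wants results within two hours pays the current, higher rate.
costs = [9, 9, 4, 2, 2, 6]
relaxed = cheapest_start_hour(costs, hours_needed=2, deadline_hour=6)  # (3, 4)
urgent = cheapest_start_hour(costs, hours_needed=2, deadline_hour=2)   # (0, 18)
```

Allocating more server time for higher quality would correspond, in this sketch, to a larger `hours_needed`, which narrows the set of windows that fit before the deadline.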
20 Claims
1. A method for performing text-to-speech (TTS) processing, as recited in full under First Claim above.
Dependent claims: 2, 3, 4.
5. A system comprising:
at least one processor;
a memory device including instructions operable to be executed by the at least one processor to perform a set of actions, configuring the at least one processor:
to receive a TTS request for TTS processing of text data into speech, wherein the TTS request is sent by a local device remote from the system and includes text data originating from the local device;
to estimate delivery conditions for completion of the TTS request, wherein the delivery conditions include an estimated cost;
to receive a user preference for TTS processing based on the estimated delivery conditions;
to schedule TTS resources for processing the TTS request based on the user preference; and
to synthesize the text data into speech based at least in part on the TTS resources.
Dependent claims: 6, 7, 8, 9, 10, 11, 12.
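Claim 5 differs from claim 1 mainly in that the system first estimates delivery conditions, including an estimated cost, and only then receives the user preference. A minimal sketch of such an estimate follows, assuming cost scales with text length; the rate per thousand characters, the throughput figure, and all names are hypothetical, since the claim does not say how the estimate is computed.

```python
def estimate_delivery_conditions(text, cents_per_1k_chars, chars_per_second):
    """Estimate cost and processing time for a TTS request (assumed linear model)."""
    if cents_per_1k_chars < 0 or chars_per_second <= 0:
        raise ValueError("rate must be non-negative and throughput positive")
    n = len(text)
    # Integer ceiling division so partial units are rounded up.
    cost_cents = -(-n * cents_per_1k_chars // 1000)
    seconds = -(-n // chars_per_second)
    return {"estimated_cost_cents": cost_cents, "estimated_seconds": seconds}


# Estimate for a 2,500-character request at 4 cents per 1,000 characters
# and an assumed throughput of 500 characters per second.
estimate = estimate_delivery_conditions("a" * 2500, cents_per_1k_chars=4, chars_per_second=500)
```

The resulting estimate would be sent to the local device so that the user preference received next is based on it, as the claim requires.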
13. A non-transitory computer-readable storage medium storing processor-executable instructions for controlling a computing device, comprising:
program code to receive a TTS request for TTS processing of text data into speech, wherein the TTS request is sent by a local device remote from the computing device and includes text data originating from the local device;
program code to estimate delivery conditions for completion of the TTS request, wherein the delivery conditions include an estimated cost;
program code to receive a user preference for TTS processing based on the estimated delivery conditions;
program code to schedule TTS resources for processing the TTS request based on the user preference; and
program code to synthesize the text data into speech based at least in part on the TTS resources.
Dependent claims: 14, 15, 16, 17, 18, 19, 20.
Specification