CORRECTING UNINTELLIGIBLE SYNTHESIZED SPEECH
First Claim
1. A method of speech synthesis, comprising the steps of:
- (a) receiving a text input in a text-to-speech system;
(b) processing the text input into synthesized speech using a processor of the system;
(c) establishing that the synthesized speech is unintelligible;
(d) reprocessing the text input into subsequent synthesized speech to correct the unintelligible synthesized speech; and
(e) outputting the subsequent synthesized speech to a user via a loudspeaker.
3 Assignments
0 Petitions
Accused Products
Abstract
A method and system of speech synthesis. A text input is received in a text-to-speech system and, using a processor of the system, the text input is processed into synthesized speech which is established as unintelligible. The text input is reprocessed into subsequent synthesized speech and output to a user via a loudspeaker to correct the unintelligible synthesized speech. In one embodiment, the synthesized speech can be established as unintelligible by predicting intelligibility of the synthesized speech, and determining that the predicted intelligibility is lower than a minimum threshold. In another embodiment, the synthesized speech can be established as unintelligible by outputting the synthesized speech to the user via the loudspeaker, and receiving an indication from the user that the synthesized speech is not intelligible.
-
Citations
20 Claims
-
1. A method of speech synthesis, comprising the steps of:
-
(a) receiving a text input in a text-to-speech system; (b) processing the text input into synthesized speech using a processor of the system; (c) establishing that the synthesized speech is unintelligible; (d) reprocessing the text input into subsequent synthesized speech to correct the unintelligible synthesized speech; and (e) outputting the subsequent synthesized speech to a user via a loudspeaker. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method of speech synthesis, comprising the steps of:
-
(a) receiving a text input in a text-to-speech system; (b) processing the text input into synthesized speech using a processor of the system; (c) predicting intelligibility of the synthesized speech; (d) determining whether the predicted intelligibility from step (c) is lower than a minimum threshold; (e) outputting the synthesized speech to a user via a loudspeaker if the predicted intelligibility is determined to be not lower than the minimum threshold in step (d); (f) adapting a model used in conjunction with processing the text input if the predicted intelligibility is determined to be lower than the minimum threshold in step (d); (g) reprocessing the text input into subsequent synthesized speech; (h) predicting intelligibility of the subsequent synthesized speech; (i) determining whether the predicted intelligibility from step (h) is lower than the minimum threshold; (j) outputting the subsequent synthesized speech to the user via the loudspeaker if the predicted intelligibility is determined to be not lower than the minimum threshold in step (i); and
, otherwise(k) repeating steps (f) through (k). - View Dependent Claims (12, 13, 14, 15, 16)
-
-
17. A method of speech synthesis, comprising the steps of:
-
(a) receiving a text input in a text-to-speech system; (b) processing the text input into synthesized speech using a processor of the system; (c1) outputting the synthesized speech to the user via the loudspeaker; (c2) receiving an indication from the user that the synthesized speech is not intelligible; (d) reprocessing the text input into subsequent synthesized speech to correct the unintelligible synthesized speech; and (e) outputting the subsequent synthesized speech to a user via a loudspeaker. - View Dependent Claims (18, 19, 20)
-
Specification