ADJUSTING SPEED OF HUMAN SPEECH PLAYBACK
Abstract
A system configured to vary a speech speed of speech represented in input audio data without changing a pitch of the speech. The system may vary the speech speed based on a number of different inputs, including non-audio data, data associated with a command, or data associated with the voice message itself. The non-audio data may correspond to information about an account, device or user, such as user preferences, calendar entries, location information, etc. The system may analyze audio data associated with the command to determine command speech speed, identity of person listening, etc. The system may analyze the input audio data to determine a message speech speed, background noise level, identity of the person speaking, etc. Using all of these inputs, the system may dynamically determine a target speech speed and may generate output audio data having the target speech speed.
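The abstract does not name a technique for changing speech speed without changing pitch. As an illustration only, the sketch below uses plain overlap-add (OLA) time-scale modification, a textbook technique swapped in here and not taken from this patent. The function name `time_stretch_ola` and the `frame` and `synth_hop` defaults are assumptions. Frames are read from the input at a hop scaled by the speed factor and written at a fixed hop, so the duration changes while each frame keeps its original waveform. Plain OLA can produce audible artifacts; practical systems typically use refinements such as WSOLA or a phase vocoder.

```python
import math


def hann(n):
    """Periodic Hann window of length n."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * i / n) for i in range(n)]


def time_stretch_ola(samples, speed, frame=400, synth_hop=100):
    """Return samples played `speed` times faster (speed > 1) or slower (speed < 1).

    Illustrative overlap-add only: input frames are taken every
    synth_hop * speed samples and overlap-added every synth_hop samples.
    The signal is never resampled, so local pitch is roughly preserved.
    """
    if speed <= 0:
        raise ValueError("speed must be positive")
    window = hann(frame)
    analysis_hop = synth_hop * speed
    n_frames = max(1, int((len(samples) - frame) / analysis_hop) + 1)
    out_len = (n_frames - 1) * synth_hop + frame
    out = [0.0] * out_len
    norm = [0.0] * out_len  # summed window weights, used to normalize overlaps
    for k in range(n_frames):
        src = int(round(k * analysis_hop))  # read position in the input
        dst = k * synth_hop                 # write position in the output
        for i in range(frame):
            if src + i >= len(samples):
                break
            out[dst + i] += samples[src + i] * window[i]
            norm[dst + i] += window[i]
    return [o / w if w > 1e-8 else 0.0 for o, w in zip(out, norm)]
```

For example, calling `time_stretch_ola(samples, 2.0)` on one second of audio yields roughly half a second of output, and a factor of `0.5` yields roughly two seconds.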
40 Claims
1-20. (canceled)
21. A computer-implemented method, comprising:
receiving input audio data representing input speech;
determining an input speech speed corresponding to the input audio data;
determining output data based at least on the input audio data;
determining a target output speed based at least on the input speech speed;
using the output data to generate output audio data representing output speech, the output speech corresponding to the target output speed; and
causing a device to output the output audio data.
Dependent claims: 22-31.
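The speed-related steps of claim 21 can be sketched as follows. This is a minimal illustration under stated assumptions, not the claimed implementation: speech speed is measured in words per minute from a transcript, and the `Message` type, the function names, the 160 words-per-minute preference and the 50% change limit are all hypothetical. The target speed moves toward a preferred rate but is bounded relative to the input speech speed, so it depends on that speed as the claim requires.

```python
from dataclasses import dataclass


@dataclass
class Message:
    """Hypothetical stand-in for input audio data plus its recognized words."""
    transcript: str
    duration_s: float


def input_speech_speed(msg):
    """Input speech speed in words per minute."""
    words = len(msg.transcript.split())
    return 60.0 * words / msg.duration_s


def target_output_speed(input_wpm, preferred_wpm=160.0, max_change=0.5):
    """Move toward the preferred rate, limited to +/- max_change of the input speed."""
    low = input_wpm * (1.0 - max_change)
    high = input_wpm * (1.0 + max_change)
    return min(max(preferred_wpm, low), high)


def playback_factor(input_wpm, target_wpm):
    """Speed factor to apply to the audio (> 1 speeds up, < 1 slows down)."""
    return target_wpm / input_wpm


msg = Message(transcript="word " * 50, duration_s=12.0)
wpm = input_speech_speed(msg)            # 250 words per minute
target = target_output_speed(wpm)        # 160 words per minute
factor = playback_factor(wpm, target)    # about 0.64, so playback is slowed
output_duration_s = msg.duration_s / factor
```

The resulting factor would then be passed to a pitch-preserving time-scale modification step to generate the output audio data.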
32. A system comprising:
at least one processor; and
memory including instructions operable to be executed by the at least one processor to configure the system to:
receive input audio data representing input speech;
determine an input speech speed corresponding to the input audio data;
determine output data based at least on the input audio data;
determine a target output speed based at least on the input speech speed;
using the output data to generate output audio data representing output speech, the output speech corresponding to the target output speed; and
cause a device to output the output audio data.
Dependent claims: 33-40.
Specification