Sound rate modification
First Claim
Patent Images
1. A method implemented by at least one computing device, the method comprising:
- receiving, as a user input, by the at least one computing device, an indication of an amount of time in which sound data is to be output, the sound data including a waveform representation and a plurality of portions, the indicated amount of time being different from an unmodified amount of time for playback of the sound data;
identifying, by the at least one computing device, at least one active portion and at least one inactive portion of the plurality of portions of the sound data based on spectral characteristics of the sound data, the at least one active portion containing multiple different units of speech, the at least one inactive portion corresponding to pauses in speech;
modifying, by the at least one computing device, the sound data to be output in the indicated amount of time using a set of sound rate rules generated to capture sound rate characteristics of units of speech in a natural language model by;
calculating different relative rates at which the multiple different units of speech are to be output, respectively, based on the set of sound rate rules and the indicated amount of time,applying a first calculated rate to a first unit of speech in the active portion to cause the first unit of speech to be output at the first calculated rate, andapplying a second different calculated rate to a second unit of speech in the active portion to cause the second unit of speech to be output at the second different calculated rate; and
outputting, by the at least one computing device, the sound data as modified by the first calculated rate and the second different calculated rate in the indicated amount of time.
3 Assignments
0 Petitions
Accused Products
Abstract
Sound rate modification techniques are described. In one or more implementations, an indication is received of an amount that a rate of output of sound data is to be modified. One or more sound rate rules are applied to the sound data that, along with the received indication, are usable to calculate different rates at which different portions of the sound data are to be modified, respectively. The sound data is then output such that the calculated rates are applied.
232 Citations
20 Claims
-
1. A method implemented by at least one computing device, the method comprising:
-
receiving, as a user input, by the at least one computing device, an indication of an amount of time in which sound data is to be output, the sound data including a waveform representation and a plurality of portions, the indicated amount of time being different from an unmodified amount of time for playback of the sound data; identifying, by the at least one computing device, at least one active portion and at least one inactive portion of the plurality of portions of the sound data based on spectral characteristics of the sound data, the at least one active portion containing multiple different units of speech, the at least one inactive portion corresponding to pauses in speech; modifying, by the at least one computing device, the sound data to be output in the indicated amount of time using a set of sound rate rules generated to capture sound rate characteristics of units of speech in a natural language model by; calculating different relative rates at which the multiple different units of speech are to be output, respectively, based on the set of sound rate rules and the indicated amount of time, applying a first calculated rate to a first unit of speech in the active portion to cause the first unit of speech to be output at the first calculated rate, and applying a second different calculated rate to a second unit of speech in the active portion to cause the second unit of speech to be output at the second different calculated rate; and outputting, by the at least one computing device, the sound data as modified by the first calculated rate and the second different calculated rate in the indicated amount of time. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system comprising:
at least one module implemented at least partially in hardware and configured to; receive input specifying a time period over which sound data is to be output, the sound data including a plurality of portions; identify at least one active portion and at least one inactive portion of the plurality of portions of the sound data based on spectral characteristics of the sound data, the at least one active portion containing multiple different units of speech, the at least one inactive portion corresponding to pauses in speech; modify the sound data using a set of sound rate rules that reflect a natural language model rule to the sound data by; calculating different relative rates at which the different units of speech are to be output, respectively, based on the set of sound rate rules; applying a first calculated rate to a first unit of speech in the active portion to cause the first unit of speech to be output at the first calculated rate; and applying a second different calculated rate to a second unit of speech in the active portion to cause the second unit of speech to be output at the second different calculated rate; and output the sound data as modified by the first calculated rate and the second different calculated rate over the specified time period. - View Dependent Claims (12, 13, 14, 15)
-
16. At least one computer-readable storage medium having instructions stored thereon that, responsive to execution on a computing device, causes the computing device to perform operations comprising:
-
receiving input specifying a time period over which sound data is to be output, the sound data including a plurality of portions; identifying at least one active portion and at least one inactive portion of the plurality of portions of the sound data based on spectral characteristics of the sound data, the at least one active portion containing multiple different units of speech, the at least one inactive portion corresponding to pauses in speech; modifying the sound data using a set of sound rate rules that reflect a natural language model rule to the sound data by; calculating different relative rates at which the different units of speech are to be output, respectively, based on the set of sound rate rules to enable the sound data to be output within the specified period of time; applying a first calculated rate to a first unit of speech in the active portion to cause the first unit of speech to be output at the first calculated rate; and applying a second different calculated rate to a second unit of speech in the active portion to cause the second unit of speech to be output at the second different calculated rate; and outputting the sound data as modified by the first calculated rate and the second different calculated rate over the specified time period. - View Dependent Claims (17, 18, 19, 20)
-
Specification