Audio verification
First Claim
1. A system of audio verification comprising:
- a processor; and
memory comprising processor-executable instructions that when executed by the processor cause implementation of an audio generation component configured to:
identify an audio signal comprising a code for user verification;
extract one or more audio segments in real-time from an on-going audio stream;
create a second audio signal comprising speech based upon the one or more audio segments extracted in real-time from the on-going audio stream;
identify a pitch and a volume of the audio signal at a first time;
identify a second pitch and a second volume of the second audio signal at a second time, wherein the pitch of the audio signal and the second pitch of the second audio signal are not within a threshold pitch similarity, and wherein the volume of the audio signal and the second volume of the second audio signal are not within a threshold volume similarity;
determine an average pitch between the pitch of the audio signal and the second pitch of the second audio signal;
determine an average volume between the volume of the audio signal and the second volume of the second audio signal;
alter the pitch of the audio signal and the second pitch of the second audio signal to be the average pitch at a third time;
alter the volume of the audio signal and the second volume of the second audio signal to be the average volume at a fourth time;
combine the audio signal and the second audio signal to generate a verification audio signal in response to:
determining that the pitch of the audio signal at the third time and the second pitch of the second audio signal at the third time are both the average pitch so that a bot is unable to discern a difference between the pitch of the audio signal and the second pitch of the second audio signal when the audio signal and the second audio signal are combined; and
determining that the volume of the audio signal at the fourth time and the second volume of the second audio signal at the fourth time are both the average volume so that a bot is unable to discern a difference between the volume of the audio signal and the second volume of the second audio signal when the audio signal and the second audio signal are combined;
present the verification audio signal to a user for the user verification, the user verification comprising verifying that the user is human; and
verify whether the user has access to content or a service based upon user input, obtained in response to the verification audio signal, matching the code within the verification audio signal.
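The averaging steps recited in claim 1 can be illustrated with a minimal sketch. It assumes each signal has already been reduced to one scalar pitch estimate and one scalar volume estimate; the `Features` type, the function names, the units, and the tolerance are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass


@dataclass
class Features:
    """Hypothetical scalar summary of a signal (not defined by the claim)."""
    pitch_hz: float
    volume_db: float


def average_features(code: Features, speech: Features) -> Features:
    """Average pitch and average volume between the two signals."""
    return Features(
        pitch_hz=(code.pitch_hz + speech.pitch_hz) / 2,
        volume_db=(code.volume_db + speech.volume_db) / 2,
    )


def ready_to_combine(code: Features, speech: Features, target: Features,
                     tol: float = 1e-9) -> bool:
    """True only when both signals sit at the average pitch and average volume."""
    return all(
        abs(value - goal) <= tol
        for value, goal in (
            (code.pitch_hz, target.pitch_hz),
            (speech.pitch_hz, target.pitch_hz),
            (code.volume_db, target.volume_db),
            (speech.volume_db, target.volume_db),
        )
    )


# Assumed example values: the code signal is higher and louder than the speech.
code_signal = Features(pitch_hz=220.0, volume_db=-12.0)
speech_signal = Features(pitch_hz=180.0, volume_db=-20.0)
target = average_features(code_signal, speech_signal)

# "Altering" is modelled here as simply assigning the target values.
code_altered = Features(target.pitch_hz, target.volume_db)
speech_altered = Features(target.pitch_hz, target.volume_db)
```

In this sketch the combination step would be gated on `ready_to_combine` returning true for the altered signals. A real system would need actual pitch shifting and gain adjustment of the audio, which the sketch does not attempt.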
Abstract
One or more techniques and/or systems are provided for audio verification. An audio signal, comprising a code for user verification, may be identified. A second audio signal is created comprising speech. The audio signal and the second audio signal may be altered to comprise a same or similar volume, pitch, amplitude, and/or speech rate. The audio signal and the second audio signal may be combined to generate a verification audio signal. The verification audio signal may be presented to a user for the user verification. Verification may be performed to determine whether the user has access to content or a service based upon user input, obtained in response to the user verification audio signal, matching the code within the user verification audio signal. In an example, the user verification may comprise verifying that the user is human.
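As a rough illustration of the last two stages described in the abstract (combining the two signals, then checking the user's input against the code), the following sketch may help. It assumes audio is held as lists of float samples in the range -1.0 to 1.0 and that the code is a short string; `mix` and `verify` are hypothetical names, and the whitespace and case normalization is an assumption rather than something the abstract specifies.

```python
import hmac


def mix(code_samples: list[float], speech_samples: list[float]) -> list[float]:
    """Sum two sample sequences, zero-padding the shorter and clamping to [-1, 1]."""
    length = max(len(code_samples), len(speech_samples))
    padded_code = code_samples + [0.0] * (length - len(code_samples))
    padded_speech = speech_samples + [0.0] * (length - len(speech_samples))
    return [
        max(-1.0, min(1.0, a + b))
        for a, b in zip(padded_code, padded_speech)
    ]


def verify(user_input: str, expected_code: str) -> bool:
    """Grant access only if the user's input matches the code in the audio."""
    # Assumption: ignore whitespace and letter case in what the user typed.
    normalized = "".join(user_input.split()).upper()
    return hmac.compare_digest(
        normalized.encode("utf-8"), expected_code.upper().encode("utf-8")
    )
```

For example, `verify(" 4 7 2 9 ", "4729")` accepts the input, while `verify("4728", "4729")` rejects it. `hmac.compare_digest` is used so that the comparison time does not depend on where the strings differ.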
20 Claims
1. A system of audio verification, recited in full under First Claim above.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13
14. A method of audio verification comprising:
identifying an audio signal comprising a code for user verification;
extracting one or more audio segments in real-time from an on-going audio stream;
creating a second audio signal comprising speech based upon the one or more audio segments extracted in real-time from the on-going audio stream;
identifying a speaking rate and an amplitude of the audio signal at a first time;
identifying a second speaking rate and a second amplitude of the second audio signal at a second time, wherein the speaking rate of the audio signal and the second speaking rate of the second audio signal are not within a threshold speaking rate similarity, and wherein the amplitude of the audio signal and the second amplitude of the second audio signal are not within a threshold amplitude similarity;
altering the speaking rate of the audio signal and the second speaking rate of the second audio signal by altering the speaking rate to be more similar to the second speaking rate at the second time and altering the second speaking rate to be more similar to the speaking rate at the first time until the speaking rate and the second speaking rate are within the threshold speaking rate similarity at a third time;
altering the amplitude of the audio signal and the second amplitude of the second audio signal by altering the amplitude to be more similar to the second amplitude at the second time and altering the second amplitude to be more similar to the amplitude at the first time until the amplitude and the second amplitude are within the threshold amplitude similarity at a fourth time;
combining the audio signal and the second audio signal to generate a verification audio signal in response to:
determining that the speaking rate of the audio signal at the third time and the second speaking rate of the second audio signal at the third time are within the threshold speaking rate similarity so that a bot is unable to discern a difference between the speaking rate of the audio signal and the second speaking rate of the second audio signal when the audio signal and the second audio signal are combined; and
determining that the amplitude of the audio signal at the fourth time and the second amplitude of the second audio signal at the fourth time are within the threshold amplitude similarity so that a bot is unable to discern a difference between the amplitude of the audio signal and the second amplitude of the second audio signal when the audio signal and the second audio signal are combined;
presenting the verification audio signal to a user for the user verification, the user verification comprising verifying that the user is human; and
verifying whether the user has access to content or a service based upon user input, obtained in response to the verification audio signal, matching the code within the verification audio signal.
Dependent claims: 15, 16, 17, 18
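Claim 14 differs from claim 1 in that the two values are moved toward each other until they fall within a threshold, rather than being set to their average. A minimal sketch of one way to do that follows, assuming each property (speaking rate or amplitude) is a single scalar. The `converge` function, the `step` fraction, and the iteration cap are hypothetical choices, not part of the claim.

```python
def converge(first: float, second: float, threshold: float,
             step: float = 0.25, max_iters: int = 100) -> tuple[float, float]:
    """Move two values toward each other until they differ by at most threshold.

    Each iteration shifts both values by `step` times the current gap, so the
    gap shrinks by a factor of (1 - 2 * step) per iteration.
    """
    if not 0 < step <= 0.5:
        raise ValueError("step must be in (0, 0.5]")
    for _ in range(max_iters):
        if abs(first - second) <= threshold:
            return first, second
        gap = second - first
        first, second = first + step * gap, second - step * gap
    raise RuntimeError("values did not converge within max_iters")


# Assumed example: speaking rates in syllables per second.
rate_code, rate_speech = converge(4.0, 2.0, threshold=0.3)
```

With the default `step` of 0.25 the gap halves on each pass, so the example values end at 3.125 and 2.875 after three adjustments. Setting `step=0.5` moves both values to the midpoint in a single pass, which matches the averaging behaviour of claim 1.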
19. A system of audio verification comprising:
a processor; and
memory comprising processor-executable instructions that when executed by the processor cause implementation of an audio generation component configured to:
identify an audio signal comprising a code for user verification;
extract one or more audio segments in real-time from an on-going audio stream;
create a second audio signal comprising speech based upon the one or more audio segments extracted in real-time from the on-going audio stream;
identify at least two of a pitch, an amplitude, a volume or a speaking rate of the audio signal;
identify at least two of a second pitch, a second amplitude, a second volume or a second speaking rate of the second audio signal, wherein at least two of the pitch of the audio signal and the second pitch of the second audio signal are not within a threshold pitch similarity, the volume of the audio signal and the second volume of the second audio signal are not within a threshold volume similarity, the amplitude of the audio signal and the second amplitude of the second audio signal are not within a threshold amplitude similarity or the speaking rate of the audio signal and the second speaking rate of the second audio signal are not within a threshold speaking rate similarity;
at least two of:
alter at least one of the pitch of the audio signal or the second pitch of the second audio signal until the pitch and the second pitch are within the threshold pitch similarity;
alter at least one of the volume of the audio signal or the second volume of the second audio signal until the volume and the second volume are within the threshold volume similarity;
alter at least one of the amplitude of the audio signal or the second amplitude of the second audio signal until the amplitude and the second amplitude are within the threshold amplitude similarity; or
alter at least one of the speaking rate of the audio signal or the second speaking rate of the second audio signal until the speaking rate and the second speaking rate are within the threshold speaking rate similarity;
combine the audio signal and the second audio signal to generate a verification audio signal in response to at least two of:
determining that the pitch of the audio signal and the second pitch of the second audio signal are within the threshold pitch similarity so that a bot is unable to discern a difference between the pitch of the audio signal and the second pitch of the second audio signal when the audio signal and the second audio signal are combined;
determining that the volume of the audio signal and the second volume of the second audio signal are within the threshold volume similarity so that a bot is unable to discern a difference between the volume of the audio signal and the second volume of the second audio signal when the audio signal and the second audio signal are combined;
determining that the amplitude of the audio signal and the second amplitude of the second audio signal are within the threshold amplitude similarity so that a bot is unable to discern a difference between the amplitude of the audio signal and the second amplitude of the second audio signal when the audio signal and the second audio signal are combined; or
determining that the speaking rate of the audio signal and the second speaking rate of the second audio signal are within the threshold speaking rate similarity so that a bot is unable to discern a difference between the speaking rate of the audio signal and the second speaking rate of the second audio signal when the audio signal and the second audio signal are combined;
present the verification audio signal to a user for the user verification, the user verification comprising verifying that the user is human; and
verify whether the user has access to content or a service based upon user input, obtained in response to the verification audio signal, matching the code within the verification audio signal.
Dependent claims: 20
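The "at least two of" condition in claim 19 can be sketched as a count over per-property threshold checks. The dictionary layout, property names, threshold values, and function names below are all hypothetical and chosen only for illustration.

```python
# Hypothetical thresholds; the claim does not give numeric values or units.
THRESHOLDS = {
    "pitch": 5.0,
    "volume": 1.0,
    "amplitude": 0.05,
    "speaking_rate": 0.2,
}


def matched_properties(code: dict[str, float], speech: dict[str, float],
                       thresholds: dict[str, float]) -> list[str]:
    """Names of the properties whose two values are within their threshold."""
    return [
        name
        for name, limit in thresholds.items()
        if name in code and name in speech
        and abs(code[name] - speech[name]) <= limit
    ]


def may_combine(code: dict[str, float], speech: dict[str, float],
                thresholds: dict[str, float], required: int = 2) -> bool:
    """Allow combination once at least `required` properties are matched."""
    return len(matched_properties(code, speech, thresholds)) >= required


# Assumed example: pitch and volume are close, the other two are not.
code_props = {"pitch": 200.0, "volume": -16.0,
              "amplitude": 0.5, "speaking_rate": 3.0}
speech_props = {"pitch": 203.0, "volume": -16.5,
                "amplitude": 0.9, "speaking_rate": 4.0}
```

Here `matched_properties(code_props, speech_props, THRESHOLDS)` yields `["pitch", "volume"]`, so `may_combine` permits the combination even though amplitude and speaking rate still differ.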
Specification