ANALYZING AUDIO INPUT FOR EFFICIENT SPEECH AND MUSIC RECOGNITION
First Claim
1. A method for analyzing audio input, the method comprising:
- at an electronic device;
receiving an audio input;
determining whether the audio input includes music;
determining whether the audio input includes speech;
in response to determining that the audio input includes music, generating an acoustic fingerprint representing a portion of the audio input that includes music; and
in response to determining that the audio input includes speech rather than music, identifying an end-point of a speech utterance of the audio input.
1 Assignment
0 Petitions
Accused Products
Abstract
Systems and processes for analyzing audio input for efficient speech and music recognition are provided. In one example process, an audio input can be received. A determination can be made as to whether the audio input includes music. In addition, a determination can be made as to whether the audio input includes speech. In response to determining that the audio input includes music, an acoustic fingerprint representing a portion of the audio input that includes music is generated. In response to determining that the audio input includes speech rather than music, an end-point of a speech utterance of the audio input is identified.
-
Citations
21 Claims
-
1. A method for analyzing audio input, the method comprising:
at an electronic device; receiving an audio input; determining whether the audio input includes music; determining whether the audio input includes speech; in response to determining that the audio input includes music, generating an acoustic fingerprint representing a portion of the audio input that includes music; and in response to determining that the audio input includes speech rather than music, identifying an end-point of a speech utterance of the audio input. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
20. A non-transitory computer-readable storage medium comprising instructions for causing one or more processor to:
-
receive audio input; determine whether the audio input includes music; determine whether the audio input includes speech; responsive to determining that the audio input includes music, generate an acoustic fingerprint representing a portion of the audio input that includes music; and responsive to determining that the audio input includes speech rather than music, identify an end-point of a speech utterance of the audio input.
-
-
21. An electronic device, comprising:
-
one or more processors; memory; one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs including instructions for; receiving audio input; determining whether the audio input includes music; determining whether the audio input includes speech; responsive to determining that the audio input includes music, generating an acoustic fingerprint representing a portion of the audio input that includes music; and responsive to determining that the audio input includes speech rather than music, identifying an end-point of a speech utterance of the audio input.
-
Specification