SPEECH-DRIVEN SELECTION OF AN AUDIO FILE
Abstract
A system and method for detecting a refrain in an audio file having vocal components. The method and system include generating a phonetic transcription of a portion of the audio file, analyzing the phonetic transcription, and identifying a vocal segment in the generated phonetic transcription that is repeated frequently. The method and system further relate to speech-driven selection of an audio file based on the similarity between the detected refrain and the user input.
25 Claims
1. A method for detecting a refrain in an audio file having vocal components, the method comprising:
generating a phonetic transcription of at least a portion of the audio file; and
identifying a vocal segment in the generated phonetic transcription, which vocal segment is repeated at least once.
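
The identifying step of claim 1 can be illustrated with a minimal sketch. It assumes the phonetic transcription is already available as a list of phoneme symbols and that repetitions match exactly; the function name `find_repeated_segment` and the `min_len` parameter are hypothetical and do not come from the patent. A practical system would likely need approximate matching, since automatic transcriptions of sung vocals are noisy.

```python
from typing import Dict, List, Optional, Tuple


def find_repeated_segment(
    phonemes: List[str], min_len: int = 3
) -> Optional[Tuple[Tuple[str, ...], List[int]]]:
    """Return the longest phoneme sequence that occurs at least twice
    without overlapping itself, together with its start indices.

    Assumption: exact matching on phoneme symbols (illustrative only).
    """
    n = len(phonemes)
    # Try the longest candidate length first so the first hit is the longest.
    for length in range(n // 2, min_len - 1, -1):
        positions: Dict[Tuple[str, ...], List[int]] = {}
        for start in range(n - length + 1):
            key = tuple(phonemes[start:start + length])
            positions.setdefault(key, []).append(start)
        for key, starts in positions.items():
            # Keep only occurrences that do not overlap the previous one.
            kept: List[int] = []
            for s in starts:
                if not kept or s >= kept[-1] + length:
                    kept.append(s)
            if len(kept) >= 2:
                return key, kept
    return None


if __name__ == "__main__":
    # Toy transcription: the segment x-y-z plays the role of the refrain.
    transcription = ["a", "b", "x", "y", "z", "c", "d", "x", "y", "z", "e"]
    print(find_repeated_segment(transcription))
```

A segment that is "repeated at least once" occurs at least twice in total, which is why the sketch requires two non-overlapping occurrences.
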
7. A method for processing an audio file having at least vocal components, the method comprising:
detecting a refrain of the audio file;
generating either or both a phonetic or acoustic representation of the refrain; and
storing the generated phonetic or acoustic representation together with the audio file.
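
The storing step of claim 7 could be sketched as follows. The claim does not say how the representation is kept "together with the audio file", so this example assumes a JSON sidecar file placed next to the audio file; the file naming scheme, the JSON keys, and both function names are hypothetical.

```python
import json
from pathlib import Path
from typing import List


def store_refrain_representation(audio_path: Path, phonemes: List[str]) -> Path:
    """Write the refrain's phonetic representation to a sidecar file.

    Assumption: a '<audio file name>.refrain.json' file in the same directory
    stands in for storing the representation with the audio file.
    """
    sidecar = audio_path.with_name(audio_path.name + ".refrain.json")
    payload = {"audio_file": audio_path.name, "refrain_phonemes": phonemes}
    sidecar.write_text(json.dumps(payload), encoding="utf-8")
    return sidecar


def load_refrain_representation(audio_path: Path) -> List[str]:
    """Read back the phoneme list stored for the given audio file."""
    sidecar = audio_path.with_name(audio_path.name + ".refrain.json")
    return json.loads(sidecar.read_text(encoding="utf-8"))["refrain_phonemes"]
```
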
12. A method of speech-driven selection of an audio file from a plurality of audio files in an audio player, each of the audio files having at least vocal components, the method comprising:
detecting a refrain in each of the audio files of the plurality of audio files;
determining either or both phonetic or acoustic representations of at least part of a refrain of each of the audio files;
supplying each of the phonetic or acoustic representations to a speech recognition unit;
comparing the phonetic or acoustic representations to the voice command of the user of the audio player; and
selecting an audio file based on the best matching result of the comparison.
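
The comparing and selecting steps of claim 12 can be sketched as below. This is an illustration under stated assumptions, not the patent's recognizer: both the refrains and the voice command are assumed to be available as phoneme lists, and similarity is measured with `difflib.SequenceMatcher` from the Python standard library as a stand-in for the speech recognition unit. The function name `select_audio_file` and the sample file names are hypothetical.

```python
from difflib import SequenceMatcher
from typing import Dict, List


def select_audio_file(refrains: Dict[str, List[str]], command: List[str]) -> str:
    """Return the name of the audio file whose refrain best matches the command.

    Assumption: similarity is the SequenceMatcher ratio between phoneme lists,
    used here only as a simple substitute for a real recognizer score.
    """
    def score(name: str) -> float:
        return SequenceMatcher(None, refrains[name], command).ratio()

    # max() raises ValueError if no audio files are supplied.
    return max(refrains, key=score)


if __name__ == "__main__":
    refrains = {
        "song_a.mp3": ["l", "eh", "t", "ih", "t", "b", "iy"],
        "song_b.mp3": ["y", "eh", "s", "t", "er", "d", "ey"],
    }
    # The spoken command is a slightly imperfect rendering of song_a's refrain.
    spoken = ["l", "eh", "t", "ih", "b", "iy"]
    print(select_audio_file(refrains, spoken))
```

Taking the maximum score corresponds to selecting on the best matching result; a deployed system might additionally reject the command when even the best score is low.
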
23. A system for detecting a refrain in an audio file having at least vocal components, the system comprising:
a phonetic transcription unit that generates a phonetic transcription of at least a portion of the audio file;
an analyzing unit that identifies vocal segments within the phonetic transcription that are repeated at least once.
24. A system for processing an audio file having at least vocal components, the system comprising:
a detecting unit that detects the refrain of the audio file;
a transcription unit that generates a phonetic or acoustic representation of the refrain; and
a control unit that stores the phonetic or acoustic representation linked to the audio data.
25. A system for a speech-driven selection of an audio file comprising:
a refrain detecting unit that detects the refrain of an audio file;
a transcription unit that generates a phonetic or acoustic representation of the detected refrain;
a speech recognition unit that compares the phonetic or acoustic representation to the voice command of the user selecting the audio file and that determines the best matching result of the comparison; and
a control unit that selects the audio file in accordance with the result of the comparison.
Specification