Method and apparatus for searching multimedia data using speech recognition in mobile device
First Claim
Patent Images
1. A method of searching music using speech recognition, the method comprising:
- recognizing as a phoneme sequence a speech signal uttered by a user; and
searching music information by performing partial symbol matching between the recognized phoneme sequence and a standard pronunciation sequence.
1 Assignment
0 Petitions
Accused Products
Abstract
A method of searching music using speech recognition in a mobile device, the method including: recognizing a speech signal uttered by a user as a phoneme sequence; and searching music information by performing partial symbol matching between the recognized phoneme sequence and a standard pronunciation sequence.
98 Citations
23 Claims
-
1. A method of searching music using speech recognition, the method comprising:
-
recognizing as a phoneme sequence a speech signal uttered by a user; and searching music information by performing partial symbol matching between the recognized phoneme sequence and a standard pronunciation sequence. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer-readable recording medium in which a program for executing a method of searching music using speech recognition is recorded, the method comprising:
-
recognizing as a phoneme sequence a speech signal uttered by a user; and searching music information by performing partial symbol matching between the recognized phoneme sequence and a standard pronunciation sequence.
-
-
9. A music search apparatus comprising:
-
a music database storing a pronunciation dictionary with respect to music and music information; a phoneme decoding unit decoding a speech signal into a candidate phoneme sequence; a matching unit matching the candidate phoneme sequence with a reference phoneme pattern in the pronunciation dictionary with respect to the music information; a calculation unit calculating a match score according to a result of the matching; and a display unit displaying a music information search result according to the calculated match score. - View Dependent Claims (10, 11, 12, 13, 14, 15)
-
-
16. A music search apparatus comprising:
-
a feature extraction unit extracting a feature vector sequence of a speech signal of an input speech query; a phoneme decoding unit decoding the extracted feature vector sequence into at least one candidate phoneme sequences; a matching unit partially matching a candidate phoneme sequence with a reference pattern included in a stored lexicon by matching the candidate phoneme sequence with the reference pattern using a phoneme confusion matrix and linguistic constraints and, after the partial matching, matching a converted pronunciation sequence with a reference phoneme pattern of the lexicon so as to overcome an inconsistency due to a difference in pronunciation caused by palatalization; and a calculation unit calculating a match score according to the match score using a probability value of the phoneme confusion matrix and considering probabilities of insertion and deletion of the phoneme. - View Dependent Claims (17, 18, 19, 20, 21, 22, 23)
-
Specification