Systems and methods for cross-lingual audio search
First Claim
1. An apparatus comprising:
- at least one processor; and
- a computer readable storage medium having computer readable program code embodied therewith and executable by the at least one processor, the computer readable program code comprising:
  - computer readable program code configured to accept a search query in a first language variety, the search query being in a form of at least one of: text and audio;
  - computer readable program code configured to access a corpus of material in the first language variety;
  - computer readable program code configured to determine similarity of a second language variety with respect to a first language variety;
  - computer readable program code configured to choose the second language variety based on determining that the first language variety baseforms can be obtained via data from the second language variety, and at least one selection criterion;
  - the at least one selection criterion comprising a ranking from among ranked pairs of language varieties, the ranked pairs being ranked on a basis of determined similarity between language varieties;
  - computer readable program code configured to obtain first language variety baseforms via data obtained from the second language variety;
  - computer readable program code configured to thereupon build a first language variety phonetic model, based on the first language variety baseforms obtained via data obtained from the second language variety; and
  - computer readable program code configured to employ the first language variety phonetic model and the second language variety in executing an audio search based on the accepted search query.
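The similarity and selection elements of the claim can be illustrated with a minimal sketch. The claim does not say how similarity is measured, so the Jaccard overlap of phone inventories used here is an assumption, as are the variety names `A`, `B`, `C`, the toy inventories, and the helper names `rank_pairs` and `choose_second_variety`.

```python
from itertools import combinations

# Toy phone inventories for three hypothetical language varieties.
PHONE_INVENTORIES = {
    "A": {"a", "i", "u", "k", "g", "t", "d", "n", "m", "r", "s", "h"},
    "B": {"a", "i", "u", "k", "g", "t", "d", "n", "m", "r", "s", "h", "q", "z"},
    "C": {"a", "i", "u", "k", "g", "t", "d", "n", "m", "s", "z", "f", "v", "w"},
}


def jaccard(a, b):
    """Assumed similarity measure: shared phones over all phones."""
    return len(a & b) / len(a | b)


def rank_pairs(inventories):
    """Return (score, variety_x, variety_y) tuples, most similar pair first."""
    pairs = [
        (jaccard(inventories[x], inventories[y]), x, y)
        for x, y in combinations(sorted(inventories), 2)
    ]
    return sorted(pairs, reverse=True)


def choose_second_variety(first, inventories, has_baseform_data):
    """Pick the best-ranked partner of `first` that can supply baseform data."""
    for score, x, y in rank_pairs(inventories):
        if first not in (x, y):
            continue
        other = y if x == first else x
        if other in has_baseform_data:
            return other, score
    return None


if __name__ == "__main__":
    print(rank_pairs(PHONE_INVENTORIES))
    print(choose_second_variety("A", PHONE_INVENTORIES, {"B", "C"}))
```

In this sketch the ranking is computed once over all pairs, and the availability check (`has_baseform_data`) stands in for the claim's condition that first-variety baseforms can be obtained via data from the chosen second variety.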
Abstract
Methods and arrangements for executing an audio search. A search query in a first language variety is accepted, the search query being in a form of at least one of: text and audio. A corpus of material in the first language variety is accessed, and first language variety baseforms are obtained via data obtained from a second language variety. A first language variety phonetic model is built, and the first language variety phonetic model and the second language variety are employed in executing an audio search based on the accepted search query.
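The final search step of the abstract can be sketched as approximate matching over phone sequences. This is only an illustration under stated assumptions: the abstract does not specify a matching method, so the sliding-window edit distance, the `max_dist` threshold, the toy corpus, and the function names are all hypothetical. The corpus is assumed to have already been decoded into phone sequences, and the query is assumed to have already been converted to phones.

```python
def edit_distance(a, b):
    """Levenshtein distance between two phone sequences."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        cur = [i]
        for j, y in enumerate(b, 1):
            cur.append(min(prev[j] + 1,              # deletion
                           cur[j - 1] + 1,           # insertion
                           prev[j - 1] + (x != y)))  # substitution or match
        prev = cur
    return prev[-1]


def search(query_phones, corpus, max_dist=1):
    """Return (distance, clip_id, start_index) for windows close to the query."""
    hits = []
    n = len(query_phones)
    for clip_id, phones in corpus.items():
        # Clips shorter than the query produce an empty range and are skipped.
        for start in range(len(phones) - n + 1):
            d = edit_distance(query_phones, phones[start:start + n])
            if d <= max_dist:
                hits.append((d, clip_id, start))
    return sorted(hits)


# Hypothetical decoded corpus: clip id -> phone sequence.
CORPUS = {
    "clip1": ["k", "a", "t", "a", "m", "i"],
    "clip2": ["s", "a", "l"],
}

if __name__ == "__main__":
    print(search(["t", "a", "m", "i"], CORPUS))  # exact phone match
    print(search(["t", "a", "m", "e"], CORPUS))  # one phone off
```

Allowing a small edit distance is one way to tolerate the phone mismatches that could arise when pronunciations are derived from a different language variety.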
12 Claims
2. A computer program product comprising:
- a non-transitory computer readable storage medium having computer readable program code embodied therewith, the computer readable program code comprising:
  - computer readable program code configured to accept a search query in a first language variety, the search query being in a form of at least one of: text and audio;
  - computer readable program code configured to access a corpus of material in the first language variety;
  - computer readable program code configured to determine similarity of a second language variety with respect to a first language variety;
  - computer readable program code configured to choose the second language variety based on determining that the first language variety baseforms can be obtained via data from the second language variety, and at least one selection criterion;
  - the at least one selection criterion comprising a ranking from among ranked pairs of language varieties, the ranked pairs being ranked on a basis of determined similarity between language varieties;
  - computer readable program code configured to obtain first language variety baseforms via data obtained from the second language variety;
  - computer readable program code configured to thereupon build a first language variety phonetic model, based on the first language variety baseforms obtained via data obtained from the second language variety; and
  - computer readable program code configured to employ the first language variety phonetic model and the second language variety in executing an audio search based on the accepted search query.
Dependent claims: 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
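The baseform and phonetic-model elements recited above can be illustrated with a small sketch. The claim does not state how baseforms are derived, so this example assumes a grapheme-to-phone table borrowed from the second language variety and applied by greedy longest match, and it treats the "phonetic model" as a simple pronunciation lexicon. The table `SECOND_VARIETY_G2P`, its phone symbols, the nonce words, and the function names are all hypothetical.

```python
# Hypothetical grapheme-to-phone rules taken from the second language variety.
SECOND_VARIETY_G2P = {
    "sh": "SH", "a": "AH", "i": "IY",
    "k": "K", "m": "M", "s": "S", "t": "T", "h": "HH",
}


def baseform(word, g2p):
    """Derive a phone sequence for `word` by greedy longest-match lookup."""
    phones, i = [], 0
    max_len = max(len(g) for g in g2p)
    while i < len(word):
        # Try the longest grapheme chunk first, then shorter ones.
        for size in range(min(max_len, len(word) - i), 0, -1):
            chunk = word[i:i + size]
            if chunk in g2p:
                phones.append(g2p[chunk])
                i += size
                break
        else:
            raise ValueError(f"no rule for {word[i]!r} in {word!r}")
    return phones


def build_lexicon(words, g2p):
    """Assumed stand-in for the phonetic model: word -> baseform."""
    return {w: baseform(w, g2p) for w in words}


if __name__ == "__main__":
    # Nonce first-variety vocabulary, written in the same script as the table.
    print(build_lexicon(["shati", "mika"], SECOND_VARIETY_G2P))
```

A lexicon built this way could supply the query phone sequence for a text query. Words containing graphemes the borrowed table does not cover raise an error here rather than guessing a pronunciation.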
Specification