Information retrieval engine
First Claim
Patent Images
1. A computer-implemented method comprising:
- accepting a file and information corresponding to the file;
associating the file and the information corresponding to the file;
organizing the file into at least one document comprising at least a portion of the file;
associating the file and the document corresponding to the file;
quantizing the document to obtain letters;
grouping the letters to form a set of words, the set being based on a predetermined frequency of occurrence of the grouped letters;
associating each document and the corresponding set of words in an index of documents;
obtaining a set of query words;
identifying one or more documents in the index, each of the identified documents containing at least one query word in the set of query words; and
scoring each of the identified documents, a score for each identified document being based at least in part on a weighting of each query word found in the identified document, the weighting being determined using a local weighting factor and a global weighting factor.
8 Assignments
0 Petitions
Accused Products
Abstract
A system, method, and computer program product retrieve information associated with the signals. The information retrieval can be performed on a signal by quantizing the signal, forming words, and indexing based on weights of the words. The words are formed by grouping letters together to form a number of words within predetermined threshold values. The weights of the words are determined using a binomial log likelihood ratio analysis. The present invention may be applied to identification of an unknown song.
114 Citations
40 Claims
-
1. A computer-implemented method comprising:
-
accepting a file and information corresponding to the file;
associating the file and the information corresponding to the file;
organizing the file into at least one document comprising at least a portion of the file;
associating the file and the document corresponding to the file;
quantizing the document to obtain letters;
grouping the letters to form a set of words, the set being based on a predetermined frequency of occurrence of the grouped letters;
associating each document and the corresponding set of words in an index of documents;
obtaining a set of query words;
identifying one or more documents in the index, each of the identified documents containing at least one query word in the set of query words; and
scoring each of the identified documents, a score for each identified document being based at least in part on a weighting of each query word found in the identified document, the weighting being determined using a local weighting factor and a global weighting factor. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 28, 29, 30, 31, 32, 33, 34, 35, 37, 38, 39)
-
-
27. (canceled)
-
36. (canceled)
-
40-112. -112. (canceled)
Specification