Method and system for response time optimization of data query rankings and retrieval
First Claim
1. A computer-implemented method of optimizing a response time for retrieving relevant documents from a set of candidate documents identified in response to a search query where the search query includes one or more terms, the method comprising the steps of:
assigning a term weight to each of the terms of the search query;
associating a document to a relevance score bin based on a total matched term weight computed for the document being based on a sum of the term weights of the terms in the search query that are matched by the document, where a document that matches a first total matched term weight is associated to a more relevant score bin than a document that matches a second total matched term weight that is less than the first total term weight; and
retrieving a set of most relevant documents based on the association to the relevance score bins having a highest relevance score without retrieving other candidate documents.
Abstract
A method and system for optimizing response time for data query rankings and retrieval is provided. In response to a received search query that contains one or more terms, an information retrieval system identifies a candidate set of documents that match any of the terms. Terms are assigned a term weight making them more or less relevant in relation to other terms. A ranking logic defines score bins from a total score range based on possible matched term weights. A relationship is established that classifies a document into a score bin based on a sum of term weights from matched terms. Documents that match more term weights have higher total relevance scores than documents that match less term weights. The most relevant documents are retrievable without having to retrieve the entire set of candidate documents and without having to compute total relevance scores for all the candidate documents.
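As an illustration only, the grouping described in the abstract can be sketched in a few lines of Python. The function name, the integer weights, and the sample documents below are hypothetical and do not come from the patent. For readability the sketch scans every candidate document, whereas the abstract describes avoiding retrieval of the entire candidate set.

```python
from collections import defaultdict


def rank_by_bins(query_weights, doc_terms, top_bins=1):
    """Group candidates by total matched term weight and return the best bins.

    query_weights: term -> weight (hypothetical values chosen by the caller)
    doc_terms:     document id -> set of terms the document contains
    """
    bins = defaultdict(list)
    for doc_id, terms in doc_terms.items():
        # Sum the weights of the query terms that this document matches.
        total = sum(w for t, w in query_weights.items() if t in terms)
        if total > 0:  # a candidate must match at least one query term
            bins[total].append(doc_id)
    # Higher total matched weight means a more relevant bin.
    best = sorted(bins, reverse=True)[:top_bins]
    return [(score, sorted(bins[score])) for score in best]


# Hypothetical query weights and documents, for illustration only.
query_weights = {"oracle": 4, "database": 2, "tuning": 1}
doc_terms = {
    "d1": {"oracle", "database", "tuning"},
    "d2": {"oracle", "database"},
    "d3": {"database", "tuning"},
    "d4": {"oracle", "database", "tuning", "guide"},
    "d5": {"cooking"},
}
top = rank_by_bins(query_weights, doc_terms)
```

Here each distinct total acts as its own bin, which is the simplest possible binning. Documents in the same bin are not ordered against each other, which reflects the point that no exact relevance score is needed to pick the most relevant group.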
38 Citations
23 Claims
-
1. A computer-implemented method of optimizing a response time for retrieving relevant documents from a set of candidate documents identified in response to a search query where the search query includes one or more terms, the method comprising the steps of:
-
assigning a term weight to each of the terms of the search query;
associating a document to a relevance score bin based on a total matched term weight computed for the document being based on a sum of the term weights of the terms in the search query that are matched by the document, where a document that matches a first total matched term weight is associated to a more relevant score bin than a document that matches a second total matched term weight that is less than the first total term weight; and
retrieving a set of most relevant documents based on the association to the relevance score bins having a highest relevance score without retrieving other candidate documents.
-
-
2. The method as set forth in claim 1 further including determining a relevance of a document independently from other candidate documents based on the relevance score bin associated to the document.
-
3. The method as set forth in claim 1 further including determining a set of most relevant documents from the candidate documents based on the relevance score bin associated to the documents without determining an exact relevance score for all the candidate documents.
-
4. The method as set forth in claim 1 wherein assigning the term weight to a term is based on an inverse frequency scoring.
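Claim 4 does not give a formula for inverse frequency scoring. One common formulation, assumed here purely for illustration, weights a term by the logarithm of the collection size divided by the number of documents containing the term. The function and parameter names are hypothetical.

```python
import math


def inverse_frequency_weights(query_terms, doc_frequency, num_docs):
    """Assign each query term a weight that grows as the term gets rarer.

    doc_frequency: term -> number of documents containing the term
    num_docs:      total number of documents in the collection
    Uses log(N / df), an assumed formulation that the claim does not specify.
    """
    weights = {}
    for term in query_terms:
        df = doc_frequency.get(term, 0)
        if df > 0:  # terms absent from the collection get no weight
            weights[term] = math.log(num_docs / df)
    return weights


# Hypothetical statistics: "rare" is in 10 of 10,000 documents.
weights = inverse_frequency_weights(
    ["rare", "common", "missing"],
    {"rare": 10, "common": 1000},
    10000,
)
```

Under this assumption a rare term contributes more to a document's total matched term weight than a common one.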
-
5. The method as set forth in claim 1 further including:
-
defining a total relevance score range; and
defining one or more relevance score bins within the total relevance score range as a function of a total term weight, the total term weight being a sum of the term weights from the search query.
-
-
6. The method as set forth in claim 5 wherein the total relevance score range is divided into the one or more relevance score bins each having an equal size.
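A minimal sketch of equal-size bins, assuming the total relevance score range runs from 0 to the total term weight and that bins are numbered from 0 (least relevant) upward. The function name and the numbering convention are assumptions made for this example.

```python
def bin_index(matched_weight, total_term_weight, num_bins):
    """Map a total matched weight in [0, total_term_weight] to an equal-size bin.

    Returns an integer in 0..num_bins-1; a higher index means more relevant.
    """
    if total_term_weight <= 0 or num_bins <= 0:
        raise ValueError("total_term_weight and num_bins must be positive")
    if not 0 <= matched_weight <= total_term_weight:
        raise ValueError("matched_weight is outside the total score range")
    if matched_weight == total_term_weight:
        return num_bins - 1  # the top of the range belongs to the last bin
    return int(matched_weight * num_bins // total_term_weight)


# Hypothetical query whose term weights sum to 8, split into 4 bins of width 2.
full_match = bin_index(8, 8, 4)
partial_match = bin_index(5, 8, 4)
weak_match = bin_index(1, 8, 4)
```

With these values a document matching every term falls in bin 3, a document with matched weight 5 falls in bin 2, and a document with matched weight 1 falls in bin 0.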
-
7. A computer readable medium having computer executable instructions for performing a method for optimizing a response time for retrieving relevant documents from a set of candidate documents identified in response to a search query where the search query includes one or more terms, the method comprising the steps of:
-
assigning a term weight to each of the terms of the search query;
associating a document to a relevance score bin based on a total matched term weight where a document that matches a first total term weight is associated to a more relevant score bin than a document that matches a second total term weight that is less than the first total term weight; and
retrieving a set of most relevant documents based on the association to the relevance score bins having a highest relevance score without retrieving other candidate documents.
-
-
8. An object retrieval system comprising:
-
a logic for processing a search query having one or more terms;
an object retrieval logic for identifying candidate objects that match the search query;
a ranking logic for assigning a term weight to each of the terms of the search query and associating each combination of matched term weights to a relevance score range, the ranking logic grouping the candidate objects based on the total matched term weight where an object that matches a first total term weight is associated to a more relevant score range than an object that matches a second total term weight that is less than the first total term weight; and
a retrieval logic for retrieving a set of relevant objects associated to the relevance score ranges having a greatest total matched term weight without retrieving the candidate objects from other relevance score ranges.
-
-
9. The object retrieval system of claim 8 wherein the logic for processing the search query includes a parser that parses the search query to identify the terms.
-
10. The object retrieval system of claim 8 wherein the ranking logic includes:
-
logic for defining a total relevance score range; and
logic for defining the relevance score ranges within the total relevance score range as a function of possible term weights that an object can match.
-
-
11. The object retrieval system of claim 8 wherein the retrieval logic includes logic for retrieving only objects from the candidate objects that match a highest value of the term weights.
-
12. The object retrieval system of claim 8 wherein the ranking logic includes means for associating document relevance scores to the relevance ranges based on matched term weight.
-
13. A computer-implemented method of retrieving most relevant documents from a set of candidate documents that match a search query having one or more terms, the method comprising:
-
assigning a term weight to each of the terms in the search query;
defining a total relevance score range and a plurality of score bins therein;
establishing a relationship between a total term weight matched by a document and a score bin within the total relevance score range;
associating a score bin to a document based on the total term weight matched by the document; and
retrieving the most relevant documents based on the score bins.
-
-
14. The method as set forth in claim 13 wherein associating includes associating a score bin to a document such that a final relevance score of the document is limited to the score bin and the most relevant documents are identifiable without having to determine a final relevance score for all the candidate documents.
-
15. The method as set forth in claim 13 wherein retrieving the most relevant documents includes retrieving documents that match a highest term weight associated to a highest score bin without retrieving documents associated to other score bins.
-
16. The method as set forth in claim 13 wherein a score bin is associated to a document such that a document that matches a first total term weight is associated to a more relevant score bin than a document that matches a second total term weight that is less than the first total term weight.
-
17. The method as set forth in claim 13 wherein the assigning a term weight is based on an inverse frequency scoring.
-
18. The method as set forth in claim 13 wherein the associating allows the most relevant documents to be identified independently from the other candidate documents.
-
19. The method as set forth in claim 13 further including displaying the most relevant documents to a user.
-
20. A computer readable medium having computer executable instructions for performing a method for retrieving most relevant documents from a set of candidate documents that match a search query having one or more terms, the method comprising:
-
assigning a term weight to each of the terms;
defining a total relevance score range and a plurality of score bins therein;
establishing a relationship between a total term weight matched by a document and a score bin within the total relevance score range;
associating a score bin to a document based on the total term weight matched by the document; and
retrieving the most relevant documents based on the score bins.
-
-
21. A computer-implemented method of optimizing a response time for retrieving relevant documents from a set of candidate documents identified in response to a search query where the search query includes one or more terms, the method comprising:
-
assigning a term weight to each of the terms in the search query, computing a total matched term weight for a document by matching the terms in the search query to terms associated with the document and summing the term weights of the terms in the search query that match the terms associated with the document, where the terms associated with the document are stored in an index;
associating the document with a relevance score bin based on a total matched term weight, where a first document with a first total term weight in a first relevance range is associated with a first score bin associated with more relevant documents than a second document with a second total term weight in a second relevance range; and
selectively retrieving a subset of a set of documents, where the subset includes documents associated with one or more relevance score bins.
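The steps of claim 21 can be sketched against an inverted index, so that the total matched term weight is computed from index entries without reading the documents themselves. The index layout (term mapped to a list of document ids), the function names, and the sample data are assumptions made for this example.

```python
from collections import defaultdict


def matched_weights_from_index(query_weights, inverted_index):
    """Sum, per document, the weights of the query terms it matches.

    inverted_index: term -> iterable of ids of documents containing the term
    """
    totals = defaultdict(int)
    for term, weight in query_weights.items():
        for doc_id in inverted_index.get(term, ()):
            totals[doc_id] += weight
    return dict(totals)


def docs_in_range(totals, low, high):
    """Select the documents whose total matched weight lies in [low, high]."""
    return sorted(d for d, w in totals.items() if low <= w <= high)


# Hypothetical index and weights, for illustration only.
inverted_index = {
    "oracle": ["d1", "d2"],
    "database": ["d1", "d2", "d3"],
    "tuning": ["d1", "d3"],
}
query_weights = {"oracle": 4, "database": 2, "tuning": 1}
totals = matched_weights_from_index(query_weights, inverted_index)
most_relevant = docs_in_range(totals, 6, 7)
```

Only the ids in most_relevant would then need to be fetched; d3, with a total of 3, belongs to a lower range and is never retrieved.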
-
-
22. The method of claim 21, where the relevance range is computed dynamically based, at least in part, on the sum of the term weights from the search query.
-
23. The method of claim 21, where the relevance range is computed dynamically based on the sum of the term weights from the search query divided by a greatest common denominator of the term weights.
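One possible reading of claim 23, offered as an assumption, is that the number of score steps equals the sum of the term weights divided by their greatest common divisor (the claim's wording is "greatest common denominator"). With integer weights, every achievable total is a multiple of that divisor, so steps of that width cover each possible total exactly. The function names are hypothetical.

```python
from functools import reduce
from math import gcd


def bin_step(term_weights):
    """Return the greatest common divisor of the integer term weights."""
    return reduce(gcd, term_weights)


def bin_count(term_weights):
    """Return the sum of the term weights divided by their common divisor."""
    return sum(term_weights) // bin_step(term_weights)


def bin_of(matched_weight, term_weights):
    """Map a total matched weight to its step number (0 means nothing matched)."""
    return matched_weight // bin_step(term_weights)


# Hypothetical weights 4, 2 and 6: divisor 2, sum 12, so 6 steps above zero.
steps = bin_count([4, 2, 6])
```

For weights 4, 2 and 6, a document matching the terms weighted 4 and 6 has a total of 10 and lands in step 5, one below a full match at step 6.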
Specification