Field weighting in text searching
Abstract
A field-weighted search combines statistical information for each term across document fields in a suitably weighted fashion. Both field-specific term frequencies and field and document lengths are considered to obtain a field-weighted document weight for each query term. Each field-weighted document weight can then be combined in order to generate a field-weighted document score that is responsive to the overall query.
51 Claims
1. A method of determining a field-weighted score for a document having multiple fields relative to a query having a plurality of query terms, the method comprising:
replicating each field of the document in accordance with a field weight corresponding to the field to produce an individual field set corresponding to each field in the document;
combining each field set for the document into a virtual document;
indexing the virtual document to produce a virtual document statistics; and
computing the field-weighted score from the virtual document index based on the query.
Dependent claims: 2, 3, 4, 5, 6, 7, 8

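The steps recited in claim 1 (replicate, combine, index, score) can be illustrated with a minimal Python sketch. It is not taken from the patent: the function names, the whitespace tokenizer, the restriction to positive integer field weights, and the toy relative-frequency score are all assumptions made for illustration.

```python
from collections import Counter

def build_virtual_document(fields, field_weights):
    """Replicate each field's tokens by its weight and concatenate the results.

    Assumption: weights are positive integers, so replication is literal copying.
    """
    virtual_tokens = []
    for name, text in fields.items():
        tokens = text.lower().split()        # hypothetical whitespace tokenizer
        weight = field_weights.get(name, 1)  # unlisted fields default to weight 1
        virtual_tokens.extend(tokens * weight)
    return virtual_tokens

def index_virtual_document(virtual_tokens):
    """Collect term frequencies and total length for the virtual document."""
    return {"tf": Counter(virtual_tokens), "length": len(virtual_tokens)}

def score(query, stats):
    """Toy score (an assumption): sum of relative frequencies of the query terms."""
    if stats["length"] == 0:
        return 0.0
    return sum(stats["tf"][term] / stats["length"] for term in query.lower().split())

fields = {"title": "field weighting", "body": "weighting terms in text search"}
weights = {"title": 3, "body": 1}
stats = index_virtual_document(build_virtual_document(fields, weights))
print(stats["tf"]["weighting"], stats["length"], score("field search", stats))
```

In this sketch a term's frequency in the virtual document equals the weighted sum of its per-field frequencies, so "weighting" counts three times for the title plus once for the body.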
9. A computer program product encoding a computer program for executing on a computer system a computer process for determining a field-weighted score for a document having multiple fields relative to a query having a plurality of query terms, the computer process comprising:
replicating each field of the document in accordance with a field weight corresponding to the field to produce an individual field set corresponding to each field in the document;
combining each field set for the document into a virtual document;
indexing the virtual document to produce a virtual document statistics; and
computing the field-weighted score from the virtual document index based on the query.
Dependent claims: 10, 11, 12, 13, 14, 15, 16

17. A method of determining a field-weighted score for a document having multiple fields relative to a query having a plurality of query terms, the method comprising:
determining a field-specific term frequency for each field in the document for each query term;
weighting each field-specific term frequency according to a field weight designated for the corresponding field to compute a field-weighted term frequency for each query term;
computing a field-weighted document weight for each query term based on the field-weighted term frequency for each query term; and
computing the field-weighted score as a function of the field-weighted document weight of all query terms.
Dependent claims: 18, 19, 20, 21, 22, 23, 24, 25

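The steps of claim 17 can be sketched without replicating any text, by weighting the per-field statistics directly. The claim does not say which function turns a field-weighted term frequency into a document weight. The BM25-style saturation below, its parameters `k1` and `b`, the externally supplied `idf` values, and all function names are assumptions chosen for illustration. The field-weighted length follows the abstract's remark that field and document lengths are considered.

```python
def field_weighted_tf(term, field_tokens, field_weights):
    """Sum the per-field term frequencies, each scaled by its field weight."""
    return sum(field_weights[f] * tokens.count(term)
               for f, tokens in field_tokens.items())

def field_weighted_length(field_tokens, field_weights):
    """Sum the per-field lengths, each scaled by its field weight."""
    return sum(field_weights[f] * len(tokens)
               for f, tokens in field_tokens.items())

def document_weight(wtf, wlen, avg_wlen, idf, k1=1.2, b=0.75):
    """BM25-style saturation of the weighted term frequency (an assumption).

    avg_wlen is the average field-weighted length over the collection
    and must be positive.
    """
    norm = k1 * ((1.0 - b) + b * wlen / avg_wlen)
    return idf * wtf * (k1 + 1.0) / (wtf + norm)

def field_weighted_score(query_terms, field_tokens, field_weights, idf, avg_wlen):
    """Combine the per-term document weights into a single score by summing."""
    wlen = field_weighted_length(field_tokens, field_weights)
    return sum(
        document_weight(field_weighted_tf(t, field_tokens, field_weights),
                        wlen, avg_wlen, idf.get(t, 0.0))
        for t in query_terms
    )

doc = {"title": ["field", "weighting"],
       "body": ["weighting", "terms", "in", "text", "search"]}
weights = {"title": 3.0, "body": 1.0}
idf = {"weighting": 1.0}  # hypothetical collection statistic
result = field_weighted_score(["weighting"], doc, weights, idf, avg_wlen=11.0)
print(result)
```

Because the weights are applied to counts rather than to copies of the text, this variant accepts non-integer field weights, and the saturation is applied once to the combined frequency rather than separately per field.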
26. A computer program product encoding a computer program for executing on a computer system a computer process for determining a field-weighted score for a document having multiple fields relative to a query having a plurality of terms, the computer process comprising:
determining a field-specific term frequency for each field in the document for each query term;
weighting each field-specific term frequency according to a field weight designated for the corresponding field to compute a field-weighted term frequency for each query term;
computing a field-weighted document weight for each query term based on the field-weighted term frequency for each query term; and
computing the field-weighted score as a function of the field-weighted document weight of all query terms.
Dependent claims: 27, 28, 29, 30, 31, 32, 33, 34

35. A system for determining a field-weighted score for a document having multiple fields relative to a query having a plurality of terms, the system comprising:
a field-weighted term frequency calculator that determines a field-specific term frequency for each field in the document for each query term and weights each field-specific term frequency according to a field weight identified for the corresponding field to compute a field-weighted term frequency for each query term;
a field-weighted document weight calculator that computes a field-weighted document weight for each query term based on the field-specific term frequency for each query term; and
a document score calculator that computes the field-weighted score as a function of the field-weighted document weight of all query terms.
Dependent claims: 36, 37

38. A method of determining a field-weighted score for a document having multiple fields relative to a query having a plurality of query terms, the method comprising:
computing a field-weighted term frequency for each query term based on field weights designated for individual fields in the document;
computing a field-weighted document weight for each query term based on the field-weighted term frequency for each field in the document; and
computing the field-weight score as a function of the field-weighted document weights of the query terms.
Dependent claims: 39, 40, 41, 42, 43

44. A computer program product encoding a computer program for executing on a computer system a computer process for determining a field-weighted score for a document having multiple fields relative to a query having a plurality of query terms, the computer process comprising:
computing a field-weighted term frequency for each query term based on field weights designated for individual fields in the document;
computing a field-weighted document weight for each query term based on the field-weighted term frequency for each field in the document; and
computing the field-weight score as a function of the field-weighted document weights of the query terms.
Dependent claims: 45, 46, 47, 48, 49

50. A system for determining a field-weighted score for a document having multiple fields relative to a query having a plurality of query terms, the system comprising:
a field-weighted term frequency calculator that computes a field-weighted term frequency for each query term based on field weights designated for individual fields in the document;
a field-weighted document weight calculator that computes a field-weighted document weight for each query term based on the field-weighted term frequency for each field in the document; and
a search engine that computes the field-weighted score as a function of the field-weighted document weights of the query terms.
Dependent claims: 51

Specification