Topic distillation via subsite retrieval
First Claim
1. A system for providing a search result for a query, the search result being derived from hierarchically organized documents, comprising:
- a search component that identifies documents that are relevant to a query based on a feature of the documents;
a subtree feature calculation component that calculates a subtree feature for an identified document based on a contribution of the feature of the identified document and a contribution of the feature from its descendant documents; and
a relevance component that determines the relevance of the identified documents to the query based on the calculated subtree feature of the identified documents.
2 Assignments
0 Petitions
Accused Products
Abstract
A method and system for generating a search result for a query of hierarchically organized documents based on retrieval of subtrees that are key resources for topic distillation is provided. The retrieval system may identify documents relevant to a query using conventional searching techniques. The retrieval system then calculates a subtree feature for subtrees that have an identified document as their root. After the retrieval system calculates the subtree feature for the subtrees, the retrieval system may generate a subtree relevance score for each subtree based on its subtree feature. The retrieval system may then order the identified documents based on their corresponding subtree relevances.
27 Citations
20 Claims
-
1. A system for providing a search result for a query, the search result being derived from hierarchically organized documents, comprising:
-
a search component that identifies documents that are relevant to a query based on a feature of the documents;
a subtree feature calculation component that calculates a subtree feature for an identified document based on a contribution of the feature of the identified document and a contribution of the feature from its descendant documents; and
a relevance component that determines the relevance of the identified documents to the query based on the calculated subtree feature of the identified documents. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A system for calculating subtree features for subtrees having root documents, comprising:
-
a calculate feature component that calculates a feature for each document within a subtree; and
a calculate subtree feature component that calculates a subtree feature for the subtree wherein a contribution of a descendant document of the root document decreases as an ancestral distance between the descendant document and the root document increases and as a number of sibling documents of the descendant document increases. - View Dependent Claims (8, 9, 10, 11)
-
-
12. A computer-readable medium containing instructions for controlling a computer system to identify web pages for a search result for a query, comprising:
-
calculating a subsite feature for subsites of web pages based on a contribution from a root web page of a subsite and a contribution from descendant web pages of the root web page; and
determining relevance of a subsite to the query based on the calculated subsite feature of the subsite. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20)
-
Specification