Structured contextual clustering method and system in a federated search engine
First Claim
1. A method for grouping search results received from information sources, comprising:
- receiving a query with user-selected keywords to be evaluated in an information domain;
identifying a plurality of information sources associated with the information domain;
submitting the query to at least two of the plurality of information sources identified;
receiving a set of search results from the at least two information source for the query submitted thereto;
extracting a plurality of attributes from content in each set of search results received using information source wrappers;
specifying a first set of the plurality of attributes, which defines a main clustering strategy, and a second set of the plurality of attributes, which defines a sub-clustering strategy;
each clustering strategy defining a unique set of attributes that may be extracted from the content of search results using the information source wrappers;
generating a main grouping of the sets of search results received from the information sources using the first set of attributes;
the main grouping of search results including a plurality of labeled groups with a plurality of search results in each group; and
generating a sub-grouping of search results for each labeled group of search results in the main grouping of search results using the second set of attributes.
4 Assignments
0 Petitions
Accused Products
Abstract
A federated search engine groups search results from information sources using attributes of the search results. In grouping the search results, a first set and a second set of attributes are extracted from content in each set of search results received using information source wrappers. The first set of attributes defines a main clustering strategy, and the second set of attributes defines a sub-clustering strategy. A main grouping of the sets of search results received from the information sources is generated using the first set of attributes. The main grouping of search results includes a plurality of labeled groups with a plurality of search results in each group. A sub-grouping of search results is generated for each labeled group of search results in the main grouping of search results using the second set of attributes.
-
Citations
20 Claims
-
1. A method for grouping search results received from information sources, comprising:
-
receiving a query with user-selected keywords to be evaluated in an information domain;
identifying a plurality of information sources associated with the information domain;
submitting the query to at least two of the plurality of information sources identified;
receiving a set of search results from the at least two information source for the query submitted thereto;
extracting a plurality of attributes from content in each set of search results received using information source wrappers;
specifying a first set of the plurality of attributes, which defines a main clustering strategy, and a second set of the plurality of attributes, which defines a sub-clustering strategy;
each clustering strategy defining a unique set of attributes that may be extracted from the content of search results using the information source wrappers;
generating a main grouping of the sets of search results received from the information sources using the first set of attributes;
the main grouping of search results including a plurality of labeled groups with a plurality of search results in each group; and
generating a sub-grouping of search results for each labeled group of search results in the main grouping of search results using the second set of attributes. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A method for grouping search results in an information source, comprising:
-
receiving a query with user-selected keywords to be evaluated in an information domain of the information source;
querying the information domain of the information source with the query;
identifying a set of search results for the query submitted to the information domain of the information source;
extracting a plurality of attributes from content in the set of search results received for the submitted query;
specifying a first set of the plurality of attributes, which defines a main clustering strategy, and a second set of the plurality of attributes, which defines a sub-clustering strategy;
each clustering strategy defining a unique set of attributes that may be extracted from the content of search results;
generating a main grouping of the set of search results received from the information sources using the first set of attributes;
the main grouping of search results including a plurality of labeled groups with a plurality of search results in each group; and
generating a sub-grouping of search results for each labeled group of search results in the main grouping of search results using the second set of attributes.
-
-
20. A system for grouping search results received from information sources, comprising:
-
a query translation module for receiving a query with user-selected keywords to be evaluated in an information domain;
the query translation module identifying a plurality of information sources associated with the information domain and submitting the query to at least two of the plurality of information sources identified;
an attribute extraction module for receiving a set of search results from the at least two information source for the query submitted thereto;
the attribute extraction module extracting a plurality of attributes from content in each set of search results received using information source wrappers;
a clustering module for specifying a first set of the plurality of attributes, which defines a main clustering strategy, and a second set of the plurality of attributes, which defines a sub-clustering strategy;
each clustering strategy defining a unique set of attributes that may be extracted from the content of search results using the information source wrappers;
wherein the clustering module generates a main grouping of the sets of search results received from the information sources using the first set of attributes;
the main grouping of search results including a plurality of labeled groups with a plurality of search results in each group; and
wherein the clustering module generates a sub-grouping of search results for each labeled group of search results in the main grouping of search results using the second set of attributes.
-
Specification