Information retrieval system and method
1 Assignment
Abstract
An information retrieval system having a structured data store; and a signature generator configured to receive data from the structured data store, to create a category signature based on the data received from the structured data store, to receive search results from at least one crawler, and to generate a document signature based on the results from the at least one crawler. The system may also include a data store populated with a set of category signatures; and a search utility configured to receive a seed and to provide the seed to a plurality of search engines. Each search engine may be configured to generate a search result set, to parse each search result set, and to return a relevant data set. The crawler is configured to receive the relevant data set and to generate a second set of search results with a relevancy to a category. A signature comparator receives at least one document signature and at least one category signature and compares the two. The signature comparator generates flagged records based on the comparison and an indexed data store is populated with flagged records.
304 Citations
73 Claims
1. An information retrieval system comprising:
a structured data store;
a signature generator configured to receive data from the structured data store, to create a category signature based on the data received from the structured data store, to receive search results from at least one crawler, and to generate a document signature based on the results from the at least one crawler;
a data store populated with a set of category signatures;
a search utility configured to receive a seed, to provide the seed to a plurality of search engines, each search engine configured to generate a search result set, to parse each search result set, and to return a relevant data set;
a crawler configured to receive the relevant data set and to generate a second set of search results with a relevancy to a category, where the second set of results is larger than the first set of results;
a signature comparator configured to receive at least one document signature and at least one category signature, compare the at least one document signature and the at least one category signature, and generate flagged records; and
an indexed data store populated with flagged records from the signature comparator.
Dependent claims: 2, 3, 4.
5. An information retrieval system comprising:
a structured data store;
a signature generator configured to receive groups of related data from the structured data store, to create a category signature based on the data received from the structured data store, to receive a document, and to generate a document signature based on the document;
a data store populated with a set of category signatures;
a signature comparator configured to receive at least one document signature and at least one category signature, compare the at least one document signature and the at least one category signature, and generate flagged records; and
an indexed data store populated with flagged records from the signature comparator.
Dependent claims: 6, 7, 8.
9. A method of creating a structured data store from an unstructured data store, the method comprising:
generating search results from a search of the unstructured data store;
providing the search results to a signature generator to create a document signature;
generating a category signature based on information from a structured data store;
providing the document signature and the category signature to a signature comparator to generate a flagged record; and
populating a data store with the flagged record.
Dependent claims: 10, 11, 12.
13. A method of creating a structured data store from an unstructured data store, the method comprising:
generating search results from a search of an unstructured data store;
providing the search results to a signature generator to create a document signature;
generating a category signature from a structured data store;
providing the document signature and the category signature to a signature comparator to generate a relevancy index;
determining whether the relevancy index exceeds a threshold;
generating flagged records if the relevancy index exceeds the threshold; and
populating a first data store with flagged records.
Dependent claims: 14, 15, 16, 23.
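The steps of claim 13 can be illustrated with a minimal Python sketch. The claims do not say how a signature is built or how the relevancy index is computed, so everything concrete here is an assumption made for illustration: a signature is taken to be a bag of lowercase term counts, the relevancy index is taken to be cosine similarity, and the names `make_signature`, `relevancy_index`, `flag_records` and the default threshold of 0.3 are hypothetical.

```python
import math
import re
from collections import Counter


def make_signature(texts):
    """Assumed signature: lowercase term counts over one or more texts."""
    counts = Counter()
    for text in texts:
        counts.update(re.findall(r"[a-z0-9]+", text.lower()))
    return counts


def relevancy_index(doc_sig, cat_sig):
    """Assumed relevancy index: cosine similarity of two term-count signatures."""
    shared = doc_sig.keys() & cat_sig.keys()
    dot = sum(doc_sig[t] * cat_sig[t] for t in shared)
    doc_norm = math.sqrt(sum(v * v for v in doc_sig.values()))
    cat_norm = math.sqrt(sum(v * v for v in cat_sig.values()))
    if doc_norm == 0 or cat_norm == 0:
        return 0.0
    return dot / (doc_norm * cat_norm)


def flag_records(search_results, category_rows, threshold=0.3):
    """Flag each search result whose relevancy index exceeds the threshold."""
    cat_sig = make_signature(category_rows)  # category signature from structured data
    flagged = []
    for doc in search_results:
        index = relevancy_index(make_signature([doc]), cat_sig)
        if index > threshold:
            flagged.append({"document": doc, "relevancy_index": index})
    return flagged  # the records that would populate the first data store
```

In this sketch the list returned by `flag_records` stands in for the first data store; a real system would presumably write the flagged records to persistent, indexed storage.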
17. A method of creating a structured data store from a group of documents, the method comprising:
providing documents to a signature generator to create a document signature;
generating a category signature from one or more related documents;
providing the document signature and the category signature to a signature comparator to generate a flagged record; and
populating a data store with the flagged record.
18. An apparatus for creating a data store of related documents, the apparatus comprising:
a set of documents segmented into related groups;
a signature generator to create a unique signature for each document group;
a data store populated with signatures for each group of documents;
a signature created by the signature generator for a document;
a signature comparator to flag related documents; and
a data store to hold related, flagged documents.
19. A system for creating a data store of related documents comprising:
a plurality of documents segmented into groups of related documents;
a device to compare the magnitude of the relationship between a document and each group of related documents and to flag documents where the relationship exceeds a threshold; and
a data store to hold the flagged documents.
20. A method to identify relevancy of documents, the method comprising:
generating a signature defining a first set of documents;
generating a second signature defining a second set of documents;
comparing the two signatures;
generating a relevancy index; and
determining the relevancy of the two sets of documents based on a threshold.
21. A system to remove irrelevant records from a query, the system comprising:
a structured data store including groups of related documents;
a signature generator configured to receive groups of related documents and generate a group signature;
a data store of group signatures;
a signature generator configured to receive documents and provide a signature identifying each document;
a signature comparator to compare the signature of a document to the group signatures in the data store of group signatures, flag documents with a high degree of relevancy to one or more groups, and provide the documents to an indexed data store;
a query module to query one or more groups; and
a search engine configured to search the indexed data store and return documents relevant to the chosen group.
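One way to picture the arrangement of claim 21 is the following Python sketch. It is not taken from the specification. It assumes that a group signature is simply the set of terms appearing in the group's documents, that "a high degree of relevancy" means the fraction of a document's terms found in the group signature meets a cutoff, and that the indexed data store is an in-memory dictionary keyed by group name. The function names and the 0.5 cutoff are hypothetical.

```python
import re


def term_set(text):
    """Assumed document signature: the set of lowercase terms in the text."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def group_signature(documents):
    """Assumed group signature: union of the terms of the group's documents."""
    signature = set()
    for document in documents:
        signature |= term_set(document)
    return signature


def overlap(doc_sig, group_sig):
    """Fraction of the document's terms that also occur in the group signature."""
    if not doc_sig:
        return 0.0
    return len(doc_sig & group_sig) / len(doc_sig)


def build_index(documents, groups, threshold=0.5):
    """Compare every document to every group signature and index the relevant ones."""
    signatures = {name: group_signature(docs) for name, docs in groups.items()}
    index = {name: [] for name in groups}
    for document in documents:
        doc_sig = term_set(document)
        for name, signature in signatures.items():
            if overlap(doc_sig, signature) >= threshold:
                index[name].append(document)
    return index


def query(index, group, keyword):
    """Search only the documents indexed under the chosen group."""
    return [d for d in index.get(group, []) if keyword.lower() in term_set(d)]
```

Because documents that match no group are never indexed, a query against a group cannot return them, which is the filtering effect the claim describes.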
22. A method to search a data store, the method comprising:
generating a list of terms descriptive of a category;
generating a set of search results from a plurality of search engines;
parsing the search result sets; and
crawling a data store based on the parsed search result set.
24. A system for crawling a data store, the system comprising:
a set of terms descriptive of a category;
a plurality of search engines configured to receive the set of terms and generate a first search result;
a parser to filter the first search results; and
a crawler configured to receive the parsed results and to generate a second set of results, where the second set of results is larger than the first set of results.
Dependent claims: 25.
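The flow of claim 24 (terms to several search engines, a parser that filters the first results, then a crawler that expands them) might be sketched in Python as below. This is an illustrative sketch only: the search engines are stand-in functions, the "web" is a small in-memory link table rather than real network access, the parser's filtering rule (drop duplicates and non-HTTP entries) is an assumption, and all names and URLs are hypothetical.

```python
from collections import deque


def meta_search(terms, engines):
    """Send the same terms to every engine and collect one result list per engine."""
    return [engine(terms) for engine in engines]


def parse_results(result_sets):
    """Assumed filter: keep HTTP(S) URLs, drop duplicates, preserve order."""
    seen, parsed = set(), []
    for results in result_sets:
        for url in results:
            if url.startswith("http") and url not in seen:
                seen.add(url)
                parsed.append(url)
    return parsed


def crawl(start_urls, fetch_links, max_pages=100):
    """Breadth-first crawl that starts from the parsed results and follows links."""
    visited = list(dict.fromkeys(start_urls))
    seen = set(visited)
    queue = deque(visited)
    while queue and len(visited) < max_pages:
        for link in fetch_links(queue.popleft()):
            if link not in seen and len(visited) < max_pages:
                seen.add(link)
                visited.append(link)
                queue.append(link)
    return visited


# Stand-in data: two fake engines and a tiny link table (all hypothetical).
WEB = {
    "http://a.example": ["http://a.example/1", "http://b.example"],
    "http://b.example": ["http://b.example/1"],
}


def engine_one(terms):
    return ["http://a.example", "ftp://skip.example"]


def engine_two(terms):
    return ["http://a.example", "http://b.example"]


first = parse_results(meta_search(["solar", "inverter"], [engine_one, engine_two]))
second = crawl(first, lambda url: WEB.get(url, []))
```

In this sketch the second set contains the parsed results plus every newly discovered link, so it is larger than the first set whenever the crawl finds at least one page that the search engines did not return.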
26. An information retrieval system comprising:
an indexed data store containing data from a plurality of structured and unstructured data stores;
a query builder configured to choose at least one of the plurality of structured and unstructured data stores to include in a query, select fields related to the at least one data store chosen, and accept criteria from a user interface for the selected fields; and
a search utility to search the indexed data store and return results matching the query built.
Dependent claims: 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41.
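The query builder and search utility of claim 26 could look roughly like the Python sketch below. It is a sketch under stated assumptions, not the patented implementation: the indexed data store is modeled as a list of dictionaries tagged with a `store` key, criteria are matched as case-insensitive substrings, and the `QueryBuilder` class, its methods, and the `search` function are hypothetical names.

```python
from dataclasses import dataclass, field


@dataclass
class QueryBuilder:
    """Collects the chosen data stores and per-field criteria for one query."""
    available_fields: dict  # data store name -> list of its field names
    stores: list = field(default_factory=list)
    criteria: dict = field(default_factory=dict)

    def choose_store(self, store):
        """Include one of the known data stores in the query."""
        if store not in self.available_fields:
            raise ValueError(f"unknown data store: {store}")
        self.stores.append(store)
        return self

    def where(self, field_name, value):
        """Accept a criterion for a field that belongs to a chosen data store."""
        if not any(field_name in self.available_fields[s] for s in self.stores):
            raise ValueError(f"field not in chosen data stores: {field_name}")
        self.criteria[field_name] = value
        return self


def search(indexed_store, query):
    """Return records from the chosen stores that satisfy every criterion."""
    results = []
    for record in indexed_store:
        if record["store"] not in query.stores:
            continue
        if all(
            str(value).lower() in str(record.get(name, "")).lower()
            for name, value in query.criteria.items()
        ):
            results.append(record)
    return results
```

A caller would chain the builder, for example `QueryBuilder(fields).choose_store("patents").where("title", "solar")`, and pass the result to `search`. Validating field names against the chosen stores mirrors the claim's requirement that the selectable fields relate to the data store chosen.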
42. An information retrieval system comprising:
an indexed data store containing data from a plurality of structured and unstructured data stores;
a query builder configured to choose at least one of the plurality of structured and unstructured data stores to include in a query, select fields related to the at least one data store chosen, and accept criteria from a user interface for the selected fields; and
a search utility to search the indexed data store and return results matching the query built;
the search utility configured to allow a user to select zero or more results for entry in a data store and to perform multiple searches.
Dependent claims: 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57.
58. An information retrieval system comprising:
an indexed data store containing data from a plurality of structured and unstructured data stores;
a query builder configured to choose at least one of the plurality of structured and unstructured data stores to include in a query, select fields related to the at least one data store chosen, and accept criteria from a user interface for the selected fields, and receive query input from a plurality of users; and
a search utility to search the indexed data store and return results matching the query built; and
Dependent claims: 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73.
Specification