Systems and methods for document searching and organizing
First Claim
1. A method for interactive document searching comprising:
- receiving a search query;
searching for documents using the search query;
retrieving documents located during the searching;
clustering the retrieved documents into categories;
labeling the categories of documents from the clustering;
summarizing the retrieved documents;
displaying the labeled categories and document summaries;
parsing words of the retrieved documents to create word sets before the clustering of retrieved documents into categories; and
filtering a set of predefined words from the word sets.
0 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods interactive document search, retrieval, categorization, and summarization are provided. A document organizer processor may analyze the content of documents, such as web pages and text documents, downloaded from a computer network, such as the Internet or an intranet, in response to a user'"'"'s search query. After receiving a search query from a user, the processor may locate documents related to the query, parse words in the documents into a word set, filter out unnecessary words, group the documents into categories, provide labels for the categories, construct summaries of the documents in each category, determine if any additional words or phases are to be recommended, present the labels and summaries to the user, and enable the user to iteratively refine the search.
234 Citations
27 Claims
-
1. A method for interactive document searching comprising:
-
receiving a search query; searching for documents using the search query; retrieving documents located during the searching; clustering the retrieved documents into categories; labeling the categories of documents from the clustering; summarizing the retrieved documents; displaying the labeled categories and document summaries; parsing words of the retrieved documents to create word sets before the clustering of retrieved documents into categories; and filtering a set of predefined words from the word sets.
-
-
2. A method for interactive document searching comprising:
-
receiving a search query; searching for documents using the search query; retrieving documents located during the searching; clustering the retrieved documents into categories; labeling the categories of documents from the clustering; summarizing the retrieved documents; and displaying the labeled categories and document summaries; parsing words of the retrieved documents to create word sets before the clustering of retrieved documents into categories; and assigning a numerical weight value to the parsed words in each document based on their frequency of appearance in the document. - View Dependent Claims (3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system for interactive document searching comprising:
-
means for receiving a search query; means for searching for documents using the search query; means for retrieving documents located by the searching means; means for clustering the documents retrieved by the retrieving means into categories; means for labeling the categories from the clustering means; means for summarizing the documents retrieved by the retrieving means; means for displaying the labeled categories from the labeling means and the summaries from the summarizing means; means for parsing words of the retrieved documents to create word sets before clustering the retrieved documents into categories by the clustering means; and means for filtering a set of predefined words from the word sets. - View Dependent Claims (12, 13, 14, 15, 16, 17)
-
-
18. A system for interactive document searching comprising:
-
means for receiving a search query; means for searching for documents using the search query; means for retrieving documents located by the searching means; means for clustering the documents retrieved by the retrieving means into categories; means for labeling the categories from the clustering means; means for summarizing the documents retrieved by the retrieving means; means for displaying the labeled categories from the labeling means and the summaries from the summarizing means; means for parsing words of the retrieved documents to create word sets before clustering the retrieved documents into categories by the clustering means; and means for assigning a numerical weight value to the parsed words in each document based on their frequency of appearance in the document. - View Dependent Claims (19, 20)
-
-
21. An interactive document search system comprising:
-
an input device for receiving a search query; a communications device for communicating with a computer network; a search module for searching and retrieving documents on the computer network using the communications device based on the search query received from the input device; a categorizer module for clustering the documents retrieved by the search module into categories; a category labeler module for labeling the categories of clustered documents from the categorizer module; a summarizer module for summarizing the documents categorized by the categorizing module; and a display device for displaying the labels from the labeler module and the summaries from the categorizer module; a word parser module that parses words from the documents retrieved by the search module to create word sets; and a filter module that filters a set of predefined words from the word sets formed by the word parser module. - View Dependent Claims (22, 23, 24)
-
-
25. An interactive document search system comprising:
-
an input device for receiving a search query; a communications device for communicating with a computer network; a search module for searching and retrieving documents on the computer network using the communications device based on the search query received from the input device; a categorizer module for clustering the documents retrieved by the search module into categories; a category labeler module for labeling the categories of clustered documents from the categorizer module; a summarizer module for summarizing the documents categorized by the categorizing module; and a display device for displaying the labels from the labeler module and the summaries from the categorizer module; a word parser module that parses words from the documents retrieved by the search module to create word sets, wherein the word parser module assigns a numerical weight value to the parsed words in each document based on their frequency of appearance in the document. - View Dependent Claims (26, 27)
-
Specification