Process for enhancing queries for information retrieval
First Claim
1. A method for enhancing information search queries for information retrieval by computer, comprising the steps of:
establishing a plurality of term usage subject areas (TUSAs) wherein each TUSA comprises a predetermined subject area;
identifying a corpus of documents, messages, expositions, or communications exemplifying patterns of term usage specific to each TUSA wherein the corpus for each TUSA includes documents, messages, expositions, or communications disparate from information search queries;
analyzing documents, messages, expositions, or communications within each corpus of each TUSA to extract term co-occurrence and usage patterns;
receiving an information search query wherein the information search query includes one or more search terms;
identifying and assigning a primary TUSA corresponding to the information search query;
locating alternative or additional query terms or query phrases within the primary TUSA based on the term co-occurrence and usage patterns extracted through the analysis of the documents, messages, expositions, or communications within the corpus of the primary TUSA;
presenting the located alternative or additional query terms or phrases within the primary TUSA for use in refining the information search query;
permitting the selection and de-selection of alternative or additional query terms or phrases from among the located alternative or additional query terms or phrases presented by an executive action that does not necessarily require typing individual characters;
providing a mechanism to combine alternative or additional query terms or phrases selected from among the located alternative or additional query terms or phrases presented with the information search query received to create a new, enhanced information search query;
providing a mechanism to submit the new, enhanced information search query to a search engine to generate information search query results.
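The steps of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the tiny corpora, the document-level co-occurrence counting, the vocabulary-overlap rule for choosing the primary TUSA, and all function names are assumptions made for illustration only.

```python
import re
from collections import Counter, defaultdict
from itertools import combinations

def tokenize(text):
    # Assumed tokenizer: lowercase alphanumeric runs.
    return re.findall(r"[a-z0-9]+", text.lower())

def build_cooccurrence(corpus):
    # Count how often two distinct terms appear in the same document.
    cooc = defaultdict(Counter)
    for doc in corpus:
        for a, b in combinations(sorted(set(tokenize(doc))), 2):
            cooc[a][b] += 1
            cooc[b][a] += 1
    return cooc

def assign_primary_tusa(query, models):
    # Assumed rule: the TUSA whose vocabulary covers the most query terms.
    terms = set(tokenize(query))
    return max(models, key=lambda name: sum(1 for t in terms if t in models[name]))

def suggest_terms(query, cooc, limit=3):
    # Rank non-query terms by total co-occurrence with the query terms.
    terms = set(tokenize(query))
    scores = Counter()
    for term in terms:
        for other, count in cooc.get(term, {}).items():
            if other not in terms:
                scores[other] += count
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:limit]]

def enhance_query(query, selected_terms):
    # Combine the received query with the terms the user selected.
    return " ".join([query, *selected_terms])

# Hypothetical corpora, one per TUSA (not taken from the patent).
corpora = {
    "astronomy": ["mercury orbit sun planet", "planet orbit telescope", "mercury planet crater"],
    "chemistry": ["mercury thermometer toxic metal", "metal alloy toxic", "mercury metal liquid"],
}
models = {name: build_cooccurrence(docs) for name, docs in corpora.items()}

query = "mercury planet"
primary = assign_primary_tusa(query, models)
suggestions = suggest_terms(query, models[primary])
enhanced = enhance_query(query, suggestions[:1])
```

In an interface, the suggestions would be shown as selectable items (for example checkboxes), so that terms can be selected or de-selected without typing; the enhanced string is what would then be submitted to a search engine.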
2 Assignments
0 Petitions
Abstract
A process for enhancing queries for information retrieval automatically finds the preferred, first-ranked matching term usage subject area (“TUSA”) from a prior query. The process also automatically finds alternative TUSAs for the prior query, ranked by degree of match or preference, and provides an option to switch among the alternative TUSAs. A TUSA for the query must be passively accepted or actively selected from a presented list based on the prior query. Using means prepared in advance from data sets of messages collected for each TUSA and general vocabulary, the process also ranks and presents to the user alternative and additional query terms and phrases reflecting specificity and relevance to the query and the TUSA. Significantly relevant terms and phrases are presented for query refinement and ranked by relevance, permitting the user to select and deselect query terms and run a new search based on the enhanced query.
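The TUSA ranking and the passive-or-active selection described in the abstract might be sketched as follows. The scoring rule (fraction of query terms found in each TUSA vocabulary), the sample vocabularies, and the function names are assumptions for illustration; the abstract does not specify how the degree of match is computed.

```python
def rank_tusas(query_terms, tusa_vocab):
    # Order TUSAs by the fraction of query terms present in each vocabulary.
    if not query_terms:
        raise ValueError("query must contain at least one term")
    scored = []
    for name, vocab in tusa_vocab.items():
        hits = sum(1 for term in query_terms if term in vocab)
        scored.append((name, hits / len(query_terms)))
    # Highest score first; ties broken alphabetically for a stable order.
    scored.sort(key=lambda item: (-item[1], item[0]))
    return scored

def choose_tusa(ranked, active_choice=None):
    # Passive acceptance keeps the first-ranked TUSA; an active choice overrides it.
    names = [name for name, _ in ranked]
    if active_choice is None:
        return names[0]
    if active_choice not in names:
        raise ValueError(f"unknown TUSA: {active_choice}")
    return active_choice

# Hypothetical vocabularies, one per TUSA.
tusa_vocab = {
    "astronomy": {"mercury", "planet", "orbit"},
    "chemistry": {"mercury", "metal", "toxic"},
    "mythology": {"mercury", "god", "roman"},
}
ranked = rank_tusas(["mercury", "orbit"], tusa_vocab)
accepted = choose_tusa(ranked)
switched = choose_tusa(ranked, active_choice="chemistry")
```

Here `accepted` models a user who simply proceeds with the first-ranked TUSA, while `switched` models a user who picks an alternative from the presented list.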
145 Citations
26 Claims
1. A method for enhancing information search queries for information retrieval by computer, comprising the steps of:
establishing a plurality of term usage subject areas (TUSAs) wherein each TUSA comprises a predetermined subject area;
identifying a corpus of documents, messages, expositions, or communications exemplifying patterns of term usage specific to each TUSA wherein the corpus for each TUSA includes documents, messages, expositions, or communications disparate from information search queries;
analyzing documents, messages, expositions, or communications within each corpus of each TUSA to extract term co-occurrence and usage patterns;
receiving an information search query wherein the information search query includes one or more search terms;
identifying and assigning a primary TUSA corresponding to the information search query;
locating alternative or additional query terms or query phrases within the primary TUSA based on the term co-occurrence and usage patterns extracted through the analysis of the documents, messages, expositions, or communications within the corpus of the primary TUSA;
presenting the located alternative or additional query terms or phrases within the primary TUSA for use in refining the information search query;
permitting the selection and de-selection of alternative or additional query terms or phrases from among the located alternative or additional query terms or phrases presented by an executive action that does not necessarily require typing individual characters;
providing a mechanism to combine alternative or additional query terms or phrases selected from among the located alternative or additional query terms or phrases presented with the information search query received to create a new, enhanced information search query;
providing a mechanism to submit the new, enhanced information search query to a search engine to generate information search query results.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A system for enhancing information search queries for information retrieval by computer, comprising
a browser program;
information pages served to the browser program;
a plurality of established term usage subject areas (TUSAs) wherein each TUSA comprises a predetermined subject area;
an identified corpus of documents, messages, expositions, or communications exemplifying patterns of term usage specific to each TUSA wherein the corpus for each TUSA includes documents, messages, expositions, or communications disparate from information search queries;
means for analyzing documents, messages, expositions, or communications within each corpus of each TUSA to extract term co-occurrence and usage patterns and statistics;
means for receiving an information search query relative to the browser program wherein the information search query includes one or more search terms;
means for identifying and assigning a primary TUSA corresponding to the information search query;
means for locating alternative or additional query terms or query phrases within the primary TUSA based on the term co-occurrence and usage patterns extracted through the analysis of the documents, messages, expositions, or communications within the corpus of the primary TUSA;
means for presenting the located alternative or additional query terms or phrases within the primary TUSA to the user via an interface for use in refining the information search query;
means for permitting a selection and de-selection of alternative or additional query terms or phrases from among the located alternative or additional query terms or phrases presented by an executive action that does not necessarily require typing individual characters;
a mechanism to combine alternative or additional query terms or phrases selected from among the located alternative or additional query terms or phrases presented with the information search query received to create a new, enhanced information search query; and
a mechanism to submit the new, enhanced information search query to a search engine to generate information search query results.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22
23. A method for enhancing information search queries for information retrieval by computer, comprising the steps of:
establishing during system development of a plurality of term usage subject areas (TUSAs) wherein each TUSA comprises a predetermined subject area;
identifying a corpus of documents, messages, expositions, or communications exemplifying patterns of term usage specific to each TUSA wherein the corpus for each TUSA includes documents, messages, expositions, or communications disparate from information search queries;
analyzing documents, messages, expositions, or communications within each corpus of each TUSA to extract term co-occurrence and usage patterns;
receiving an information search query wherein the information search query includes one or more search terms;
prior preparation during system development of means, for each TUSA, to suggest, in lists of single terms or multiple term phrases, ranked by relevance to the information search query, alternative or additive terms corresponding to terms of the information search query based on the term co-occurrence and usage patterns extracted through the analysis of the documents, messages, expositions, or communications within the corpus of the TUSA;
establishing a primary TUSA from the information search query;
establishing alternative TUSAs for the information search query, ranking the primary TUSA and the alternative TUSAs by degree of match to the information search query, and providing an option to switch among the primary TUSA and the alternative TUSAs;
requiring that a TUSA for the information search query be passively accepted or actively selected from the primary TUSA and the alternative TUSAs to establish a selected TUSA;
locating and presenting statistically significant additional terms or query phrases based on the term co-occurrence and usage patterns extracted through the analysis of the documents, messages, expositions, or communications within the corpus of the selected TUSA, specific for the information search query and the selected TUSA, for the purpose of query refinement, wherein the additional terms or query phrases are ranked by specificity and relevance to the information search query and the selected TUSA;
permitting a selection and de-selection of alternative or additional query terms by a simplified executive action that does not require typing individual characters;
providing a mechanism to combine the alternative or additional query terms selected by the user with the information search query to create a new, enhanced information search query; and
providing means to submit the new, enhanced information search query to a search engine to generate and view information search query results.
Dependent claims: 24, 25, 26
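Claim 23 ranks candidate terms by specificity and relevance but does not give a formula in the text above. One possible reading, sketched below in Python, treats specificity as how much more frequent a term is in the selected TUSA than in general vocabulary, and relevance as the term's co-occurrence count with the query terms. The scoring formula, the smoothing, the cutoff, and the sample counts are all assumptions for illustration.

```python
import math

def specificity(term, tusa_counts, general_counts):
    # Ratio of the term's relative frequency in the TUSA corpus to its
    # relative frequency in general vocabulary (assumed measure).
    tusa_total = sum(tusa_counts.values())
    general_total = sum(general_counts.values())
    p_tusa = tusa_counts.get(term, 0) / tusa_total
    # Additive smoothing so terms unseen in general vocabulary do not divide by zero.
    p_general = (general_counts.get(term, 0) + 1) / (general_total + 1)
    return p_tusa / p_general

def rank_candidates(cooc_with_query, tusa_counts, general_counts, min_specificity=1.0):
    # Keep terms more typical of the TUSA than of general usage, then rank by
    # co-occurrence with the query weighted by log-specificity (assumed score).
    ranked = []
    for term, cooc in cooc_with_query.items():
        spec = specificity(term, tusa_counts, general_counts)
        if spec > min_specificity:
            ranked.append((term, cooc * math.log(spec)))
    ranked.sort(key=lambda item: (-item[1], item[0]))
    return ranked

# Hypothetical counts: term frequencies in one TUSA corpus and in general text.
tusa_counts = {"orbit": 30, "crater": 10, "the": 60}
general_counts = {"orbit": 5, "crater": 1, "the": 993}
# Hypothetical co-occurrence counts of each candidate with the query terms.
cooc_with_query = {"orbit": 4, "crater": 2, "the": 9}

ranked_terms = [term for term, _ in rank_candidates(cooc_with_query, tusa_counts, general_counts)]
```

With these sample counts, a common word such as "the" co-occurs often with the query but is filtered out because it is no more typical of the TUSA than of general usage.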
Specification