INFORMATION-RETRIEVAL SYSTEMS, METHODS, AND SOFTWARE WITH CONCEPT-BASED SEARCHING AND RANKING
First Claim
1. A system comprising:
- a set of target documents; and
means for searching and identifying one or more of the set of target documents as result documents based on a user query, the means for searching and identifying including;
means for identifying one or more first documents based on a set of word co-occurrence probabilities, with the set of word co-occurrence probabilities derived from a set of documents the same as or different than the set of target documents.
4 Assignments
0 Petitions
Accused Products
Abstract
To improve traditional keyword based search engines, the present inventors devised, among other things, systems, methods, and software that use word co-occurrence probabilities not only to identify documents conceptually related to user queries, but also to score and rank search results. One exemplary system combines inverse-document-frequency searching with concept searching based on word co-occurrence probabilities to facilitate finding of documents that would otherwise go unfound using a given query. The exemplary system also allows ranking of search results based both on both keyword matching and concept presence, promoting more efficient organization and review of search results.
-
Citations
11 Claims
-
1. A system comprising:
-
a set of target documents; and means for searching and identifying one or more of the set of target documents as result documents based on a user query, the means for searching and identifying including;
means for identifying one or more first documents based on a set of word co-occurrence probabilities, with the set of word co-occurrence probabilities derived from a set of documents the same as or different than the set of target documents. - View Dependent Claims (2, 3, 4)
-
-
5-8. -8. (canceled)
-
9. A method for using a query having one or more query terms to identify a set of one or more documents within a database, the method comprising:
-
determining for each of one or more documents in the database a score based on; occurrence of one or more of the query terms in the document; and occurrence or one or more non-query terms that are known to co-occur with one or more of the query terms in a set of documents; and displaying one or more of the documents within a search result based on its determined score. - View Dependent Claims (10, 11)
-
Specification