Phrase-based personalization of searches in an information retrieval system
Abstract
An information retrieval system uses phrases to index, retrieve, organize, and describe documents. Phrases are identified that predict the presence of other phrases in documents. Documents are then indexed according to their included phrases. Related phrases and phrase extensions are also identified. Phrases in a query are identified and used to retrieve and rank documents. Phrases are also used to cluster documents in the search results, create document descriptions, and eliminate duplicate documents from both the search results and the index.
27 Claims
1. A method of personalizing a search of a document collection to a user, the method comprising:
storing a user model associated with the user, and comprising a plurality of phrases contained in documents accessed by the user;
receiving a query from the user;
selecting search results comprising a plurality of documents responsive to the query;
identifying phrases that are related to the query and present in the user model;
weighting a plurality of scores of a corresponding plurality of the search results according to the identified phrases;
ranking the plurality of the search results for presentation to the user according to their weighted scores, to provide personalized search results; and
presenting the personalized search results to the user.
Dependent claims: 5, 6, 7, 8, 9, 10, 11, 12
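The steps of claim 1 can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the claim does not say how scores are weighted, so the multiplicative `boost` per matching phrase is an assumption, and the `Result` type, the `related` mapping, and all sample data are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Result:
    doc_id: str
    score: float        # base relevance score from retrieval
    phrases: frozenset  # phrases contained in the document


def identify_phrases(query_phrases, related, user_model):
    """Return phrases related to the query that are also in the user model."""
    identified = set()
    for query_phrase in query_phrases:
        identified |= related.get(query_phrase, set()) & user_model
    return identified


def personalize(results, identified, boost=0.5):
    """Weight each score by the identified phrases the document contains, then rank.

    The weighting formula is an assumption made for illustration only.
    """
    weighted = [
        (r.score * (1.0 + boost * len(identified & r.phrases)), r.doc_id)
        for r in results
    ]
    weighted.sort(key=lambda pair: pair[0], reverse=True)
    return [doc_id for _, doc_id in weighted]


# Hypothetical data: phrases related to each query phrase, and the user model.
related = {"jaguar": {"big cat", "sports car", "rain forest"}}
user_model = {"big cat", "rain forest", "wildlife"}
results = [
    Result("car-review", 1.0, frozenset({"sports car"})),
    Result("zoo-guide", 0.8, frozenset({"big cat", "rain forest"})),
]

identified = identify_phrases(["jaguar"], related, user_model)
ranking = personalize(results, identified)
```

With this sample data, the document about big cats overtakes the higher-scoring car review because two of its phrases appear in the user model.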
2. A method of personalizing a search of a document collection to a user, the method comprising:
storing a user model associated with the user, and comprising a plurality of phrases contained in documents accessed by the user;
receiving a query from the user;
selecting search results comprising a plurality of documents responsive to the query;
identifying phrases that are related to the query and present in the user model, comprising:
  for each phrase of the query, accessing a related phrase bit vector for the query phrase, wherein each bit of the related phrase bit vector indicates the presence or absence of a related phrase of the query phrase;
  determining from the related phrase bit vector which of the related phrases are present in the user model; and
  forming a related phrase bit mask corresponding to the related phrases that are present in the user model;
weighting a plurality of scores of a corresponding plurality of the search results according to the identified phrases;
ranking the plurality of the search results for presentation to the user according to their weighted scores, to provide personalized search results; and
presenting the personalized search results to the user.
Dependent claims: 3
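The bit mask formation in claim 2 might look like the sketch below. It assumes the related phrases of a query phrase have a fixed order so that bit i stands for related phrase i. The final step, ANDing the mask with a per-document bit vector and counting the surviving bits, is an assumption about how the mask could feed into weighting; the claim text above does not spell it out. All names and data are hypothetical.

```python
def build_bit_mask(related_phrases, user_model):
    """Set bit i when the i-th related phrase is present in the user model."""
    mask = 0
    for i, phrase in enumerate(related_phrases):
        if phrase in user_model:
            mask |= 1 << i
    return mask


def matching_bits(doc_bit_vector, mask):
    """Count related phrases that are both in the document and in the user model."""
    return bin(doc_bit_vector & mask).count("1")


# Hypothetical ordering: bit 0 = "big cat", bit 1 = "sports car", bit 2 = "rain forest".
related_phrases = ["big cat", "sports car", "rain forest"]
user_model = {"big cat", "rain forest", "wildlife"}

mask = build_bit_mask(related_phrases, user_model)

# Hypothetical document containing "sports car" and "rain forest" (bits 1 and 2).
doc_bit_vector = 0b110
overlap = matching_bits(doc_bit_vector, mask)
```

Representing the related phrases as bits keeps the per-document check to a single AND and a bit count, which is the usual appeal of a mask over repeated set lookups.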
4. A method of personalizing a search of a document collection to a user, the method comprising:
storing a user model associated with the user, and comprising a plurality of cluster counts, each cluster count associated with a predetermined cluster that includes a plurality of related phrases, and storing a count of a number of instances of related phrases of the cluster appearing in a document accessed by the user;
receiving a query from the user;
selecting search results comprising a plurality of documents responsive to the query;
identifying phrases that are related to the query and associated with cluster counts in the user model;
weighting a plurality of scores of a corresponding plurality of the search results according to the cluster counts of the identified phrases;
ranking the plurality of the search results for presentation to the user according to their weighted scores, to provide personalized search results; and
presenting the personalized search results to the user.
Dependent claims: 13, 14, 15, 16, 17, 18, 19
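A cluster-count user model in the spirit of claim 4 could be sketched as follows. The claim only requires that scores be weighted "according to the cluster counts"; the additive formula in `weighted_score` is an assumption, as are the cluster names, helper functions, and sample data.

```python
from collections import Counter


def record_access(cluster_counts, doc_phrases, clusters):
    """For each cluster, add the number of its phrase instances in an accessed document."""
    for cluster_id, members in clusters.items():
        cluster_counts[cluster_id] += sum(1 for p in doc_phrases if p in members)


def cluster_of(phrase, clusters):
    """Return the id of the cluster containing the phrase, or None."""
    for cluster_id, members in clusters.items():
        if phrase in members:
            return cluster_id
    return None


def weighted_score(base, result_phrases, query_related, cluster_counts, clusters):
    """Scale a base score by the cluster counts of identified phrases in the result.

    The formula (1 + sum of counts) is an illustrative assumption.
    """
    weight = 1.0
    for phrase in query_related & result_phrases:
        cluster_id = cluster_of(phrase, clusters)
        if cluster_id is not None:
            weight += cluster_counts[cluster_id]
    return base * weight


# Hypothetical predetermined clusters of related phrases.
clusters = {
    "felines": {"big cat", "rain forest"},
    "autos": {"sports car"},
}

counts = Counter()
# The user accesses a document with these phrase instances (repeats count).
record_access(counts, ["big cat", "big cat", "rain forest", "news"], clusters)

query_related = {"big cat", "sports car"}
zoo = weighted_score(0.8, {"big cat"}, query_related, counts, clusters)
car = weighted_score(1.0, {"sports car"}, query_related, counts, clusters)
```

Storing one count per cluster rather than one entry per phrase keeps the user model compact while still capturing which topics the user has read about.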
20. A computer readable storage medium storing a computer program executable by a processor for personalizing a search of a document collection to a user, the operations of the computer program comprising:
storing a user model associated with the user, and comprising a plurality of phrases contained in documents accessed by the user;
receiving a query from the user;
selecting search results comprising a plurality of documents responsive to the query;
identifying phrases that are related to the query and present in the user model;
weighting a plurality of scores of a corresponding plurality of the search results according to the identified phrases;
ranking the plurality of the search results for presentation to the user according to their weighted scores, to provide personalized search results; and
presenting the personalized search results to the user.
Dependent claims: 21
22. A computer readable storage medium storing a computer program executable by a processor for personalizing a search of a document collection to a user, the operations of the computer program comprising:
storing a user model associated with the user, and comprising a plurality of cluster counts, each cluster count associated with a predetermined cluster that includes a plurality of related phrases, and storing a count of a number of instances of related phrases of the cluster appearing in a document accessed by the user;
receiving a query from the user;
selecting search results comprising a plurality of documents responsive to the query;
identifying phrases that are related to the query and associated with cluster counts in the user model;
weighting a plurality of scores of a corresponding plurality of the search results according to the cluster counts of the identified phrases;
ranking the plurality of the search results for presentation to the user according to their weighted scores, to provide personalized search results; and
presenting the personalized search results to the user.
Dependent claims: 23
24. A computer implemented system for personalizing a search of a document collection to a user, comprising:
a user model associated with the user, stored in a storage medium and comprising a plurality of phrases contained in documents accessed by the user; and
a query processing system executed by a computer and adapted to:
  receive a query from the user,
  select search results comprising a plurality of documents responsive to the query,
  identify phrases that are related to the query and present in the user model,
  weight a plurality of scores of a corresponding plurality of the search results according to the identified phrases,
  rank the plurality of the search results for presentation to the user according to their weighted scores, to provide personalized search results, and
  present the personalized search results to the user.
Dependent claims: 25
26. A computer implemented system for personalizing a search of a document collection to a user, comprising:
a user model associated with the user, stored in a storage medium and comprising a plurality of cluster counts, each cluster count associated with a predetermined cluster that includes a plurality of related phrases, and storing a count of a number of instances of related phrases of the cluster appearing in a document accessed by the user; and
a query processing system executed by a computer and adapted to:
  receive a query from the user,
  select search results comprising a plurality of documents responsive to the query,
  identify phrases that are related to the query and associated with cluster counts in the user model,
  weight a plurality of scores of a corresponding plurality of the search results according to the cluster counts of the identified phrases,
  rank the plurality of the search results for presentation to the user according to their weighted scores, to provide personalized search results, and
  present the personalized search results to the user.
Dependent claims: 27
Specification