Information retrieval using dynamic guided navigation
First Claim
1. A method comprising:
receiving, by a computing device, a search category and a current query from a current user;
determining, by the computing device, relevance of documents comprising a corpus of documents relating to the search category;
obtaining, by the computing device, degree of interest data by users in one or more documents of the corpus of documents, the degree of interest data determined by a degree of interest data group including one or more of measuring how fast a user reads the one or more documents based on page scroll speed and average reading time based on length of the one or more documents, whether the user chose to cut and paste a portion of the document for further reading, whether the user bookmarked the one or more documents, or combinations thereof;
determining, by the computing device, documents comprising a search result and an order of the documents comprising the search result based on the determined relevance and obtained degree of interest data;
obtaining, by the computing device, direct search query terms from the users;
identifying, by the computing device, document query terms from content of the corpus of documents;
determining, by the computing device, query to query correlations from a query correlation data structure, where the determining of the query to query correlations comprises tracking query terms provided by the users in past search sessions which the users selected and where the query correlation data structure comprises, for each pair of the query terms, a measure selected from a group of measures consisting of a probability of co-occurrence in past user sessions and a degree of co-occurrence in a document of the corpus of documents;
determining, by the computing device, at least one query suggestion, the at least one query suggestion obtained from the direct search query terms, document query terms, and the query to query correlations;
tracking, by the computing device, (a) the search category and the current query specified by the current user, (b) interaction with each document clicked through from the documents comprising the search result, and (c) a query selected from the at least one query suggestion, to develop session history for both the users in the past search sessions and the current user; and
generating, by the computing device, a search results web page from the interaction with the each document and the documents comprising the search result, ordered in the determined order, the search result web page comprising the at least one query suggestion.
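The ordering step in the claim combines document relevance with degree of interest data. A minimal sketch of one way this could work is given here, assuming a simple weighted blend. The `Interaction` record, the weights, the reading-speed constant, and the `alpha` parameter are all hypothetical; the claim does not specify them. Page scroll speed is left out to keep the sketch short.

```python
from dataclasses import dataclass


@dataclass
class Interaction:
    """One user's interaction with one clicked-through document (hypothetical record)."""
    reading_seconds: float  # time the user spent on the document
    word_count: int         # document length, used to normalize reading time
    copied_text: bool       # user cut and pasted a portion of the document
    bookmarked: bool        # user bookmarked the document


# Assumed constants; the claim does not give any of these values.
TYPICAL_WORDS_PER_SECOND = 4.0
W_READ, W_COPY, W_BOOKMARK = 0.5, 0.25, 0.25


def interest_score(ix: Interaction) -> float:
    """Combine the interest signals into a score in [0, 1]."""
    expected = ix.word_count / TYPICAL_WORDS_PER_SECOND
    # Ratio of actual to expected reading time, capped at 1.0.
    read_ratio = min(ix.reading_seconds / expected, 1.0) if expected > 0 else 0.0
    return W_READ * read_ratio + W_COPY * ix.copied_text + W_BOOKMARK * ix.bookmarked


def order_results(relevance: dict[str, float],
                  interactions: dict[str, list[Interaction]],
                  alpha: float = 0.7) -> list[str]:
    """Order document ids by a blend of relevance and mean interest score."""
    def blended(doc_id: str) -> float:
        ixs = interactions.get(doc_id, [])
        mean_interest = sum(map(interest_score, ixs)) / len(ixs) if ixs else 0.0
        return alpha * relevance[doc_id] + (1 - alpha) * mean_interest
    return sorted(relevance, key=blended, reverse=True)
```

With this blend, a slightly less relevant document that users read fully, copied from, and bookmarked can rank above a more relevant document with no recorded interest.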
Abstract
An apparatus and method for providing relevant search results and query terms are disclosed herein. Natural language processing of the documents and previous search session history are used to dynamically determine document relevance, queries relevant to search categories prior to start of a search session, and query to query correlations.
20 Claims
1. A method as recited in full under First Claim above.
Dependent claims: 2, 3, 4, 5
6. A system comprising:
a processor;
a storage medium for tangibly storing thereon program logic for execution by the processor, the stored program logic comprising:
query information logic executed by the processor for storing occurrence and frequency data relating to each of a plurality of query terms from search queries specified by a plurality of users and natural language processing of content of a corpus of documents;
query correlation logic executed by the processor for storing correlation data for each pair of the plurality of query terms from query term content in the content of the corpus of documents and degree of query term pairings specified by the plurality of users, where the query correlation logic stores, for each pair of the plurality of query terms, a measure selected from a group of measures consisting of a probability of co-occurrence in past user sessions and a degree of co-occurrence in a document of the corpus of documents;
interest obtaining logic executed by the processor for obtaining degree of interest data by the plurality of users in one or more documents of the corpus of documents, the degree of interest data determined by a degree of interest data group including one or more of measuring how fast a user reads the one or more documents based on page scroll speed and average reading time based on length of the one or more documents, whether the user chose to cut and paste a portion of the document for further reading, whether the user bookmarked the one or more documents, or combinations thereof;
determining logic executed by the processor for determining at least one query suggestion to provide with a search result, the at least one query suggestion obtained from the search queries specified by the plurality of users, the search queries from the natural language processing of the content, and the correlation data;
tracking logic executed by the processor for tracking (a) the search category and the current query specified by the current user, (b) interaction with each document clicked through from the documents comprising the search result, and (c) a query selected from the at least one query suggestion, to develop session history for both the users in the past search sessions and the current user; and
search results web page generating logic executed by the processor for generating a search results web page from the interaction with the each document, the obtained degree of interest data, and the documents comprising the search result, ordered in the determined order, the search result web page comprising the at least one query suggestion.
Dependent claims: 7, 8, 9
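The query correlation logic stores a measure for each pair of query terms, one option being the probability of co-occurrence in past user sessions. The sketch below illustrates that option only, assuming each past session is available as a list of query terms. The function names and the choice of `frozenset` keys are hypothetical and not taken from the patent.

```python
from collections import Counter
from itertools import combinations


def build_query_correlations(sessions: list[list[str]]) -> dict[frozenset, float]:
    """Map each unordered pair of query terms to the fraction of sessions containing both."""
    pair_counts: Counter = Counter()
    for session in sessions:
        # Count each pair at most once per session.
        for a, b in combinations(sorted(set(session)), 2):
            pair_counts[frozenset((a, b))] += 1
    total = len(sessions)
    return {pair: n / total for pair, n in pair_counts.items()} if total else {}


def suggest(query: str, correlations: dict[frozenset, float], k: int = 3) -> list[str]:
    """Return up to k terms that most often co-occurred with `query` in past sessions."""
    scored = []
    for pair, p in correlations.items():
        if query in pair:
            (other,) = pair - {query}
            scored.append((p, other))
    # Highest probability first; ties broken alphabetically.
    scored.sort(key=lambda t: (-t[0], t[1]))
    return [term for _, term in scored[:k]]
```

For example, given the sessions `[["jaguar", "car"], ["jaguar", "cat"], ["jaguar", "car", "price"]]`, the pair ("jaguar", "car") gets probability 2/3, so `suggest("jaguar", ...)` lists "car" first.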
10. A system comprising:
a processor;
a storage medium for tangibly storing thereon program logic for execution by the processor, the stored program logic comprising:
document identifying logic executed by the processor for identifying documents relating to a search category;
document analyzing logic executed by the processor for analyzing the documents to determine internal statistical content;
interestingness tracking logic executed by the processor for tracking users' interestingness data for the documents, where the tracking of the users' interestingness data comprises tracking user interaction with each document clicked through from a search result page, the tracking of the user interaction with each document comprising determining a user interaction data group including one or more of measuring how fast a user reads the each document based on page scroll speed and average reading time based on length of the each document, whether the user chose to cut and paste a portion of the document for further reading, whether the user bookmarked the each document, or combinations thereof;
first query term identifying logic executed by the processor for identifying first query terms from the documents;
second query term tracking logic executed by the processor for tracking second query terms specified by the users;
interest tracking logic executed by the processor for tracking users' indication of interest in pairs of query terms;
query suggestion tracking logic executed by the processor for tracking a query suggestion selected from a plurality of query suggestions and obtained from the first query terms, the second query terms, and the users' indication of interest in pairs of query terms;
search results web page generating logic executed by the processor for generating the search results page from the interaction with the each document and documents comprising a search result in the identified documents, ordered in a determined order, wherein the search results page comprises the plurality of query suggestions.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18, 19, 20
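The first query term identifying logic draws candidate query terms from the documents themselves. As a rough illustration, the sketch below ranks terms by how many documents contain them. This plain document-frequency count stands in for whatever statistical or natural language analysis the patent actually contemplates; the stopword list and the function name are assumptions made for the example.

```python
import re
from collections import Counter

# Small illustrative stopword list (assumed, not from the patent).
STOPWORDS = frozenset({"a", "an", "and", "the", "of", "in", "to", "for", "is", "on"})


def document_query_terms(documents: list[str], top_n: int = 5) -> list[str]:
    """Return the top_n terms ranked by the number of documents containing them."""
    doc_freq: Counter = Counter()
    for text in documents:
        # Lowercase alphabetic tokens only; each term counts once per document.
        tokens = set(re.findall(r"[a-z]+", text.lower())) - STOPWORDS
        doc_freq.update(tokens)
    # Most widespread terms first; ties broken alphabetically.
    ranked = sorted(doc_freq.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:top_n]]
```

Terms found this way could then be merged with user-specified query terms when assembling the plurality of query suggestions.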
Specification