Document retrieval by information unit
First Claim
1. A method of identifying an information unit in response to a query of a search space, the search space comprising a plurality of pages;
- said method comprising;
(a) responsive to a query comprising a plurality of keywords, determining a set of pages within the search space wherein each page in said set of pages contains at least one of said plurality of keywords; and
(b) identifying at least one information unit comprising one or more pages selected from said set of pages such that each page in said information unit contains at least one of said plurality of keywords;
every page in said information unit being linked, directly or indirectly, to every other page in said information unit.
2 Assignments
0 Petitions
Accused Products
Abstract
A method of searching a search space comprising a plurality of pages in response to a query comprising a plurality of keywords includes identifying at least one information unit. Where a query includes exactly two keywords, an information unit comprises one page which contains both keywords in the query or two pages selected from the search space such that the first page in the information unit contains the first keyword in the query and the second page in the information unit contains the second keyword in the query; where an information unit contains two pages, one page is linked, directly or indirectly, to the other page. Relaxed query processing techniques enable the method to identify information units which do not contain every keyword in the query, which have only semantically similar words or synonyms, and which have keywords of differing relative importance. The method is adapted to report the identified information units and to accommodate altered queries provided as a result of a report.
62 Citations
45 Claims
-
1. A method of identifying an information unit in response to a query of a search space, the search space comprising a plurality of pages;
- said method comprising;
(a) responsive to a query comprising a plurality of keywords, determining a set of pages within the search space wherein each page in said set of pages contains at least one of said plurality of keywords; and
(b) identifying at least one information unit comprising one or more pages selected from said set of pages such that each page in said information unit contains at least one of said plurality of keywords;
every page in said information unit being linked, directly or indirectly, to every other page in said information unit. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21)
- said method comprising;
-
22. A method of identifying and reporting an information unit in response to a query of a search space, the search space comprising a plurality of pages;
- said method comprising;
(a) responsive to a query comprising a plurality of keywords, determining a set of pages within the search space wherein each page in said set of pages contains at least one of said plurality of keywords;
(b) identifying at least one information unit comprising one or more pages selected from said set of pages such that each page in said information unit contains at least one of said plurality of keywords;
every page in said information unit being linked, directly or indirectly, to every other page in said information unit;
(c) reporting said at least one information unit to a user or, in the alternative, reporting that an information unit which satisfies the query cannot be found; and
(d) responsive to an altered query provided as a result of said reporting, repeating (a) through (c) for said altered query. - View Dependent Claims (23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40)
- said method comprising;
-
41. A computer-based system for identifying and reporting an information unit in response to a query of a search space, the search space comprising a plurality of pages;
- said system comprising;
means for receiving a query input;
said query input comprising a plurality of keywords; and
means, responsive to said query input, for identifying at least one information unit;
said at least one information unit comprising one or more pages within the search space selected such that each page in said information unit contains at least one of said plurality of keywords;
every page in said information unit being linked, directly or indirectly, to every other page in said information unit. - View Dependent Claims (42, 43, 44, 45)
- said system comprising;
Specification