Methods and apparatus for query formulation
First Claim
1. A method for query formulation, comprising:
searching, performed at least in part with a configuration of computing hardware and programmable memory, a corpus, to produce a first collection of records, by testing for appearances of a lexical unit that can be indicative of an object-of-interest;
identifying, performed at least in part with a configuration of computing hardware and programmable memory, a first collection of role values corresponding to the first collection of records;
performing, performed at least in part with a configuration of computing hardware and programmable memory, frequency analysis of sets of one or more lexical units, contained within the first collection of role values, to produce a first candidate list of exclude terms; and
wherein each role value is part of a frame instance, each instance produced by application of a frame extraction rule to a record of the corpus.
Abstract
To the standard inverted index database, a new “To” operator is added. The “To” operator treats the standard single-level linear collection of records as being organized into localized clusters. Techniques for hierarchical clusters are presented. During indexing, hierarchical clusters are serialized according to a uniform visitation procedure. Serialization produces bit maps, one for each hierarchical level, that preserve the hierarchical level of each record and its location in the serialization sequence. Also presented are techniques, when searching for an Object-of-Interest, for greatly improving the process by which Exclude Terms are identified. Exclude Terms are particularly useful when the lexical units, representing an Object-of-Interest, are ambiguous. When in the mode of searching for Exclude Terms, the Object-of-Interest can match anywhere in a snippet, rather than just in the focus sentence. Using the “To” operator, the focus sentences thus found are converted into role values, from which are identified candidate Exclude Terms.
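The serialization step described in the abstract can be illustrated with a minimal sketch. It assumes the "uniform visitation procedure" is a depth-first, pre-order walk and that each bit map is a plain list of 0/1 values indexed by position in the serialization sequence; the abstract specifies neither, and the names `Record` and `serialize` are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class Record:
    """Hypothetical record in a hierarchical cluster (e.g. a snippet holding sentences)."""
    text: str
    children: List["Record"] = field(default_factory=list)


def serialize(roots: List[Record]) -> Tuple[List[str], List[List[int]]]:
    """Flatten clusters in pre-order (an assumed visitation order).

    Returns the serialized record texts and one bit map per hierarchical
    level. Bit i of bit map d is 1 when the record at position i of the
    serialization sits at level d.
    """
    sequence: List[str] = []
    levels: List[int] = []

    def visit(node: Record, depth: int) -> None:
        sequence.append(node.text)
        levels.append(depth)
        for child in node.children:
            visit(child, depth + 1)

    for root in roots:
        visit(root, 0)

    n_levels = max(levels, default=-1) + 1
    bitmaps = [[1 if lvl == d else 0 for lvl in levels] for d in range(n_levels)]
    return sequence, bitmaps


roots = [
    Record("snippet A", [Record("sentence 1"), Record("sentence 2")]),
    Record("snippet B", [Record("sentence 3")]),
]
sequence, bitmaps = serialize(roots)
```

In this sketch each position in the sequence has exactly one bit set across the bit maps, so both the level of a record and its place in the serialization can be recovered from the bit maps alone.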
22 Claims
1. A method for query formulation, comprising:
searching, performed at least in part with a configuration of computing hardware and programmable memory, a corpus, to produce a first collection of records, by testing for appearances of a lexical unit that can be indicative of an object-of-interest;
identifying, performed at least in part with a configuration of computing hardware and programmable memory, a first collection of role values corresponding to the first collection of records;
performing, performed at least in part with a configuration of computing hardware and programmable memory, frequency analysis of sets of one or more lexical units, contained within the first collection of role values, to produce a first candidate list of exclude terms; and
wherein each role value is part of a frame instance, each instance produced by application of a frame extraction rule to a record of the corpus.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
12. A system for query formulation, comprising:
a sub-system configured, as a result of the computing hardware and programmable memory, to accomplish searching a corpus, to produce a first collection of records, by testing for appearances of a lexical unit that can be indicative of an object-of-interest;
a sub-system configured, as a result of the computing hardware and programmable memory, to accomplish identifying a first collection of role values corresponding to the first collection of records;
a sub-system configured, as a result of the computing hardware and programmable memory, to accomplish performing frequency analysis of sets of one or more lexical units, contained within the first collection of role values, to produce a first candidate list of exclude terms; and
wherein each role value is part of a frame instance, each instance produced by application of a frame extraction rule to a record of the corpus.
Dependent claims: 13, 14, 15, 16, 17, 18, 19, 20, 21, 22
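The frequency-analysis step recited in claims 1 and 12 can be sketched as follows. This is an illustration under stated assumptions, not the claimed implementation: role values are taken to be plain strings already extracted from frame instances, "sets of one or more lexical units" are approximated by word n-grams, the object-of-interest is a single token, and each n-gram is counted once per role value. The function name and parameters are hypothetical.

```python
import re
from collections import Counter
from typing import List, Tuple


def candidate_exclude_terms(
    role_values: List[str],
    object_of_interest: str,
    max_n: int = 2,
    top_k: int = 5,
) -> List[Tuple[str, int]]:
    """Rank n-grams found in role values as candidate exclude terms.

    Assumptions (not from the claims): lowercase alphanumeric tokenization,
    n-grams of 1..max_n tokens, and n-grams containing the
    object-of-interest token are skipped.
    """
    ooi = object_of_interest.lower()
    counts: Counter = Counter()
    for value in role_values:
        tokens = re.findall(r"[a-z0-9]+", value.lower())
        seen = set()  # count each n-gram at most once per role value
        for n in range(1, max_n + 1):
            for i in range(len(tokens) - n + 1):
                gram = tokens[i:i + n]
                if ooi in gram:
                    continue
                seen.add(" ".join(gram))
        counts.update(seen)
    return counts.most_common(top_k)


role_values = ["apple pie recipe", "baked apple pie", "apple iphone launch"]
top = candidate_exclude_terms(role_values, "apple", top_k=1)
```

With these made-up role values for the ambiguous term "apple", the word "pie" is the only n-gram that appears in two role values, so it ranks first. A user interested in the company rather than the fruit might then pick it as an exclude term.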
Specification