Relational text index creation and searching
First Claim
Patent Images
1. A system for performing a search using a relational text index, the system comprising:
- a computer system having an input device, an output device, storage media, dynamic memory and a processor, documents located on said storage media, a parser capable of parsing natural language sentences contained within said documents to identify grammatical parts of the natural language sentences, a tool for applying caseframes to said parsed sentences to generate caseframe extractions, caseframes being syntactic structures that recognize local area context, a tool for performing thematic role assignment on said caseframe extractions to generate thematic role extractions, a tool for utilizing sentence information to build a relational text index that is usable by said computer system, said relational text index being located on said storage media and a search tool performing a search of said documents using said relational text index;
wherein said search tool can search using at least one thematic role as a search determiner; and
wherein said search tool can search using a thematic role specifier as a search determiner.
1 Assignment
0 Petitions
Accused Products
Abstract
In an environment where it is desire to perform information extraction over a large quantity of textual data, methods, tools and structures are provided for building a relational text index from the textual data and performing searches using the relational text index.
115 Citations
16 Claims
-
1. A system for performing a search using a relational text index, the system comprising:
-
a computer system having an input device, an output device, storage media, dynamic memory and a processor, documents located on said storage media, a parser capable of parsing natural language sentences contained within said documents to identify grammatical parts of the natural language sentences, a tool for applying caseframes to said parsed sentences to generate caseframe extractions, caseframes being syntactic structures that recognize local area context, a tool for performing thematic role assignment on said caseframe extractions to generate thematic role extractions, a tool for utilizing sentence information to build a relational text index that is usable by said computer system, said relational text index being located on said storage media and a search tool performing a search of said documents using said relational text index;
wherein said search tool can search using at least one thematic role as a search determiner; and
wherein said search tool can search using a thematic role specifier as a search determiner. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 11, 12, 13)
a facility for accepting a search query from a user, and a tool for collapsing on the root form of search terms.
-
-
13. A system as recited in claim 1, wherein said search tool is capable of comparing verb roles to pre-defined meta types.
-
10. A system for performing a search using a relational text index, the system comprising:
-
a computer system having an input device, an output device, storage media, dynamic memory and a processor, documents located on said storage media, a parser capable of parsing natural language sentences contained within said documents to identify grammatical parts of the natural language sentences, a tool for applying caseframes to said parsed sentences to generate caseframe extractions, caseframes being syntactic structures that recognize local area context, a tool for performing thematic role assignment on said caseframe extractions to generate thematic role extractions, a tool for utilizing sentence information to build a relational text index that is usable by said computer system, said relational text index being located on said storage media and a search tool performing a search of said documents using said relational text index;
a tool for accepting a search query from a user, said search query including actor and action roles, a tool for querying said relational text index for instances when said actor or action is recorded in its appropriate role, for each query match, said tool being capable of retrieving the noun phrase of the extracted term, and for each query match, said tool being capable of retrieving the document and phrase where the match occurred, and a facility for displaying search results to the user.
-
-
14. A system for creating a relational text index search system, the system comprising:
-
a computer system that includes an input device, an output device, storage media and a processor, a corpa of natural language text documents located on said storage media, a parser capable of parsing natural language sentences in said documents to generate diagrammed sentences, said parser being capable of producing an output selected from the group consisting of noun phrases, verb phrases, prepositional phrases, adverbial phrases, adjectival phrases, clauses, and combinations of them, a tool for applying caseframes to said diagrammed sentences to generate caseframe extractions, a tool for performing thematic role assignment on said caseframe extractions to generate thematic role extractions, said thematic role assignment tool being capable of translating raw caseframe-extractions to specific thematic roles, relational text index file containing thematic role, and a search tool for searching of said corpa of documents using said relational text index;
a facility for accepting a search query from a user and permitting a user to specify at least one search role selected from the group consisting of actor, action, object and specifier; and
a tool for comparing a user'"'"'s search roles to thematic roles in said relational text index. - View Dependent Claims (15, 16)
-
Specification