Relational text index creation and searching
First Claim
Patent Images
1. Computer readable storage media having a relational text index for use in searching a corpus of text documents on a computer system, the computer system including an input device, an output device, dynamic memory and a processor, the relational text index including:
- noun phrases, verb phrases, prepositional phrases, adverbial phrases, adjectival phrases, and clauses from parsed sentences in the corpus of text documents, caseframe extractions applied to said parsed sentences, thematic role extractions corresponding to of caseframe extractions, said thematic role extractions including actors, actions and objects, location information for sentences and documents corresponding to thematic role extractions;
wherein said thematic role extractions include generic thematic roles; and
wherein said generic thematic roles are selected from the group consisting of actors, actions, objects, experiencers and specifier.
1 Assignment
0 Petitions
Accused Products
Abstract
In an environment where it is desire to perform information extraction over a large quantity of textual data, methods, tools and structures are provided for building a relational text index from the textual data and performing searches using the relational text index.
132 Citations
15 Claims
-
1. Computer readable storage media having a relational text index for use in searching a corpus of text documents on a computer system, the computer system including an input device, an output device, dynamic memory and a processor, the relational text index including:
-
noun phrases, verb phrases, prepositional phrases, adverbial phrases, adjectival phrases, and clauses from parsed sentences in the corpus of text documents, caseframe extractions applied to said parsed sentences, thematic role extractions corresponding to of caseframe extractions, said thematic role extractions including actors, actions and objects, location information for sentences and documents corresponding to thematic role extractions;
wherein said thematic role extractions include generic thematic roles; and
wherein said generic thematic roles are selected from the group consisting of actors, actions, objects, experiencers and specifier. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 10, 11, 12)
-
-
9. Computer readable storage media having a relational text index for use in searching a corpus of text documents on a computer system, the computer system including an input device, an output device, dynamic memory and a processor, the relational text index including:
-
noun phrases, verb phrases, prepositional phrases, adverbial phrases, adjectival phrases, and clauses from parsed sentences in the corpus of text documents, caseframe extractions applied to said parsed sentences, thematic role extractions corresponding to of caseframe extractions, said thematic role extractions including actors, actions and objects, location information for sentences, documents corresponding to thematic role extractions, and a search tool for searching a relational text index, that includes a tool for accepting a search query from a user, said search query including actor and action roles, and a tool for querying said relational text index for instances when said actor or action is recorded in its appropriate role, for each query match, said tool being capable of retrieving the noun phrase of the extracted term, and for each query match, said tool being capable of retrieving the document and phrase where the match occurred.
-
-
13. Computer readable storage media having a relational text index for use in searching a corpus of text documents on a computer system, the computer system including an input device, an output device, dynamic memory and a processor, the relational text index including:
-
noun phrases, verb phrases, prepositional phrases, adverbial phrases, adjectival phrases and clauses from parsed sentences of the corpus of text documents, caseframe extractions corresponding to said parsed sentences, and thematic role assignments corresponding to said caseframe extraction, and file information for locating documents corresponding to information located in the relational text index;
wherein at least some of said thematic role assignments are selected from the group consisting of action, action, object, and specifier. - View Dependent Claims (14, 15)
-
Specification