Relational text index creation and searching
Abstract
In an environment where it is desired to perform information extraction over a large quantity of textual data, methods, tools and structures are provided for building a relational text index from the textual data and performing searches using the relational text index.
113 Citations
18 Claims
1. A method for creating a relational text index, the method comprising the steps of:
- accessing a natural language text document,
- parsing said document to identify grammatical parts sentences in said document,
- applying caseframes to said parsed sentences to generate caseframe extractions, caseframes being syntactic structures that recognize local area context,
- performing thematic role assignment on said caseframe extractions to generate thematic role extractions,
- performing unification for each sentence that generates more than one thematic role extraction to generate a single unified representation of each sentence, and
- utilizing sentence information to build a relational text index that is usable by a computer system;
wherein said thematic role assignment is performed by translating raw caseframe-extracted elements to specific thematic roles.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
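As an illustration of the thematic role assignment recited in claim 1, the following minimal Python sketch translates raw caseframe-extracted elements into thematic roles through a lookup table. The slot names (`subject`, `direct_object`, `by_phrase`), the `ROLE_TABLE` contents, and the function `assign_thematic_roles` are hypothetical, chosen only for illustration; the claim does not specify a data format or a particular mapping.

```python
from typing import Dict

# Hypothetical mapping from raw caseframe slots to thematic roles,
# keyed by the voice of the caseframe (an assumption for illustration).
ROLE_TABLE: Dict[str, Dict[str, str]] = {
    "active": {"subject": "actor", "verb": "action", "direct_object": "object"},
    "passive": {"subject": "object", "verb": "action", "by_phrase": "actor"},
}


def assign_thematic_roles(extraction: Dict[str, str], voice: str) -> Dict[str, str]:
    """Translate raw caseframe slots into thematic roles; unknown slots are dropped."""
    table = ROLE_TABLE[voice]
    return {table[slot]: text for slot, text in extraction.items() if slot in table}


active = assign_thematic_roles(
    {"subject": "the pump", "verb": "damaged", "direct_object": "the valve"},
    "active",
)
passive = assign_thematic_roles(
    {"subject": "the valve", "verb": "was damaged", "by_phrase": "the pump"},
    "passive",
)
print(active)
print(passive)
```

In this sketch the active and passive sentences both assign "the pump" to the actor role and "the valve" to the object role, which is the point of moving from syntactic slots to thematic roles.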
13. A method for creating a relational text index, the method comprising the steps of:
- accessing a natural language text document,
- parsing said document to identify grammatical parts sentences in said document,
- applying caseframes to said parsed sentences to generate caseframe extractions, caseframes being syntactic structures that recognize local area context,
- performing thematic role assignment on said caseframe extractions to generate thematic role extractions,
- performing unification for each sentence that generates more than one thematic role extraction to generate a single unified representation of each sentence, and
- utilizing sentence information to build a relational text index that is usable by a computer system;
wherein said thematic assignment uses conceptual thematic roles defined according to a particular caseframe useful in a specific subject area.
14. A method for creating a relational text index, the method comprising the steps of:
- accessing a natural language text document,
- parsing said document to identify grammatical parts sentences in said document,
- applying caseframes to said parsed sentences to generate caseframe extractions, caseframes being syntactic structures that recognize local area context,
- performing thematic role assignment on said caseframe extractions to generate thematic role extractions,
- performing unification for each sentence that generates more than one thematic role extraction to generate a single unified representation of each sentence, and
- utilizing sentence information to build a relational text index that is usable by a computer system;
wherein said step of building a relational text index includes the step of storing information selected from the group consisting of sentence information, semantic hierarchy information, semantic category information, generic thematic role information and specifier thematic role information.
15. A method for creating a relational text index, the method comprising the steps of:
- accessing a corpa of natural language text documents,
- for a plurality of said documents,
- parsing sentences in said documents to generate diagrammed sentences,
- applying caseframes to said diagrammed sentences to generate caseframe extractions,
- performing thematic role assignment on said caseframe extractions to generate thematic role extractions, said thematic role assignment being performed by translating raw caseframe-extractions to specific thematic roles, and
- accessing a relational text index file, and
- appending thematic role information to said relational text index file;
wherein said parsing step produces an output selected from the group consisting of noun phrases, verb phrases, prepositional phrases, adverbial phrases, adjectival phrases, clauses, and combinations of them;
wherein at least one of said caseframes is based on both a trigger term and a syntactic term.
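To make the last limitation of claim 15 concrete, here is a minimal Python sketch of a caseframe that fires only when both a trigger term and a syntactic condition are met. The `Caseframe` class, the `(phrase type, text)` representation of a diagrammed sentence, and the slot names are assumptions for illustration; the claim does not prescribe any of them.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple

Phrase = Tuple[str, str]  # (phrase type, text), e.g. ("NP", "the pump")


@dataclass
class Caseframe:
    trigger: str      # trigger term that must appear in the phrase
    phrase_type: str  # syntactic condition on the phrase holding the trigger

    def apply(self, sentence: List[Phrase]) -> Optional[Dict[str, str]]:
        """Return raw extracted slots, or None if the caseframe does not fire."""
        for i, (ptype, text) in enumerate(sentence):
            if ptype == self.phrase_type and self.trigger in text.split():
                extraction = {"verb": text}
                # Local context only: nearest noun phrase on each side of the trigger.
                before = [t for p, t in sentence[:i] if p == "NP"]
                after = [t for p, t in sentence[i + 1:] if p == "NP"]
                if before:
                    extraction["subject"] = before[-1]
                if after:
                    extraction["direct_object"] = after[0]
                return extraction
        return None


parsed = [("NP", "the pump"), ("VP", "damaged"), ("NP", "the valve")]
fired = Caseframe(trigger="damaged", phrase_type="VP").apply(parsed)
not_fired = Caseframe(trigger="repaired", phrase_type="VP").apply(parsed)
print(fired)
print(not_fired)
```

The second caseframe returns `None` because its trigger term is absent, even though the syntactic condition (a verb phrase) is present in the sentence.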
16. A method for creating a relational text index, the method comprising the steps of:
- accessing a corpa of natural language text documents,
- for a plurality of said documents,
- parsing sentences in said documents to generate diagrammed sentences,
- applying caseframes to said diagrammed sentences to generate caseframe extractions,
- performing thematic role assignment on said caseframe extractions to generate thematic role extractions, said thematic role assignment being performed by translating raw caseframe-extractions to specific thematic roles, and
- accessing a relational text index file, and
- appending thematic role information to said relational text index file;
wherein said parsing step produces an output selected from the group consisting of noun phrases, verb phrases, prepositional phrases, adverbial phrases, adjectival phrases, clauses, and combinations of them;
further comprising the step of performing unification for each sentence that generates more than one thematic role extraction to generate a single unified representation of each sentence.
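The unification step in claim 16 can be pictured with a short Python sketch. The claims do not describe how unification is carried out, so the merge below (collecting the distinct fillers of each role across a sentence's extractions) is a simple stand-in, and the function name `unify` and the dictionary representation are hypothetical.

```python
from typing import Dict, List


def unify(extractions: List[Dict[str, str]]) -> Dict[str, List[str]]:
    """Merge several thematic role extractions from one sentence into one record.

    Each role maps to the list of distinct fillers seen for it, in first-seen order.
    """
    unified: Dict[str, List[str]] = {}
    for extraction in extractions:
        for role, text in extraction.items():
            values = unified.setdefault(role, [])
            if text not in values:
                values.append(text)
    return unified


# Two extractions produced by different caseframes for the same sentence.
sentence_extractions = [
    {"actor": "the pump", "action": "damaged"},
    {"action": "damaged", "object": "the valve"},
]
unified = unify(sentence_extractions)
print(unified)
```

Keeping a list per role is one possible design choice: it preserves conflicting fillers instead of silently discarding one of them.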
17. A method for creating a relational text index, the method comprising the steps of:
- accessing a corpa of natural language text documents,
- for a plurality of said documents,
- parsing sentences in said documents to generate diagrammed sentences,
- applying caseframes to said diagrammed sentences to generate caseframe extractions,
- performing thematic role assignment on said caseframe extractions to generate thematic role extractions, said thematic role assignment being performed by translating raw caseframe-extractions to specific thematic roles, and
- accessing a relational text index file, and
- appending thematic role information to said relational text index file;
wherein said parsing step produces an output selected from the group consisting of noun phrases, verb phrases, prepositional phrases, adverbial phrases, adjectival phrases, clauses, and combinations of them;
wherein said thematic role assignment includes assigning roles selected from the group consisting of actions, actors, objects, experiencers, and specifiers.
18. A method for creating a relational text index, the method comprising the steps of:
- accessing a corpa of natural language text documents,
- for a plurality of said documents,
- parsing sentences in said documents to generate diagrammed sentences,
- applying caseframes to said diagrammed sentences to generate caseframe extractions,
- performing thematic role assignment on said caseframe extractions to generate thematic role extractions, said thematic role assignment being performed by translating raw caseframe-extractions to specific thematic roles, and
- accessing a relational text index file, and
- appending thematic role information to said relational text index file;
wherein said parsing step produces an output selected from the group consisting of noun phrases, verb phrases, prepositional phrases, adverbial phrases, adjectival phrases, clauses and combinations of them;
wherein said step of building a relational text index includes the step of, for each specifier role, record said role's raw form, a link to an extracted role specified by specifier, and a full specifier phrase for said specifier role in said relational text index.
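The three items that claim 18 records for each specifier role (its raw form, a link to the role it specifies, and the full specifier phrase) map naturally onto a relational table. The Python sketch below uses the standard `sqlite3` module with an in-memory database; the two-table schema, the column names, and the helper functions are assumptions for illustration, since the claim states what is recorded but not how it is laid out.

```python
import sqlite3

# Hypothetical two-table layout; the claim fixes what is recorded, not the schema.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE role (
        role_id     INTEGER PRIMARY KEY,
        sentence_id INTEGER NOT NULL,
        role_type   TEXT NOT NULL,   -- e.g. actor, action, object
        raw_form    TEXT NOT NULL
    );
    CREATE TABLE specifier (
        specifier_id      INTEGER PRIMARY KEY,
        raw_form          TEXT NOT NULL,   -- the specifier's raw form
        specified_role_id INTEGER NOT NULL REFERENCES role(role_id),
        full_phrase       TEXT NOT NULL    -- the full specifier phrase
    );
""")


def add_role(sentence_id: int, role_type: str, raw_form: str) -> int:
    """Insert an extracted role and return its row id."""
    cur = conn.execute(
        "INSERT INTO role (sentence_id, role_type, raw_form) VALUES (?, ?, ?)",
        (sentence_id, role_type, raw_form),
    )
    return cur.lastrowid


def add_specifier(raw_form: str, specified_role_id: int, full_phrase: str) -> None:
    """Insert a specifier linked to the role it specifies."""
    conn.execute(
        "INSERT INTO specifier (raw_form, specified_role_id, full_phrase) VALUES (?, ?, ?)",
        (raw_form, specified_role_id, full_phrase),
    )


# Example sentence: "The pump damaged the valve in the engine room."
action_id = add_role(1, "action", "damaged")
add_specifier("engine room", action_id, "in the engine room")

rows = conn.execute("""
    SELECT s.raw_form, s.full_phrase, r.role_type, r.raw_form
    FROM specifier s JOIN role r ON r.role_id = s.specified_role_id
""").fetchall()
print(rows)
```

The join at the end follows the stored link from the specifier back to the role it modifies, which is the kind of lookup a relational index makes possible.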
Specification