Customizing information by combining pair of annotations from at least two different documents
First Claim
1. A method for obtaining information embedded in unstructured text, comprising:
- generating computer-readable annotations based on the unstructured text, at least one of the computer-readable annotations comprising an indication of a linguistic feature;
generating at least one computer-readable relation between at least one pair of the computer-readable annotations;
wherein the unstructured text is from two or more different electronic documents, and the relation relates a first annotation from a first one of the electronic documents to a second annotation from a different one of the electronic documents;
storing characteristic data structures in a database, the characteristic data structures comprising the at least one pair of the computer-readable annotations and the at least one computer-readable relation;
receiving a query comprising at least one criterion;
returning results from the database, wherein the results comprise the at least one pair of the annotations and the at least one computer-readable relation;
generating an information result based on the results that are returned from the database and that comprise the at least one pair of the annotations and the at least one computer-readable relation;
wherein the relation relates the first annotation to the second annotation based on one or more similarities between contents of the first annotation and the second annotation, and wherein the relation represents a semantic relationship between the contents;
wherein the information result comprises a grammatical unit not present in its entirety in any of the at least one pair of the annotations and the at least one computer-readable relation returned from the database;
wherein generating the information result in response to the query includes generating the grammatical unit by applying a transformation to the at least one pair of the annotations and the at least one computer-readable relation returned from the database by combining the at least one pair of the annotations from at least two different sentences from the unstructured text according to the at least one relation to generate the grammatical unit.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method for obtaining information embedded in unstructured text is provided. The method comprising generating computer-readable annotations based on the unstructured text, at least one of the computer-readable annotations comprising an indication of a linguistic feature. A pair of the computer-readable annotations may be used to generate at least one computer-readable relation between the pair. The annotations and/or relations may be stored as characteristic data structures in a database. A query comprising at least one criterion may be received. In response to the query, an information result may be generated based on at least one of the characteristic data structures stored in the database.
177 Citations
55 Claims
-
1. A method for obtaining information embedded in unstructured text, comprising:
-
generating computer-readable annotations based on the unstructured text, at least one of the computer-readable annotations comprising an indication of a linguistic feature; generating at least one computer-readable relation between at least one pair of the computer-readable annotations; wherein the unstructured text is from two or more different electronic documents, and the relation relates a first annotation from a first one of the electronic documents to a second annotation from a different one of the electronic documents; storing characteristic data structures in a database, the characteristic data structures comprising the at least one pair of the computer-readable annotations and the at least one computer-readable relation; receiving a query comprising at least one criterion; returning results from the database, wherein the results comprise the at least one pair of the annotations and the at least one computer-readable relation; generating an information result based on the results that are returned from the database and that comprise the at least one pair of the annotations and the at least one computer-readable relation; wherein the relation relates the first annotation to the second annotation based on one or more similarities between contents of the first annotation and the second annotation, and wherein the relation represents a semantic relationship between the contents; wherein the information result comprises a grammatical unit not present in its entirety in any of the at least one pair of the annotations and the at least one computer-readable relation returned from the database; wherein generating the information result in response to the query includes generating the grammatical unit by applying a transformation to the at least one pair of the annotations and the at least one computer-readable relation returned from the database by combining the at least one pair of the annotations from at least two different sentences from the unstructured text according to the at least one relation to generate the grammatical unit. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25)
-
-
26. A non-transitory computer readable medium having stored thereon a program, the program executable by a processor for performing a method for obtaining information embedded in unstructured text, the method comprising:
-
generating computer-readable annotations based on the unstructured text, at least one of the computer-readable annotations comprising an indication of a linguistic feature; generating at least one computer-readable relation between at least one pair of the computer-readable annotations; wherein the unstructured text is from two or more different electronic documents, and the relation relates a first annotation from a first one of the electronic documents to a second annotation from a different one of the electronic documents; storing characteristic data structures in a database, the characteristic data structures comprising the at least one pair of the computer-readable annotations and the at least one computer-readable relation; receiving a query comprising at least one criterion; returning results from the database, wherein the results comprise the at least one pair of the annotations and the at least one computer-readable relation; generating an information result based on the results that are returned from the database and that comprise the at least one pair of the annotations and the at least one computer-readable relation; wherein the relation relates the first annotation to the second annotation based on one or more similarities between contents of the first annotation and the second annotation, and wherein the relation represents a semantic relationship between the contents; wherein the information result comprises a grammatical unit not present in its entirety in any of the at least one pair of the annotations and the at least one computer-readable relation returned from the database; wherein generating the information result in response to the query includes generating the grammatical unit by applying a transformation to the at least one pair of the annotations and the at least one computer-readable relation returned from the database by combining the at least one pair of the annotations from at least two different sentences from the unstructured text according to the at least one relation to generate the grammatical unit.
-
-
27. A method for obtaining information embedded in unstructured text, comprising:
-
prior to receiving a query; generating computer-readable annotations based on the unstructured text, at least one of the computer-readable annotations comprising an indication of a linguistic feature; generating at least one computer-readable relation between at least one pair of the computer-readable annotations; wherein a relation, from the at least one computer-readable relation, relates a first annotation from a first electronic document to a second annotation from a second electronic document, which is different than the first electronic document; storing characteristic data structures in a database, the characteristic data structures comprising the at least one pair of the computer-readable annotations and the at least one computer-readable relation; receiving the query comprising at least one criterion; and in response to receiving the query; generating an information result in response to the query based on at least one of the characteristic data structures stored in the database by transforming, into a different format, each annotation associated with results from the database based on the query, and scoring and ranking each transformed annotation; wherein transforming each annotation comprises performing;
transforming annotations comprising pronouns into preferred names based on entities that the pronouns refer to, transforming annotations into synonyms for concepts represented in the annotations, and combining multiple annotations, according to the at least one relation, to generate a single natural language sentence;wherein the at least one relation relates a first annotation from the multiple annotations to a second annotation from the multiple annotations based on one or more similarities between contents of the first annotation and the second annotation, and wherein the at least one relation represents a semantic relationship between the contents. - View Dependent Claims (28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40)
-
-
41. A non-transitory computer readable medium having stored thereon a program, the program executable by a processor for performing a method for obtaining information embedded in unstructured text, the method comprising:
-
prior to receiving a query; generating computer-readable annotations based on the unstructured text, at least one of the computer-readable annotations comprising an indication of a linguistic feature; generating at least one computer-readable relation between at least one pair of the computer-readable annotations; wherein a relation, from the at least one computer-readable relation, relates a first annotation from a first electronic document to a second annotation from a second electronic document, which is different than the first electronic document; storing characteristic data structures in a database, the characteristic data structures comprising the at least one pair of the computer-readable annotations and the at least one computer-readable relation; receiving the query comprising at least one criterion; and in response to receiving the query; generating an information result in response to the query based on at least one of the characteristic data structures stored in the database by transforming, into a different format, each annotation associated with results from the database based on the query, and scoring and ranking each transformed annotation; wherein transforming each annotation comprises performing;
transforming annotations comprising pronouns into preferred names based on entities that the pronouns refer to, transforming annotations into synonyms for concepts represented in the annotations, and combining multiple annotations, according to the at least one relation, to generate a single natural language sentence;wherein the at least one relation relates a first annotation from the multiple annotations to a second annotation from the multiple annotations based on one or more similarities between contents of the first annotation and the second annotation, and wherein the at least one relation represents a semantic relationship between the contents. - View Dependent Claims (42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55)
-
Specification