COMPOSING TEXT AND STRUCTURED DATABASES
First Claim
1. A system for linking information from at least two data sources, the system comprising:
- a first data source comprising a plurality of documents comprising text pertaining to at least one object;
a second data source comprising a plurality of structured records comprising at least one characteristic of the at least one object, each characteristic comprising one property name and an associated property value corresponding to the property name for the at least one object; and
a processor for determining one or more instance-based traits for each object and for associating at least one record in the second data source with the at least one document from the first data source that refers to each object, each trait comprising one or more characteristics that identifiably distinguish each object from all other objects in the plurality of documents.
2 Assignments
0 Petitions
Accused Products
Abstract
A framework is provided for composing texts about objects with structured information about these objects, and thus disclosed are methodologies for linking information from at least two data sources—one comprising a plurality of documents comprising text pertaining to at least one object, and one comprising a plurality of structured records comprising at least one characteristic of the at least one object, each characteristic comprising one property name and an associated property value corresponding to the property name for the at least one object—by determining one or more instance-based traits for each object in both data sources and associating at least one record with at least one document that refers to each object, each trait comprising one or more characteristics that identifiably distinguish each object from all other objects.
-
Citations
20 Claims
-
1. A system for linking information from at least two data sources, the system comprising:
-
a first data source comprising a plurality of documents comprising text pertaining to at least one object; a second data source comprising a plurality of structured records comprising at least one characteristic of the at least one object, each characteristic comprising one property name and an associated property value corresponding to the property name for the at least one object; and a processor for determining one or more instance-based traits for each object and for associating at least one record in the second data source with the at least one document from the first data source that refers to each object, each trait comprising one or more characteristics that identifiably distinguish each object from all other objects in the plurality of documents. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method for linking information from at least two data sources, the method comprising:
-
describing a plurality of objects using a set of characteristics wherein each object comprising the plurality of objects is identifiably distinguishable by one or more characteristics from among the plurality of characteristics; and using at least one additional characteristic that does not comprise the set of characteristic to score a relevancy of a document to a record from a first data source that shares at least one of the characteristics of another document from a second data source, wherein the additional characteristic is from the document. - View Dependent Claims (12, 13, 14, 15)
-
-
16. A computer-readable medium comprising computer-readable instructions for linking information from at least two data sources, the computer-readable instructions comprising instructions that cause a processor to:
-
map a plurality of documents in a first data source to a plurality of traits in a second data source; and score the relevancy of a document from among the plurality of documents to at least one record from the second data source. - View Dependent Claims (17, 18, 19, 20)
-
Specification