System And Method For Generating An Amalgamated Database
First Claim
1. A method for creating an amalgamated bioinformatics database from at least a first database and a second database comprising the steps of:
- identifying a first field from the records of the first database;
identifying a second field from the records of the second database, the second field having data related to the first field;
identifying a first set of concepts by traversing a mediating database using terms associated with the first field;
identifying a second set of concepts by traversing the mediating database using terms associated with the second field;
wherein at least one of the steps of identifying the first set of concepts or identifying the second set of concepts is performed using non-trivial terminological mapping;
determining a set of related concepts in the first set of concepts and the second set of concepts; and
generating a record in the amalgamated bioinformatics database comprising data from records of the first database, data from records of the second database and at least a portion of the related concepts from the mediating database.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for creating an amalgamated bioinformatics database from at least a first database and a second database is presented. Concepts are identified in a first field from the records of the first database. A second field from the records of the second database which has data related to the first field is also identified. A first set of concepts is identified by traversing a mediating database using terms associated with the first field and a second set of concepts is also identified by traversing the mediating database using terms associated with the second field. Either the first set of concepts or the second set of concepts, or both, is identified using non-trivial terminological mapping. The set of related concepts in the first set of concepts and the second set of concepts is identified and a record is generated in the amalgamated bioinformatics database.
84 Citations
64 Claims
-
1. A method for creating an amalgamated bioinformatics database from at least a first database and a second database comprising the steps of:
-
identifying a first field from the records of the first database; identifying a second field from the records of the second database, the second field having data related to the first field; identifying a first set of concepts by traversing a mediating database using terms associated with the first field; identifying a second set of concepts by traversing the mediating database using terms associated with the second field; wherein at least one of the steps of identifying the first set of concepts or identifying the second set of concepts is performed using non-trivial terminological mapping; determining a set of related concepts in the first set of concepts and the second set of concepts; and generating a record in the amalgamated bioinformatics database comprising data from records of the first database, data from records of the second database and at least a portion of the related concepts from the mediating database. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 57, 58, 59, 60, 61, 62, 63, 64)
-
-
13. A method for creating an amalgamated bioinformatics database from at least a first database and a second database comprising the steps of:
-
identifying a first field from the records of the first database; identifying a second field from the records of the second database, the second field having data related to the first field; identifying a first set of concepts by traversing a mediating database using terms associated with the first field; identifying a second set of concepts by traversing the mediating database using terms associated with the second field; determining a set of related concepts in the first set of concepts and the second set of concepts; for least a portion of the related concepts, inheriting relationships of the related concepts from the mediating database; and generating a record in the amalgamated bioinformatics database comprising data from records of the first database, data from records of the second database and the related concepts and inherited relationships from the mediating database. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22, 23)
-
-
24. A method for creating an amalgamated bioinformatics database from at least a first database and a second database comprising the steps of:
-
identifying a first field from the records of the first database; identifying a second field from the records of the second database, the second field having data related to the first field; identifying a first set of concepts by traversing a mediating database using terms associated with the first field; identifying a second set of concepts by traversing the mediating database using terms associated with the second field; wherein at least one of the steps of identifying the first set of concepts or identifying the second set of concepts is performed using terminological mapping; determining a set of related concepts in the first set of concepts and the second set of concepts; for least a portion of the related concepts, inheriting relationships of the related concepts from the mediating database; and generating a record in the amalgamated bioinformatics database comprising data from the records of the first database, data from the records of the second database and the related concepts and inherited relationships from the mediating database. - View Dependent Claims (25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39)
-
-
40. A method for creating a knowledge base of relationships between at least one biodata item that is a molecule and at least one other biodata item, comprising the steps of:
-
(a) using a first database storing at least one biodata item that is a molecule associated with at least one other biodata item, said other biodata item being contained in a first set; (b) using a second database storing a second set of at least one biodata item and any information associated therewith, wherein the first set and the second set are not identical; (c) using at least one non-trivial terminological mapping operation in connection with a mediating database for identifying an association between a biodata item of the first set with a biodata item of the second set, (d) for each association identified in step (c), finding a relationship between the biodata item that is a molecule associated with the other biodata item of the first set of the association and the information associated with the biodata item of the second set of the association; (e) storing each relationship found in step (d) in a knowledge base.
-
-
41. A method of integrating a first and second database which are interoperable heterogeneous databases without a common key,
wherein the first database contains a bioobject associated with a first record comprising a biodata item that is a molecule and a first correlating biodata item; -
wherein the second database contains a bioobject associated with a second record comprising a second correlating biodata item and a unique biodata item, where there is no equivalent to the unique biodata item in the first database; comprising the steps of; (a) using a mediating database to link the first correlating biodata item in the first database to the second correlating biodata item in the second data base using at least one non-trivial terminological mapping operation; (b) creating relationships between the biodata items in the first record and the second record, thereby producing an amalgamated third record comprising the biodata item which is the molecule and a plurality of biodata items, including the unique biodata item; and (c) storing the amalgamated record in an amalgamated database. - View Dependent Claims (43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56)
-
-
42. A method of integrating a first and second database which are interoperable heterogeneous databases without a common key,
wherein the first database contains a bioobject associated with a first record comprising a biodata item that is a molecule and a first correlating biodata item; -
wherein the second database contains a bioobject associated with a second record comprising a second correlating biodata item and a unique biodata item, where there is no equivalent to the unique biodata item in the first database; comprising the steps of; (a) transforming at least one of the databases into a generic format; (b) using at least one terminological mapping operation to a mediating database to link the first correlating biodata item in the first database to the second correlating biodata item in the second data base; (c) creating relationships between the biodata items in the first record and the second record, thereby producing an amalgamated third record comprising the biodata item which is the molecule and a plurality of biodata items, including the unique biodata item; and (d) storing the amalgamated record in an amalgamated database.
-
Specification