Method and System for Populating a Database With Bibliographic Data From Multiple Sources
First Claim
1. A method of populating a relational database of bibliographic data associated with one or more document-based collections, wherein said bibliographic data is sourced from two or more sources having distinct source-specific formats, comprising the steps of:
- accessing source data from the two or more sources;
independently standardizing said accessed data from each of the two or more sources in accordance with a common intermediate source-independent format dictated by an intermediate data structure, such that similar data elements from distinct source-specific formats are commonly identified within said intermediate format; and
further interpreting said standardized data in relation to stored database elements comprising at least some database elements derived from each of the two or more sources, for populating the database in accordance with said relation with at least some repetitive elements replaced with reference thereto, consistent with a refined database data structure distinct from said intermediate data structure.
1 Assignment
0 Petitions
Accused Products
Abstract
There is disclosed a method of populating a relational database of bibliographic data associated with one or more document-based collections, wherein the bibliographic data is sourced from two or more sources having distinct source-specific formats. The method generally comprises the steps of accessing source data from the two or more sources; independently standardizing the accessed data from each of the two or more sources in accordance with a common intermediate source-independent format dictated by an intermediate data structure, such that similar data elements from distinct source-specific formats are commonly identified within the intermediate format; and further interpreting the standardized data in relation to stored database elements comprising at least some database elements derived from each of the two or more sources, for populating the database in accordance with the relation with at least some repetitive elements replaced with reference thereto, consistent with a refined database data structure distinct from the intermediate data structure. A system and computer-readable medium for implementing the above method are also disclosed.
-
Citations
25 Claims
-
1. A method of populating a relational database of bibliographic data associated with one or more document-based collections, wherein said bibliographic data is sourced from two or more sources having distinct source-specific formats, comprising the steps of:
-
accessing source data from the two or more sources; independently standardizing said accessed data from each of the two or more sources in accordance with a common intermediate source-independent format dictated by an intermediate data structure, such that similar data elements from distinct source-specific formats are commonly identified within said intermediate format; and further interpreting said standardized data in relation to stored database elements comprising at least some database elements derived from each of the two or more sources, for populating the database in accordance with said relation with at least some repetitive elements replaced with reference thereto, consistent with a refined database data structure distinct from said intermediate data structure. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A system for populating a relational database of bibliographic data associated with one or more document-based collections, wherein said bibliographic data is sourced from two or more sources having distinct source-specific formats, the system comprising:
-
one or more data storage devices configured to define an intermediate data structure and a refined database data structure distinct therefrom, and for storing database elements derived from each of the two or more sources in accordance with said refined database data structure; independent standardization modules for independently standardizing data accessed from the two or more sources in accordance with a common intermediate source-independent format dictated by said intermediate data structure, such that similar data elements from distinct source-specific formats are commonly identified within said intermediate format; and an interpreter for further interpreting said standardized data in relation to said stored database elements from each of the two or more sources for populating the database in accordance with said relation with at least some repetitive elements replaced with reference thereto, consistent with said refined database data structure. - View Dependent Claims (21, 22)
-
-
23. A computer-readable medium for populating a relational database of bibliographic data associated with one or more document-based collections accessed from two or more sources in distinct source-specific formats, comprising statements and instructions for implementation by a computing device to implement the steps of:
-
independently standardizing said accessed data from each of the two or more sources in accordance with a common intermediate source-independent format dictated by an intermediate data structure, such that similar data elements from distinct source-specific formats are commonly identified within said intermediate format; and further interpreting said standardized data in relation to stored database elements comprising at least some database elements derived from each of the two or more sources, for populating the database in accordance with said relation with at least some repetitive elements replaced with reference thereto, consistent with a refined database data structure distinct from said intermediate data structure. - View Dependent Claims (24, 25)
-
Specification