SIMILARITY AND RANKING OF DATABASES BASED ON DATABASE METADATA
Abstract
A processor selects a first database and a second database from a plurality of databases. The processor determines one or more terms found in the first and second database, wherein each term of the one or more terms includes metadata of a database of the plurality of databases. The processor identifies one or more common terms between the first database and the second database and determines the one or more common terms found in each of a plurality of groups of databases of the plurality of databases, wherein each group of databases corresponds to a number of databases which constitute the group of databases. The processor determines a similarity score between the first database and the second database of the plurality of databases based on the one or more common terms found in each group of databases of the plurality of databases.
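As a rough illustration of the steps in the abstract, the following Python sketch treats each database as a set of metadata terms, finds the terms common to two databases, groups those common terms by the number of databases in which each one appears (one possible reading of "group of databases"), and combines the per-group quantities into a score. The 1/k weighting for a term found in k databases, the function names (`document_frequency`, `common_term_groups`, `similarity_score`), and the sample catalog are all assumptions made for illustration; the text above does not state a scoring formula.

```python
from collections import Counter


def document_frequency(catalog):
    """Count, for each term, how many databases' metadata contain it."""
    freq = Counter()
    for terms in catalog.values():
        freq.update(set(terms))
    return freq


def common_term_groups(catalog, first, second):
    """Map group size k to the number of common terms found in exactly k databases."""
    freq = document_frequency(catalog)
    common = set(catalog[first]) & set(catalog[second])
    return Counter(freq[term] for term in common)


def similarity_score(catalog, first, second):
    """Assumed scoring rule: a common term found in k databases contributes 1/k."""
    groups = common_term_groups(catalog, first, second)
    return sum(count / k for k, count in groups.items())


# Hypothetical catalog: database name -> metadata terms (e.g. column names).
catalog = {
    "sales": {"customer_id", "order_id", "amount", "region"},
    "crm": {"customer_id", "email", "region"},
    "hr": {"employee_id", "email", "region"},
}

if __name__ == "__main__":
    print(dict(common_term_groups(catalog, "sales", "crm")))
    print(similarity_score(catalog, "sales", "crm"))
```

Under this assumed weighting, a term shared by only the two databases being compared counts for more than a term that appears in every database, which is one plausible reason to group common terms by how widely they occur.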
31 Claims
1-14. (canceled)
15. A computer program product for determining a similarity of databases, the computer program product comprising:
a computer-readable storage medium having computer readable program code embodied therewith, the computer readable program code comprising:
(a) computer readable program code configured to select a first database and a second database from a plurality of databases;
(b) computer readable program code configured to determine if one or more terms found in the first database are also found in the second database, wherein each term of the one or more terms includes metadata of a database of the plurality of databases, and wherein the one or more terms found in both databases are one or more common terms;
(c) computer readable program code configured to determine a quantity of the one or more common terms found in each of a plurality of groups of databases of the plurality of databases, wherein each group of databases corresponds to a number of databases which constitute the group of databases; and
(d) computer readable program code configured to determine a similarity score between the first database and the second database of the plurality of databases based on the quantity of the one or more common terms found in each group of databases of the plurality of databases.
Dependent claims: 16, 17, 19, 20, 21
18. (canceled)
22. A computer system for determining a similarity of databases, the computer program product comprising:
one or more computer processors;
one or more computer readable storage media; and
program instructions stored on the computer readable storage media for execution by at least one of the one or more processors, the program instructions comprising:
(a) program instructions to select a first database and a second database from a plurality of databases;
(b) program instructions to determine if one or more terms found in the first database are also found in the second database, wherein each term of the one or more terms includes metadata of a database of the plurality of databases, and wherein the one or more terms found in both databases are one or more common terms;
(c) program instructions to determine a quantity of the one or more common terms found in each of a plurality of groups of databases of the plurality of databases, wherein each group of databases corresponds to a number of databases which constitute the group of databases; and
(d) program instructions to determine a similarity score between the first database and the second database of the plurality of databases based on the quantity of the one or more common terms found in each group of databases of the plurality of databases.
Dependent claims: 23, 24, 25, 26, 27
28. A computer program product for determining a similarity of databases to search criteria, the method comprising:
(a) computer readable program code configured to receive search criteria, wherein the search criteria includes one or more terms;
(b) computer readable program code configured to determine the one or more terms found in both the search criteria and a first database of a plurality of databases, wherein the one or more terms found in both the search criteria and a first database are one or more common terms;
(c) computer readable program code configured to determine a quantity of the one or more common terms found in each of a plurality of groups of databases of the plurality of databases, wherein a group of databases of the plurality of groups of databases corresponds to a number of databases which constitutes the group of databases; and
(d) computer readable program code configured to determine a similarity score of the first database of the plurality of databases based on the quantity of the one or more common terms found in each group of databases of the plurality of databases, wherein the similarity of the first database to the search criteria is based on the similarity score.
Dependent claims: 29, 30, 31
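The search-criteria variant recited in claim 28 might be sketched along the following lines. This is only an illustrative reading: the search criteria are modeled as a set of terms, the groups are assumed to be formed by counting how many databases contain each common term, each common term found in k databases is assumed to contribute 1/k to the score, and the ranking step is an assumption drawn from the title. The names `score_against_query` and `rank_databases` and the sample catalog are hypothetical.

```python
from collections import Counter


def score_against_query(catalog, query_terms, name):
    """Score one database against the search terms (assumed 1/k weighting)."""
    # How many databases contain each term.
    freq = Counter()
    for terms in catalog.values():
        freq.update(set(terms))
    # Terms found in both the search criteria and the database.
    common = set(query_terms) & set(catalog[name])
    # Group size k -> quantity of common terms found in exactly k databases.
    groups = Counter(freq[term] for term in common)
    return sum(count / k for k, count in groups.items())


def rank_databases(catalog, query_terms):
    """Return (name, score) pairs, highest score first, ties broken by name."""
    scores = {
        name: score_against_query(catalog, query_terms, name) for name in catalog
    }
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical catalog: database name -> metadata terms.
catalog = {
    "sales": {"customer_id", "order_id", "amount", "region"},
    "crm": {"customer_id", "email", "region"},
    "hr": {"employee_id", "email", "region"},
}

if __name__ == "__main__":
    for name, score in rank_databases(catalog, {"customer_id", "region"}):
        print(name, round(score, 3))
```

In this sketch the search criteria are not themselves counted as a database when the groups are formed; whether they should be is not settled by the claim language above.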
Specification