Similarity and ranking of databases based on database metadata
First Claim
1. A method for determining similarity of databases, the method comprising:
- (a) one or more processors selecting a first database and a second database from a plurality of databases that includes one or more additional databases;
(b) one or more processors identifying one or more terms found in the first database and found in the second database of the plurality of databases as one or more common terms, wherein each term of the one or more terms is comprised of metadata of a structure of a database of the plurality of databases that defines the objects in the database;
(c) one or more processors determining for a common term of the one or more common terms, a quantity of databases of the plurality of databases in which the common term is found, wherein the quantity of databases in which the common term of the one or more common terms is found constitutes a group, and wherein a range of groups includes each quantity of databases, from a group of two databases to a group of a quantity of databases equal to the plurality of databases; and
(d) one or more processors determining a similarity score between the first database and the second database of the plurality of databases based on a tuple formed from the quantity of the one or more common terms found in each group of the range of groups.
1 Assignment
0 Petitions
Accused Products
Abstract
A processor selects a first database and a second database from a plurality of databases. The processor determines one or more terms found in the first and second database, wherein each term of the one or more terms includes metadata of a database of the plurality of databases. The processor identifies one or more common terms between the first database and the second database and determines the one or more common terms found in each of a plurality of groups of databases of the plurality of databases, wherein each group of databases corresponds to a number of databases which constitute the group of databases. The processor determines a similarity score between the first database and the second database of the plurality of databases based on the one or more common terms found in each group of databases of the plurality of databases.
-
Citations
16 Claims
-
1. A method for determining similarity of databases, the method comprising:
-
(a) one or more processors selecting a first database and a second database from a plurality of databases that includes one or more additional databases; (b) one or more processors identifying one or more terms found in the first database and found in the second database of the plurality of databases as one or more common terms, wherein each term of the one or more terms is comprised of metadata of a structure of a database of the plurality of databases that defines the objects in the database; (c) one or more processors determining for a common term of the one or more common terms, a quantity of databases of the plurality of databases in which the common term is found, wherein the quantity of databases in which the common term of the one or more common terms is found constitutes a group, and wherein a range of groups includes each quantity of databases, from a group of two databases to a group of a quantity of databases equal to the plurality of databases; and (d) one or more processors determining a similarity score between the first database and the second database of the plurality of databases based on a tuple formed from the quantity of the one or more common terms found in each group of the range of groups. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method for determining a similarity of databases to search criteria, the method comprising:
-
(a) one or more processors receiving search criteria, wherein the search criteria includes one or more terms, wherein the one or more terms are comprised of metadata of a structure of a database of the plurality of databases that defines the objects in the database, and wherein the one or more terms are selected from a list presented to a user; (b) one or more processors determining the one or more terms found in both the search criteria and a first database of a plurality of databases, wherein the one or more terms found in both the search criteria and a first database are one or more common terms; (c) one or more processors determining, a quantity of the one or more common terms found in each of a plurality of groups of databases of the plurality of databases, wherein a group of databases of the plurality of groups of databases includes a quantity of databases in which a common term of the one or more common terms is found, and wherein a range of the plurality of groups of databases extends from two databases to the quantity of the plurality of databases; and (d) one or more processors determining a similarity score of the first database of the plurality of databases to the search criteria, based on a tuple formed from the quantity of the one or more common terms found in each group of the range of groups of databases, wherein the similarity of the first database to the search criteria is based on the similarity score. - View Dependent Claims (11, 12, 13, 14, 15, 16)
-
Specification