Method and system for identifying data and users of interest from patterns of user interaction with existing data
First Claim
Patent Images
1. A computer-based recommendation system comprising:
- a user interface connected to at least one processor; and
at least one set of memory for storing data element information, user information, and computer readable instructions that, when executed by the processor, cause the processor to perform the steps of;
a. receiving a user input query from the user interface;
b. calculating a distance between each data element and user based on;
i. similarity of structural data associated with each data element,ii. similarity of annotation information associated with each data element, andiii. usage information associated with data elements,wherein at least some of the structural data is selected from a list comprising;
type of data, domain relationships between data items, experimental subject, subject species, parameters of acquisition, sequence or timing,wherein at least some of the annotation information is selected from a list comprising;
keyword tags, key-value metadata, region-of-interest annotations in time and/or space, or user-inputted information, andwherein at least some of the usage information is selected from a list comprising;
order of interaction, views, incorporation in analysis, sharing with others, publishing, recommending to the user, or user interaction with the data elements;
c. inputting the calculated distances between each data element into a recommendation algorithm to identify data elements or users based on the query input, wherein the recommendation algorithm is based in part on the magnitude of the calculated distances;
andd. outputting a list of recommended data elements or users based on the query input.
0 Assignments
0 Petitions
Accused Products
Abstract
A system and method for identifying relevant data and experts from a large data set that are relevant to a researcher, scientific project, or other analysis project using a recommendation algorithm utilizing distance metrics based on structural, annotation, and usage information associated with data elements.
47 Citations
20 Claims
-
1. A computer-based recommendation system comprising:
-
a user interface connected to at least one processor; and at least one set of memory for storing data element information, user information, and computer readable instructions that, when executed by the processor, cause the processor to perform the steps of; a. receiving a user input query from the user interface; b. calculating a distance between each data element and user based on; i. similarity of structural data associated with each data element, ii. similarity of annotation information associated with each data element, and iii. usage information associated with data elements, wherein at least some of the structural data is selected from a list comprising;
type of data, domain relationships between data items, experimental subject, subject species, parameters of acquisition, sequence or timing,wherein at least some of the annotation information is selected from a list comprising;
keyword tags, key-value metadata, region-of-interest annotations in time and/or space, or user-inputted information, andwherein at least some of the usage information is selected from a list comprising;
order of interaction, views, incorporation in analysis, sharing with others, publishing, recommending to the user, or user interaction with the data elements;c. inputting the calculated distances between each data element into a recommendation algorithm to identify data elements or users based on the query input, wherein the recommendation algorithm is based in part on the magnitude of the calculated distances; and d. outputting a list of recommended data elements or users based on the query input. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computer implemented recommendation method comprising the steps:
-
creating a database comprising multiple users and data elements represented by nodes, each node in the database having a structural, annotation, and usage field associated therewith; determining distances between each node of the database based on the structural, annotation, and usage field for each node, wherein at least one information set for the structural field is selected from a list comprising;
type of data, domain relationships between data items, experimental subject, subject species, parameters of acquisition, sequence or timing,wherein at least one information set for the annotation field is selected from a list comprising;
keyword tags, key-value metadata, region-of-interest annotations in time and/or space, or user-inputted information,wherein at least one information set for the usage field is selected from a list comprising;
order of interaction, views, incorporation in analysis, sharing with others, publishing, recommending to the user, or user interaction with the data elements;calculating a distance between the nodes and an input query, the distance being a function of the determined distances based on the structural, annotation, and usage field for each node, and wherein the step of calculating the distance includes application of a weighting factor; selecting at least one data element or user based on the magnitude of the calculated distance to the input query; and outputting to a display the selected data element or user.
-
-
8. A computer-based method for identifying a domain expert comprising the steps of:
-
receiving from a user, into a terminal connected to a database and a processor, an initial query related to a research subject; identifying multiple data elements and users associated with the database, wherein each data element and user is represented by a separate and distinct node; determining using a processor a distance between each node based on; 1) similarity of structural data associated with each node, 2) similarity of annotations information associated with each node, and 3) usage information associated with data elements; determining using the processor a distance between comparing the initial query to the nodes of the large database based on weighted; 1) structural data information, 2) annotation information, and 3) usage information; and outputting a recommendation node of a domain expert, based on the magnitude of the weighted distance step, to an output device. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
Specification