Method and system for reducing a data set
Abstract
Systems and methods are provided for applying a dimensionality reduction process to a data set. The method includes obtaining a data set that includes a first set of variables and identifying collections of the first set of variables for replacement by a second set of variables. The method also includes replacing the collections of the first set of variables with the second set of variables and eliminating the first set of variables from the data set.
13 Claims
1. A computer-implemented method for applying a dimensionality reduction process to a medical data set, comprising:
obtaining a data set stored in a medical database, the data set including a set of medical data entries relating to medical events for individuals in a population;
identifying a collection of the set of medical data entries relating to a common medical event for an individual in the population for replacement by a representative medical data entry for the common medical event, the identifying including identifying medical data entries of the set that are associated with medical events relating to the common medical event and occurring within a predetermined period of time; and
reducing the size of the medical database by:
replacing the identified collection of medical data entries with the representative medical data entry for the common medical event in the medical database, the representative medical data entry including a numerical indicator representing a number of the medical data entries in the identified collection; and
eliminating the identified collection of medical data entries from the medical database.
Dependent claims: 2, 3, 4
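The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes that a "common medical event" is identified by a hypothetical `event_code` field, that the individual is identified by a hypothetical `patient_id` field, and that the predetermined period is measured from the earliest entry in each collection. The `Entry` class, the `reduce_entries` function, and the 30-day default are all illustrative names and values that do not appear in the claims.

```python
from dataclasses import dataclass
from datetime import date, timedelta


@dataclass
class Entry:
    patient_id: str   # hypothetical identifier for the individual
    event_code: str   # hypothetical code naming the common medical event
    event_date: date
    count: int = 1    # numerical indicator: how many entries this one represents


def reduce_entries(entries, window=timedelta(days=30)):
    """Collapse entries for the same patient and event that occur within
    `window` of the first entry of a collection into one representative entry."""
    reduced = []
    current = None
    ordered = sorted(entries, key=lambda e: (e.patient_id, e.event_code, e.event_date))
    for e in ordered:
        in_collection = (
            current is not None
            and e.patient_id == current.patient_id
            and e.event_code == current.event_code
            and e.event_date - current.event_date <= window
        )
        if in_collection:
            # Fold the entry into the representative instead of keeping it.
            current.count += e.count
        else:
            # Start a new collection with its own representative entry.
            current = Entry(e.patient_id, e.event_code, e.event_date, e.count)
            reduced.append(current)
    return reduced


# Made-up sample data: three close visits, one later visit, one other patient.
entries = [
    Entry("p1", "asthma", date(2020, 1, 1)),
    Entry("p1", "asthma", date(2020, 1, 10)),
    Entry("p1", "asthma", date(2020, 1, 25)),
    Entry("p1", "asthma", date(2020, 6, 1)),
    Entry("p2", "asthma", date(2020, 1, 5)),
]
reduced = reduce_entries(entries)
```

In this sample, the three January entries for `p1` fall within the assumed 30-day window and collapse into one representative entry with `count` equal to 3, while the June entry and the `p2` entry each stay as their own entry. The returned list stands in for the reduced database; the original entries are simply not carried over.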
5. A computer-readable medium comprising instructions which, when executed by a processor, perform a method for applying a dimensionality reduction process to a medical data set, the method comprising:
obtaining a data set stored in a medical database, the data set including a set of medical data entries relating to medical events for individuals in a population;
identifying a collection of the set of medical data entries relating to a common medical event for an individual in the population for replacement by a single medical data entry representative of at least some of the medical data entries relating to the common event, the identifying including identifying medical data entries of the set that are associated with medical events relating to the common medical event and occurring within a predetermined period of time; and
reducing the size of the medical database by:
replacing the identified collection of medical data entries with the representative medical data entry for the common medical event in the medical database, the representative medical data entry including a numerical indicator representing a number of the identified collection of medical data entries; and
eliminating the identified collection of medical data entries from the medical database.
Dependent claims: 6, 7, 8, 9
10. A system for applying a dimensionality reduction process to a data set, the system comprising:
a medical database storing a data set including medical data entries relating to medical events for individuals in a population;
at least one input device; and
a central processing unit in communication with the medical database and the at least one input device, wherein the central processing unit:
obtains the data set from the medical database;
identifies a collection of the set of medical data entries relating to a common medical event for an individual in the population for replacement by a representative medical data entry for the common medical event, the identifying including identifying medical data entries of the set that are associated with medical events relating to the common medical event and occurring within a predetermined period of time; and
reduces the size of the medical database by:
replacing the identified collection of medical data entries with the representative medical data entry for the common medical event in the medical database, the representative medical data entry including a numerical indicator representing a number of the identified collection of medical data entries; and
deleting at least some of the identified collection of medical data entries from the medical database.
Dependent claims: 11, 12, 13
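The replace-then-delete step that the independent claims apply to the medical database itself can be sketched against an in-memory SQLite database using Python's standard `sqlite3` module. This is an illustration under stated assumptions, not the claimed system: the `entries` table, its columns (`patient_id`, `event_code`, `event_day`, `n`), and the `collapse` helper are hypothetical, and days are stored as plain integers to keep the example short.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE entries (id INTEGER PRIMARY KEY, patient_id TEXT, "
    "event_code TEXT, event_day INTEGER, n INTEGER DEFAULT 1)"
)
# Made-up sample rows; n is the numerical indicator and defaults to 1.
conn.executemany(
    "INSERT INTO entries (patient_id, event_code, event_day) VALUES (?, ?, ?)",
    [("p1", "asthma", 1), ("p1", "asthma", 10), ("p1", "asthma", 25), ("p2", "asthma", 5)],
)


def collapse(conn, patient_id, event_code, start_day, window_days):
    """Replace one collection of rows with a representative row and delete
    the originals. Returns how many original rows were removed."""
    with conn:  # single transaction: both steps are committed or neither is
        rows = conn.execute(
            "SELECT id, n FROM entries WHERE patient_id = ? AND event_code = ? "
            "AND event_day BETWEEN ? AND ?",
            (patient_id, event_code, start_day, start_day + window_days),
        ).fetchall()
        if len(rows) < 2:
            return 0  # nothing to reduce
        total = sum(n for _, n in rows)
        # Insert the representative entry carrying the count of replaced entries.
        conn.execute(
            "INSERT INTO entries (patient_id, event_code, event_day, n) "
            "VALUES (?, ?, ?, ?)",
            (patient_id, event_code, start_day, total),
        )
        # Delete the identified collection from the database.
        conn.executemany("DELETE FROM entries WHERE id = ?", [(i,) for i, _ in rows])
    return len(rows)


removed = collapse(conn, "p1", "asthma", 1, 30)
remaining = conn.execute(
    "SELECT patient_id, event_code, event_day, n FROM entries ORDER BY patient_id"
).fetchall()
```

Running both statements inside one transaction is a design choice of this sketch: it avoids a state in which the representative row exists alongside the rows it replaces, or in which the rows are deleted with no representative left behind.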
Specification