Weight generation in machine learning
First Claim
1. A method to improve predictive capability of a machine learning system, the method comprising:
- receiving, by a computer, training data that includes one or more points;
identifying, by the computer, a training distribution of the one or more points of the training data;
receiving, by the computer, test data that includes one or more points;
identifying, by the computer, information about a test distribution of the one or more points of the test data;
identifying, by the computer, one or more coordinates for the one or more points of the training data and the one or more points of the test data;
determining, for each identified coordinate and by the computer differences between the one or more points of the test data and the one or more points of the training data;
determining, by the computer, weights for the one or more points of the training data based on the determined differences, wherein the weights are adapted to cause the training distribution to conform to the test distribution in response to the weights being applied to the training distribution;
generating, by the computer, a weighted function based on the determined weights and the training data; and
generating, by the computer, a first output based on an application of an input to the generated weighted function, wherein the first output is different than a second output generated by an application of the input to a non-weighted function, wherein the first output and the second output respectively correspond to a first predictive capability and a second predictive capability of the machine learning system, and wherein the first predictive capability is greater than the second predictive capability.
3 Assignments
0 Petitions
Accused Products
Abstract
Technologies are generally described for systems, devices and methods relating to a machine learning environment. In some examples, a processor may identify a training distribution of a training data. The processor may identify information about a test distribution of a test data. The processor may identify a coordinate of the training data and the test data. The processor may determine, for the coordinate, differences between the test distribution and the training distribution. The processor may determine weights based on the differences. The weights may be adapted to cause the training distribution to conform to the test distribution when the weights are applied to the training distribution.
59 Citations
20 Claims
-
1. A method to improve predictive capability of a machine learning system, the method comprising:
-
receiving, by a computer, training data that includes one or more points; identifying, by the computer, a training distribution of the one or more points of the training data; receiving, by the computer, test data that includes one or more points; identifying, by the computer, information about a test distribution of the one or more points of the test data; identifying, by the computer, one or more coordinates for the one or more points of the training data and the one or more points of the test data; determining, for each identified coordinate and by the computer differences between the one or more points of the test data and the one or more points of the training data; determining, by the computer, weights for the one or more points of the training data based on the determined differences, wherein the weights are adapted to cause the training distribution to conform to the test distribution in response to the weights being applied to the training distribution; generating, by the computer, a weighted function based on the determined weights and the training data; and generating, by the computer, a first output based on an application of an input to the generated weighted function, wherein the first output is different than a second output generated by an application of the input to a non-weighted function, wherein the first output and the second output respectively correspond to a first predictive capability and a second predictive capability of the machine learning system, and wherein the first predictive capability is greater than the second predictive capability. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method to improve predictive capability of a machine learning system, the method comprising, by a computer:
-
identifying first points of training data; identifying information about test data, wherein the test data includes second points; identifying a coordinate of the first points and the second points, wherein the coordinate includes a range of values in a coordinate space; dividing the range of values in the coordinate space into bins, wherein the bins define subsets of the range of values; determining a first frequency, wherein the first frequency relates to a first percentage of the first points being located within a particular bin; determining a second frequency, wherein the second frequency relates to a second percentage of the second points being located within the particular bin; comparing the first frequency and the second frequency; determining a weight for the training data, based at least, in part, on the comparison of the first frequency and the second frequency, and on a number of the bins; generating a weighted function based on the determined weight and the training data; and generating a first output based on an application of an input to the generated weighted function, wherein the first output is different than a second output generated by an application of the input to a non-weighted function, wherein the first output and the second output respectively correspond to a first predictive capability and a second predictive capability of the machine learning system, and wherein the first predictive capability is greater than the second predictive capability. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A computing device, comprising:
-
a first processor; a second processor; and a memory configured to be in communication with the first processor and the second processor, the memory effective to store training data and test data, wherein the training data comprises first points and the test data comprises second points, and wherein; the first processor is effective to; identify a coordinate of the first points and the second points, wherein the coordinate includes a range of values in a coordinate space; divide the range of values in the coordinate space into bins, wherein the bins define subsets of the range of values; determine a first frequency, wherein the first frequency relates to a first percentage of the first points being located within a particular bin; determine a second frequency, wherein the second frequency relates to a second percentage of the second points being located within the particular bin; compare the first frequency and the second frequency; and determine a weight for the training data, based at least, in part, on the comparison of the first frequency and the second frequency, and on a number of bins, the second processor is effective to; generate a weighted function based on the determined weight and the training data; and generate a first output based on an application of an input to the generated weighted function, wherein the first output is different than a second output generated by an application of the input to a non-weighted function, wherein the first output and the second output respectively correspond to a first predictive capability and a second predictive capability of the computing device, and wherein the first predictive capability is greater than the second predictive capability, and the memory is further effective to store the determined weight. - View Dependent Claims (14, 15, 16, 17, 18, 19)
-
-
20. A computer-implemented method to improve predictive capability of a machine learning system, the method comprising:
-
receiving, by a weight generation module of the machine learning system, training data that includes one or more training points; identifying, by a processor of the machine learning system, a training distribution of the one or more training points of the training data; retrieving, by the weight generation module of the machine learning system, test data from a memory of the machine learning system, wherein the test data is different from the training data, and wherein the test data includes one or more test points; identifying, by the processor of the machine learning system, information about a test distribution of the one or more test points of the test data; identifying, by the weight generation module of the machine learning system, one or more coordinates for the one or more training points of the training data and the one or more test points of the test data; determining, based on the identified information and by the weight generation module of the machine learning system and for each identified coordinate, differences between the one or more test points of the test data and the one or more training points of the training data; determining, by the weight generation module of the machine learning system and based on the determined differences, weights for the one or more points of the training data, wherein; the weights are adapted to cause the training distribution to conform to the test distribution in response to the weights being applied to the training distribution, the training data includes a number of points, and each coordinate includes a range of values in a coordinate space; dividing the range of values in the coordinate space into bins; calculating a frequency of each bin based on a number of points in each bin and a total number of points included in the training data, wherein determining the weights is based on the calculated frequency of each bin and a number of the bins; transmitting, by the weight generation module of the machine learning system, the determined weights and the training data to a machine learning module of the machine learning system; producing, by the machine learning module of the machine learning system, a weighted function based on the determined weights and the training data, wherein the weighted function corresponds to a first predictive capability greater than a second predictive capability that corresponds to a function produced based on the training data; and operating the machine learning system to use the weighted function to provide the first predictive capability.
-
Specification