METHOD AND DEVICE FOR PROCESSING DATA
First Claim
1. A method for processing data, comprising:
- sorting samples from the data according to a primary key, wherein the primary key comprises a feature serial number and a sample serial number;
acquiring a statistic of each feature in each category by taking the primary key and the feature value as an input key-value pair, and performing a calculation with a first algorithm model, to obtain the feature serial number and the statistic as an output key-value pair; and
acquiring a contribution value of each feature to the category by performing a calculation on the output key-value pair with a second algorithm model, and selecting a feature based on the contribution value.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and device for processing data in the field of data process are disclosed. The method includes: sorting samples according to primary keys, wherein the primary key includes a feature serial number and a sample serial number, and wherein a column value corresponding to the primary key is used as a feature value for the sample; acquiring a statistic of each feature in each category by taking the primary key and the feature value as an input key-value pair and calculating with a first algorithm model, and outputting the feature serial number and the statistic as an output key-value pair; and acquiring a contribution value of each feature to the category by performing calculation on the output key-value pair with a second algorithm model, and selecting a feature based on the contribution value. The device includes a sorting module, a first processing module and a second processing module.
-
Citations
16 Claims
-
1. A method for processing data, comprising:
-
sorting samples from the data according to a primary key, wherein the primary key comprises a feature serial number and a sample serial number; acquiring a statistic of each feature in each category by taking the primary key and the feature value as an input key-value pair, and performing a calculation with a first algorithm model, to obtain the feature serial number and the statistic as an output key-value pair; and acquiring a contribution value of each feature to the category by performing a calculation on the output key-value pair with a second algorithm model, and selecting a feature based on the contribution value. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A device for processing data, comprising:
-
a sorting module, configured to sort samples from the data according to a primary key, wherein the primary key comprises a feature serial number and a sample serial number; a first processing module, configured to acquire a statistic of each feature in each category by taking the primary key and the feature value as an input key-value pair calculating with a first algorithm model, and output the feature serial number and the statistic as an output key-value pair; and a second processing module, configured to acquire a contribution value of each feature to the category by performing calculation on the output key-value pair with a second algorithm model, and select a feature based on the contribution value. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
Specification