Filtering information using targeted filtering schemes
First Claim
1. An apparatus comprising:
- a computing device;
one or more storage media configured with executable instructions that, when executed by the computing device, instruct the computing device to perform acts comprising;
generating a plurality of filtering schemes for a plurality of types of target users, the generating comprising;
setting keywords and user-characteristic data for each type of target users;
eliminating invalid data from the user-characteristic data of each type of target users;
computing derived variables for each type of target users using remaining data in the user-characteristic data of the respective type of target user, the derived variables comprising;
an aggregated variable representing a number of targeted keyword types that appear,a ratio variable representing a ratio between an amount of information sent and an amount of information received, andan average variable representing an average frequency of an appearance of a keyword type;
filtering parameters of the user-characteristic data of each type of target users, the parameters comprising at least one of the derived variables; and
modeling each type of target users based on the filtered parameters of the user-characteristic of the respective type of target users, the modeling comprising;
computing multiple models for each type of target users based on different modeling algorithms and different filtered parameters of the user-characteristic data of the respective type of target users,testing the multiple models of each type of target users using training data which includes information of the target users of the respective type whose behavior is known, andselecting a respective model having the highest accuracy among the multiple models of each type of target users as a filtering scheme for the respective type of target users.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for filtering user information takes into account not only specific keywords in the user information, but also related user-characteristic data (e.g., user activity data), and allows targeted user characteristics to be determined from multiple aspects of user activities. In one aspect, the disclosed method adopts different filtering schemes for different types of targeted users to improve the recognition accuracy with respect to the target user information. The method determines a suitable filtering scheme using a correspondence relationship between the filtering scheme and keywords and user-characteristic data. The method uses modeling of sample users and multiple candidate filtering schemes to formulate targeted filtering scheme. An apparatus for implementing the method is also disclosed.
-
Citations
20 Claims
-
1. An apparatus comprising:
-
a computing device; one or more storage media configured with executable instructions that, when executed by the computing device, instruct the computing device to perform acts comprising; generating a plurality of filtering schemes for a plurality of types of target users, the generating comprising; setting keywords and user-characteristic data for each type of target users; eliminating invalid data from the user-characteristic data of each type of target users; computing derived variables for each type of target users using remaining data in the user-characteristic data of the respective type of target user, the derived variables comprising; an aggregated variable representing a number of targeted keyword types that appear, a ratio variable representing a ratio between an amount of information sent and an amount of information received, and an average variable representing an average frequency of an appearance of a keyword type; filtering parameters of the user-characteristic data of each type of target users, the parameters comprising at least one of the derived variables; and modeling each type of target users based on the filtered parameters of the user-characteristic of the respective type of target users, the modeling comprising; computing multiple models for each type of target users based on different modeling algorithms and different filtered parameters of the user-characteristic data of the respective type of target users, testing the multiple models of each type of target users using training data which includes information of the target users of the respective type whose behavior is known, and selecting a respective model having the highest accuracy among the multiple models of each type of target users as a filtering scheme for the respective type of target users. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method comprising:
-
under control of a computing device configured with executable instructions; generating a plurality of filtering schemes for a plurality of types of target users, the generating comprising; setting keywords and user-characteristic data for each type of target users; eliminating invalid data from the user-characteristic data of each type of target users; computing derived variables for each type of target users using remaining data in the user-characteristic data of the respective type of target user, the derived variables comprising; an aggregated variable representing a number of targeted keyword types that appear, a ratio variable representing a ratio between an amount of information sent and an amount of information received, and an average variable representing an average frequency of an appearance of a keyword type; filtering parameters of the user-characteristic data of each type of target users, the parameters comprising at least one of the derived variables; and modeling each type of target users based on the filtered parameters of the user-characteristic of the respective type of target users, the modeling comprising; computing multiple models for each type of target users based on different modeling algorithms and different filtered parameters of the user-characteristic data of the respective type of target users, testing the multiple models of each type of target users using training data which includes information of the target users of the respective type whose behavior is known, and selecting a respective model having the highest accuracy among the multiple models of each type of target users as a filtering scheme for the respective type of target users. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. One or more memory devices storing executable instructions that, when executed by a computing device, cause the computing device to perform acts comprising:
generating a plurality of filtering schemes for a plurality of types of target users, the generating comprising; setting keywords and user-characteristic data for each type of target users; eliminating invalid data from the user-characteristic data of each type of target users; computing derived variables for each type of target users using remaining data in the user-characteristic data of the respective type of target user, the derived variables comprising; an aggregated variable representing a number of targeted keyword types that appear, a ratio variable representing a ratio between an amount of information sent and an amount of information received, and an average variable representing an average frequency of an appearance of a keyword type; filtering parameters of the user-characteristic data of each type of target users, the parameters comprising at least one of the derived variables; and modeling each type of target users based on the filtered parameters of the user-characteristic of the respective type of target users, the modeling comprising; computing multiple models for each type of target users based on different modeling algorithms and different filtered parameters of the user-characteristic data of the respective type of target users, testing the multiple models of each type of target users using training data which includes information of the target users of the respective type whose behavior is known, and selecting a respective model having the highest accuracy among the multiple models of each type of target users as a filtering scheme for the respective type of target users. - View Dependent Claims (18, 19, 20)
Specification