Automatic identification of abstract online groups
First Claim
1. A computer-implemented method of automatically identifying online abstract groups comprising entities that exhibit shared interests and/or characteristics, the method comprising:
- Harvesting records from social media, each record comprising a social-media posting and being associated with one or more entities;
Extracting content-based and structure-based features from each record, each feature stored on a data storage device and comprising a computer-readable representation of an attribute of one or more records;
Grouping records into record groups according to the features of each record using clustering, classifying, and/or filtering algorithms executed by one or more processors;
Calculating an n-dimensional surface representing each record group, each n-dimensional surface described by a footprint that characterizes the respective record group as an online abstract group;
Defining an outlier as a record having feature-based distances measured from every n-dimensional surface that exceed a threshold value.
2 Assignments
0 Petitions
Accused Products
Abstract
Online abstract groups, in which members aren'"'"'t explicitly connected, can be automatically identified by computer-implemented methods. The methods involve harvesting records from social media and extracting content-based and structure-based features from each record. Each record includes a social-media posting and is associated with one or more entities. Each feature is stored on a data storage device and includes a computer-readable representation of an attribute of one or more records. The methods further involve grouping records into record groups according to the features of each record. Further still the methods involve calculating an n-dimensional surface representing each record group and defining an outlier as a record having feature-based distances measured from every n-dimensional surface that exceed a threshold value. Each of the n-dimensional surfaces is described by a footprint that characterizes the respective record group as an online abstract group.
-
Citations
18 Claims
-
1. A computer-implemented method of automatically identifying online abstract groups comprising entities that exhibit shared interests and/or characteristics, the method comprising:
-
Harvesting records from social media, each record comprising a social-media posting and being associated with one or more entities; Extracting content-based and structure-based features from each record, each feature stored on a data storage device and comprising a computer-readable representation of an attribute of one or more records; Grouping records into record groups according to the features of each record using clustering, classifying, and/or filtering algorithms executed by one or more processors; Calculating an n-dimensional surface representing each record group, each n-dimensional surface described by a footprint that characterizes the respective record group as an online abstract group; Defining an outlier as a record having feature-based distances measured from every n-dimensional surface that exceed a threshold value. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
Specification