System and method for clustering content items from content feeds
First Claim
1. A computer system for clustering content, comprising:
- content feeds, each having metadata that may be represented by text;
a clustering engine for clustering a text representing metadata of a content feed in a cluster along with one or more other texts representing metadata of the content feeds determined to be a good nearest neighbor of the text; and
a good nearest neighbor analyzer operably coupled to the clustering engine for determining the one or more other texts to be a good nearest neighbor of the text.
3 Assignments
0 Petitions
Accused Products
Abstract
An improved system and method for clustering text or content described by text is provided. Each text in a set of texts may be represented as a dimensional vector of words. Singleton texts that may not be similar to another text may be excluded from the set of texts for clustering. Texts identified as good nearest neighbors may then be grouped in the same cluster. In addition, metadata describing content may be used for clustering items of aggregated content from content feeds. Metadata describing items of content from content feeds may be converted into a set of texts and texts identified as good nearest neighbors may then be clustered. Items of content feeds described by the clustered texts may then be similarly clustered. Any types of items of content that may be described by text may be clustered, including audio, images, video, multimedia content, and so forth.
-
Citations
20 Claims
-
1. A computer system for clustering content, comprising:
-
content feeds, each having metadata that may be represented by text;
a clustering engine for clustering a text representing metadata of a content feed in a cluster along with one or more other texts representing metadata of the content feeds determined to be a good nearest neighbor of the text; and
a good nearest neighbor analyzer operably coupled to the clustering engine for determining the one or more other texts to be a good nearest neighbor of the text. - View Dependent Claims (2, 3, 4)
-
-
5. A computer-implemented method for clustering content, comprising:
-
converting metadata describing items of content from content feeds into a set of texts;
determining at least one text in a set of texts to be a good nearest neighbor of an other text in the set of texts;
clustering the other text in a cluster;
clustering the at least one text determined to be the good nearest neighbor of the other text in the cluster; and
outputting a cluster of items of content from the content feeds associated with the other text and the at least one text. - View Dependent Claims (6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A computer system for clustering content, comprising:
-
means for converting metadata describing items of content from content feeds into a set of texts;
means for determining at least one text in a set of texts to be a good nearest neighbor of an other text in the set of texts;
means for clustering the other text in a cluster;
means for clustering the at least one text determined to be the good nearest neighbor of the other text in the cluster; and
means for outputting a cluster of items of content from the content feeds associated with the other text and the at least one text. - View Dependent Claims (20)
-
Specification