Spam detection for user-generated multimedia items based on appearance in popular queries
First Claim
Patent Images
1. A computer-implemented method of identifying spam in a collection of multimedia videos, comprising:
- storing the collection of videos;
establishing a set of top queries for the videos in the collection and identifying results of each query in the set of top queries, wherein the set of top queries comprises a set of at least ten thousand frequently made queries for the videos in the collection, and wherein the results of the set of top queries comprise one or more of the videos from the collection;
for a video in the collection, counting a number of times that the video appears in the results of the set of top queries; and
responsive to the number of times that the video appears in the results of the set of top queries exceeding a predetermined threshold, marking the video as spam.
2 Assignments
0 Petitions
Accused Products
Abstract
A system, a method, and various software tools enable a video hosting website to automatically identify posted video items that contain spam in the metadata associated with a respective video item. A spam detection tool for user-generated video items based on appearance in popular queries is provided that facilitates the detection of spam in the metadata associated with a video item.
-
Citations
9 Claims
-
1. A computer-implemented method of identifying spam in a collection of multimedia videos, comprising:
-
storing the collection of videos; establishing a set of top queries for the videos in the collection and identifying results of each query in the set of top queries, wherein the set of top queries comprises a set of at least ten thousand frequently made queries for the videos in the collection, and wherein the results of the set of top queries comprise one or more of the videos from the collection; for a video in the collection, counting a number of times that the video appears in the results of the set of top queries; and responsive to the number of times that the video appears in the results of the set of top queries exceeding a predetermined threshold, marking the video as spam. - View Dependent Claims (2, 3)
-
-
4. A system, comprising:
-
a computer processor; a non-transitory computer-readable storage medium storing computer program code, comprising; means for storing a collection of videos; means for establishing a set of top queries for the videos in the collection and identifying results of each query in the set of top queries, wherein the set of top queries comprises a set of at least ten thousand frequently made queries for the videos in the collection, and wherein the results of the set of top queries comprise one or more of the videos from the collection; means for counting a number of times a video in the collection appears in the results of the set of top queries; and means for marking the video as spam, the means for marking responsive to the number of times that the video appears in the results of the set of top queries exceeding a predetermined threshold. - View Dependent Claims (5, 6)
-
-
7. A non-transitory computer-readable storage medium, comprising computer program code for:
-
storing a collection of videos; establishing a set of top queries for the videos in the collection and identifying results of each query in the set of top queries, wherein the set of top queries comprises a set of at least ten thousand frequently made queries for the videos in the collection, and wherein the results of the set of top queries comprise one or more of the videos from the collection; for a video in the collection, counting a number of times that the video appears in the results of the set of top queries; and responsive to the number of times that the video appears in the results of the set of top queries exceeding a predetermined threshold, marking the video as spam. - View Dependent Claims (8, 9)
-
Specification