Multimedia content filtering
First Claim
1. A method, comprising:
analyzing, by one or more processing devices, a web page content for predetermined parameters, wherein at least one of the predetermined parameters is based on an image media content;
generating a tag that encapsulates the at least one predetermined parameter;
processing the web page content to provide text representing the web page content;
inserting the tag into the text to provide tokens;
inputting the tokens into a latent semantic mapping (LSM) filter;
mapping the tokens into a vector space of the latent semantic mapping filter;
analyzing, by the one or more processing devices, the web page content using the latent semantic mapping filter wherein the vector space of the latent semantic mapping filter includes a first plurality of vectors at a first location and a second plurality of vectors at a second location, wherein the first location comprises materials related to predefined legitimate multimedia content, the second location comprises materials related to predefined explicit multimedia content, and at least one input into the latent semantic mapping filter comprises one or more representations of the web page content that are mapped to a third location in the vector space;
determining, by the one or more processing devices, distances between the third location and the first location, and the third location and the second location; and
filtering, by the one or more processing devices, the web page content based on the determined distances.
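The last three steps of the claim (mapping to a third location, determining the two distances, and filtering on them) can be illustrated with a minimal Python sketch. It makes several assumptions that are not in the claim: raw term counts stand in for the LSM vector space (a real latent semantic mapping would first reduce dimensionality, typically with a singular value decomposition), each "location" is taken to be the centroid of its training vectors, cosine distance is the distance measure, and the function names are hypothetical.

```python
import math
from collections import Counter


def to_vector(tokens, vocabulary):
    """Map a token list to a term-count vector over a fixed vocabulary."""
    counts = Counter(tokens)
    return [float(counts[term]) for term in vocabulary]


def centroid(vectors):
    """Average a group of vectors; used here as that group's 'location'."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def cosine_distance(a, b):
    """Return 1 - cosine similarity; 1.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 1.0
    return 1.0 - dot / (norm_a * norm_b)


def should_filter(page_tokens, legitimate_docs, explicit_docs):
    """Filter the page when it lies closer to the explicit location."""
    vocabulary = sorted({t for doc in legitimate_docs + explicit_docs for t in doc})
    first_location = centroid([to_vector(d, vocabulary) for d in legitimate_docs])
    second_location = centroid([to_vector(d, vocabulary) for d in explicit_docs])
    third_location = to_vector(page_tokens, vocabulary)
    d_legitimate = cosine_distance(third_location, first_location)
    d_explicit = cosine_distance(third_location, second_location)
    return d_explicit < d_legitimate
```

The decision rule (filter when the explicit location is nearer) is one simple reading of "filtering based on the determined distances"; a threshold on the difference or ratio of the two distances would fit the claim language equally well.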
Abstract
Methods and apparatuses to filter multimedia content are described. The multimedia content in one embodiment is analyzed for one or more parameters. The multimedia content in one embodiment is filtered based on the one or more parameters using a latent semantic mapping (“LSM”) filter. In one embodiment, the one or more parameters include information about a structure of the multimedia content. A tag that encapsulates the one or more parameters may be generated. Then, the tag is input into the latent semantic mapping filter. In one embodiment, the LSM filter is trained to recognize the multimedia content based on the one or more parameters. In one embodiment, more than two categories are provided for a multimedia content. The multimedia content is classified in more than two categories using the LSM filter. The multimedia content may be blocked based on the classifying.
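As a rough illustration of how structural parameters might be encapsulated in tags and fed to the filter alongside the page text, consider the following Python sketch. Everything specific in it is an assumption made for the example: the choice of parameters (image and link counts), the bucketing thresholds, the `TAG_..._...` naming scheme, and the function names are hypothetical and do not come from the patent.

```python
import re


def analyze_structure(html):
    """Extract plain-text words and a few structural parameters from HTML."""
    parameters = {
        "images": len(re.findall(r"<img\b", html, flags=re.IGNORECASE)),
        "links": len(re.findall(r"<a\b", html, flags=re.IGNORECASE)),
    }
    text = re.sub(r"<[^>]+>", " ", html)           # drop markup, keep text
    words = re.findall(r"[a-z0-9']+", text.lower())
    return parameters, words


def bucket(value):
    """Coarsen a count so that similar pages produce the same tag."""
    if value == 0:
        return "NONE"
    if value < 5:
        return "FEW"
    return "MANY"


def make_tags(parameters):
    """Encapsulate each parameter in a single tag token."""
    return [
        "TAG_IMAGES_" + bucket(parameters["images"]),
        "TAG_LINKS_" + bucket(parameters["links"]),
    ]


def tokens_for_filter(html):
    """Combine the page's words with its tags to form the filter input."""
    parameters, words = analyze_structure(html)
    return words + make_tags(parameters)
```

Because each tag is just another token, the filter can treat structural evidence (for example, many images and little text) in the same way as ordinary words.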
53 Claims
10. A method, comprising:
analyzing a multimedia content for one or more parameters;
generating one or more tags associated with the one or more parameters;
providing the one or more tags to a latent semantic (LSM) filter;
providing, by one or more processing devices, a vector space having at least two categories for the multimedia content, the vector space comprising a first plurality of vectors at a first location, and a second plurality of vectors at a second location, wherein the first location comprises materials related to predefined legitimate multimedia content, the second location comprises materials related to predefined explicit multimedia content, and wherein one or more representations of a new multimedia content are mapped to a third location in the vector space;
determining by the one or more processing devices distances between the third location and the first location, and the third location and the second location; and
classifying, by the one or more processing devices, a new multimedia content based on the distances determined.

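A vector space with "at least two categories" extends naturally to a nearest-category rule. The Python sketch below is only an illustration under stated assumptions: Euclidean distance is used (the claim does not name a metric), each category is represented by a single location, and the third category name `borderline`, the coordinates, and the `classify` function are hypothetical.

```python
import math


def classify(third_location, category_locations):
    """Return the nearest category and the distance to every category."""
    distances = {
        name: math.dist(third_location, location)
        for name, location in category_locations.items()
    }
    nearest = min(distances, key=distances.get)
    return nearest, distances


# Hypothetical two-dimensional locations for three categories.
locations = {
    "legitimate": [1.0, 0.0],
    "explicit": [0.0, 1.0],
    "borderline": [0.7, 0.7],
}
category, distances = classify([0.1, 0.9], locations)
blocked = category in {"explicit"}  # block only selected categories
```

Keeping the full distance table, rather than only the winning category, leaves room for policies such as blocking only when the margin between the two nearest categories is large.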
15. A method to classify a multimedia content, comprising:
processing, by one or more processing devices, the multimedia content to provide text for an analysis of the multimedia content;
analyzing, by the one or more processing devices, the multimedia content for predetermined parameters, wherein at least one of the predetermined parameters is based on image media content;
generating, by the one or more processing devices, a tag that encapsulates at least one of the predetermined parameters;
associating, by the one or more processing devices, the tag with the text to provide one or more tokens; and
mapping, by the one or more processing devices, the one or more tokens into a vector space containing a first plurality of vectors at a first location, and a second plurality of vectors at a second location, wherein the first location comprises materials related to predefined legitimate multimedia content, the second location comprises materials related to predefined explicit multimedia content, and wherein the one or more tokens are mapped into a third location in the vector space;
determining by the one or more processing devices distances between the third location and the first location, and the third location and the second location; and
determining, by the one or more processing devices, whether to filter the multimedia content based on the distances.

20. An article of manufacture comprising:
a non-transitory machine-accessible storage medium storing data that, when accessed by a machine, cause the machine to perform operations comprising:
analyzing a web page content for predetermined parameters, wherein at least one of the predetermined parameters is based on an image media content;
generating a tag that encapsulates the at least one predetermined parameter;
processing the web page content to provide text representing the web page content;
inserting the tag into the text to provide tokens;
inputting the tokens into a latent semantic mapping (LSM) filter;
mapping the tokens into a vector space of the latent semantic mapping filter;
analyzing the web page content using a latent semantic mapping filter, wherein the latent semantic mapping filter comprises a vector space containing a first plurality of vectors at a first location, and a second plurality of vectors at a second location, wherein the first location comprises materials related to predefined legitimate multimedia content, the second location comprises materials related to predefined explicit multimedia content, and wherein at least one input into the latent semantic mapping filter comprises one or more representations of the web page content that are mapped to a third location in a vector space;
determining distances between the third location and the first location, and the third location and the second location; and
determining whether to filter the web page content based on the distances.

29. An article of manufacture comprising:
a non-transitory machine-accessible storage medium storing data that, when accessed by a machine, cause the machine to perform operations comprising:
analyzing a multimedia content for one or more parameters;
generating one or more tags associated with the one or more parameters;
providing the one or more tags to a latent semantic (LSM) filter;
providing a vector space having at least two categories for the multimedia content, the vector space comprising a first plurality of vectors at a first location, and a second plurality of vectors at a second location, wherein the first location comprises materials related to predefined legitimate multimedia content, the second location comprises materials related to predefined explicit multimedia content, and wherein one or more representations of a new multimedia content are mapped to a third location in the vector space;
determining distances between the third location and the first location, and the third location and the second location; and
classifying a new multimedia content based on the distances determined.

34. An article of manufacture comprising:
a non-transitory machine-accessible storage medium storing data that, when accessed by a machine, cause the machine to perform operations to classify a multimedia content, comprising:
processing the multimedia content to provide text for an analysis of the multimedia content;
analyzing the multimedia content for predetermined parameters, wherein at least one parameter is based on image media content;
generating a tag that encapsulates at least one of the predetermined parameters;
associating the tag with the text to provide one or more tokens; and
mapping the one or more tokens into a vector space containing a first plurality of vectors at a first location, and a second plurality of vectors at a second location, wherein the first location comprises materials related to predefined legitimate multimedia content, the second location comprises materials related to predefined explicit multimedia content, and wherein the one or more tokens are mapped into a third location in the vector space;
determining distances between the third location and the first location, and the third location and the second location; and
determining whether to filter the multimedia content based on the distances.

39. A computer system, comprising:
a bus;
a data storage device coupled to the bus;
one or more processing devices coupled to the data storage device, wherein the data storage device stores instructions executed by the one or more processing devices to perform operations, comprising:
analyzing a web page content for predetermined parameters, wherein at least one parameter is based on an image media content;
generating a tag that encapsulates the at least one parameter;
processing the web page content to provide text for an analysis of the web page content;
inserting the tag into the text to provide tokens;
mapping the tokens into a vector space of a latent semantic mapping filter;
analyzing the web page content using the latent semantic mapping filter, wherein the latent semantic mapping filter comprises a vector space containing a first plurality of vectors at a first location, and a second plurality of vectors at a second location, wherein the first location comprises materials related to predefined legitimate multimedia content, the second location comprises materials related to predefined explicit multimedia content, and wherein at least one input into the latent semantic mapping filter comprises one or more representations of the image media content that are mapped to a third location in the vector space;
determining distances between the third location and the first location, and the third location and the second location; and
determining whether to filter the web page content based on the distances.

42. A computer system, comprising:
a bus;
a data storage device coupled to the bus;
one or more processing devices coupled to the storage device, wherein the storage device stores instructions executed by the one or more processing devices to perform operations, comprising:
analyzing a multimedia content for one or more parameters;
generating one or more tags associated with the one or more parameters;
providing the one or more tags to a latent semantic (LSM) filter;
providing a vector space having at least two categories for the multimedia content, the vector space comprising a first plurality of vectors at a first location, and a second plurality of vectors at a second location, wherein the first location comprises materials related to predefined legitimate multimedia content, the second location comprises materials related to predefined explicit multimedia content, and wherein one or more representations of a new multimedia content are mapped to a third location in the vector space;
determining distances between the third location and the first location, and the third location and the second location; and
classifying a new multimedia content based on distances determined.

45. A computer system to classify a multimedia content, comprising:
a bus;
a data storage device coupled to the bus;
one or more processing devices coupled to the data storage device, wherein the data storage device stores instructions executed by the one or more processing devices to perform operations, comprising:
processing the multimedia content to provide text for analysis of the multimedia content;
analyzing the multimedia content for predetermined parameters, wherein at least one of the predetermined parameters is based on image media content;
generating a tag that encapsulates the at least one of the predetermined parameters;
associating the tag with the text to provide one or more tokens; and
mapping the one or more tokens into a vector space containing a first plurality of vectors at a first location, and a second plurality of vectors at a second location, wherein the first location comprises materials related to predefined legitimate multimedia content, the second location comprises materials related to predefined explicit multimedia content, and wherein the one or more tokens are mapped into a third location in the vector space;
determining distances between the third location and the first location, and the third location and the second location; and
determining whether to filter the multimedia content based on the distances.

48. A method, comprising:
processing, by one or more processing devices, multimedia content to provide processed content for use by a latent semantic mapping filter;
analyzing the multimedia content for predetermined parameters;
generating a tag that encapsulates at least one of the predetermined parameters;
processing the multimedia content to provide text representing the multimedia content;
inserting the tag into the text to provide tokens;
inputting the tokens into the latent semantic mapping filter;
mapping the tokens into a vector space of the latent semantic mapping filter;
classifying, by the one or more processing devices, the processed content with the latent semantic mapping filter including the vector space, the vector space having at least two categories for the multimedia content, the vector space comprising a first plurality of vectors at a first location, and a second plurality of vectors at a second location, wherein the first location comprises materials related to predefined legitimate multimedia content, the second location comprises materials related to predefined explicit multimedia content, and wherein one or more representations of the multimedia content are mapped to a third location in the vector space and wherein distances between the third location and the first location, and the third location and the second location are determined; and
determining, by the one or more processing devices, whether to filter the multimedia content based upon the classifying.

49. An article of manufacture, comprising:
a non-transitory machine-readable storage medium storing executable program instructions which when executed by a data processing system cause the system to perform operations comprising:
processing multimedia content to provide processed content for use by a latent semantic mapping filter;
analyzing the multimedia content for predetermined parameters;
generating a tag that encapsulates at least one of the predetermined parameters;
processing the multimedia content to provide text representing the multimedia content;
inserting the tag into the text to provide tokens;
inputting the tokens into the latent semantic mapping filter;
mapping the tokens into a vector space of the latent semantic mapping filter;
classifying the processed content with the latent semantic mapping filter including the vector space, the vector space having at least two categories for the multimedia content, the vector space comprising a first plurality of vectors at a first location, and a second plurality of vectors at a second location, wherein the first location comprises materials related to predefined legitimate multimedia content, the second location comprises materials related to predefined explicit multimedia content, and wherein one or more representations of the multimedia content are mapped to a third location in the vector space and wherein distances between the third location and the first location, and the third location and the second location are determined; and
determining whether to filter the multimedia content based upon the classifying.

50. A method, comprising:
presenting a user interface on a display device;
receiving, by one or more processing devices, input from the user interface;
placing, by the one or more processing devices, a latent semantic mapping filter of a web page content to operate in a training mode to recognize which kind of multimedia content to filter out in response to the input from the user interface;
analyzing, by the one or more processing devices, the web page content for predetermined parameters;
generating a tag that encapsulates at least one predetermined parameter;
processing the web page content to provide text representing the web page content;
inserting the tag into the text to provide tokens;
inputting the tokens into the latent semantic mapping filter; and
mapping the tokens into a vector space of the latent semantic mapping filter;
wherein the latent semantic mapping filter includes a first plurality of vectors at a first location in a vector space, and a second plurality of vectors at a second location in the vector space, wherein the first location comprises materials related to predefined legitimate multimedia content, the second location comprises materials related to predefined explicit multimedia content, and wherein one or more representations of the web page content are mapped to a third location in the vector space, and wherein distances between the third location and the first location, and the third location and the second location are determined to recognize the multimedia content.

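The training mode recited in claim 50 can be pictured with a small Python sketch. It is a simplification under explicit assumptions: the command strings `start_training` and `stop_training`, the `TrainableFilter` class, and its methods are hypothetical; per-category token counts stand in for the trained vector space; and cosine distance to the accumulated counts plays the role of the distance to each location.

```python
import math
from collections import Counter


class TrainableFilter:
    """Toy filter that learns two categories from user-labeled token lists."""

    def __init__(self):
        self.training_mode = False
        self.totals = {"legitimate": Counter(), "explicit": Counter()}

    def handle_user_input(self, command):
        """Switch training mode on or off in response to UI input."""
        if command == "start_training":
            self.training_mode = True
        elif command == "stop_training":
            self.training_mode = False

    def train(self, tokens, label):
        """Add one labeled example; only allowed while in training mode."""
        if not self.training_mode:
            raise RuntimeError("filter is not in training mode")
        self.totals[label].update(tokens)

    def _distance(self, tokens, label):
        """Cosine distance between the page and a category's token counts."""
        page, reference = Counter(tokens), self.totals[label]
        dot = sum(page[t] * reference[t] for t in page)
        norm = math.sqrt(sum(v * v for v in page.values())) * math.sqrt(
            sum(v * v for v in reference.values())
        )
        return 1.0 if norm == 0 else 1.0 - dot / norm

    def recognize(self, tokens):
        """Return the category whose location is nearest to the page."""
        return min(self.totals, key=lambda label: self._distance(tokens, label))
```

A caller would typically send `start_training`, feed a handful of labeled pages through `train`, send `stop_training`, and then call `recognize` on new pages.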
53. An article of manufacture, comprising:
a non-transitory machine-readable storage medium storing executable program instructions which when executed by a data processing system cause the system to perform operations comprising:
presenting a user interface;
receiving input from the user interface; and
placing a latent semantic mapping filter of a web page content to operate in a training mode to recognize which kind of multimedia content to filter out in response to the input from the user interface;
analyzing the web page content for predetermined parameters;
generating a tag that encapsulates at least one predetermined parameter;
processing the web page content to provide text representing the web page content;
inserting the tag into the text to provide tokens;
inputting the tokens into the latent semantic mapping filter; and
mapping the tokens into a vector space of the latent semantic mapping filter;
wherein the latent semantic mapping filter includes a first plurality of vectors at a first location in a vector space, and a second plurality of vectors at a second location in the vector space, wherein the first location comprises materials related to predefined legitimate multimedia content, the second location comprises materials related to predefined explicit multimedia content, and wherein one or more representations of the web page content are mapped to a third location in the vector space, and wherein distances between the third location and the first location, and the third location and the second location are determined to recognize the multimedia content.

Specification