METHOD FOR CATEGORIZING CONTENT PUBLISHED ON INTERNET
First Claim
1. A method for categorizing at least one content on Internet, the method comprising:
- gathering one or more feeds associated with the at least one content, wherein the at least one content is provided by at least one content provider;
extracting contextual information from the one or more feeds, wherein the contextual information is embedded into the one or more feeds;
categorizing the at least one content into at least one general web-based category, the at least one general web-based category belonging to a set of general web-based categories, the categorizing step comprising;
performing a semantic analysis of the contextual information, wherein the semantic analysis of the contextual information yields a keyword string corresponding to the contextual information; and
classifying the at least one content into the at least one general web-based category based on the keyword string;
translating the set of general web-based categories to a set of pre-defined categories, wherein one or more general web-based categories from the set of general web-based categories are translated to at least one pre-defined category in the set of pre-defined categories, wherein the at least one content belongs to at least one pre-defined category when translating the set of general web-based categories to the set of pre-defined categories.
4 Assignments
0 Petitions
Accused Products
Abstract
The present invention provides method and system for categorizing a content published on Internet. The method comprising gathering one or more feeds associated with the content. The method further comprises extracting contextual information from the one or more feeds. Thereafter, the content is categorized into one or more general web-based categories belonging to a set of general web-based categories. The categorizing step further comprises performing a semantic analysis of the contextual information that yields a keyword string. The content is classified into the one or more general web-based category based on the keyword string. Finally, the set of general web-based categories is translated to a set of pre-defined categories, such that one or more general web-based category is translated to a pre-defined category that is relevant to an end user.
-
Citations
20 Claims
-
1. A method for categorizing at least one content on Internet, the method comprising:
-
gathering one or more feeds associated with the at least one content, wherein the at least one content is provided by at least one content provider; extracting contextual information from the one or more feeds, wherein the contextual information is embedded into the one or more feeds; categorizing the at least one content into at least one general web-based category, the at least one general web-based category belonging to a set of general web-based categories, the categorizing step comprising; performing a semantic analysis of the contextual information, wherein the semantic analysis of the contextual information yields a keyword string corresponding to the contextual information; and classifying the at least one content into the at least one general web-based category based on the keyword string; translating the set of general web-based categories to a set of pre-defined categories, wherein one or more general web-based categories from the set of general web-based categories are translated to at least one pre-defined category in the set of pre-defined categories, wherein the at least one content belongs to at least one pre-defined category when translating the set of general web-based categories to the set of pre-defined categories. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A system for categorizing at least one content on Internet, the system comprising:
-
a gathering module, the gathering module gathering one or more feeds associated with the at least one content, the at least one content provided by at least one content provider; an extracting module, the extracting module extracting contextual information from the one or more feeds, wherein the contextual information is embedded into the one or more feeds; a categorizing module, the categorizing module categorizing the at least one content into at least one general web-based category, the at least one general web-based category belonging to a set of general web-based categories, the categorizing module comprising; an analyzing module, the analyzing module performing a semantic analysis of the contextual information, wherein the semantic analysis of the contextual information yields a keyword string corresponding to the contextual information; and a classifying module, the classifying module classifying the at least one content into the at least one general web-based category based on the keyword string; and a translating module, the translating module translating the set of general web-based categories to a set of pre-defined categories, wherein one or more general web-based categories from the set of general web-based categories are translated to at least one pre-defined category in the set of pre-defined categories, wherein the at least one content belongs to at least one pre-defined category when translating the set of general web-based categories to the set of pre-defined categories. - View Dependent Claims (14, 15, 16, 17, 18)
-
-
19. A computer program product comprising a computer usable medium having a computer readable program for categorizing at least one content on Internet, wherein the computer readable program when executed on a computer causes the computer to:
-
gather one or more feeds associated with the at least one content, wherein the at least one content is provided by at least one content provider; extract contextual information from the one or more feeds, wherein the contextual information is embedded into the one or more feeds; categorize the at least one content into at least one general web-based category, the at least one general web-based category belonging to a set of general web-based categories, the computer readable program further causes the computer to; perform a semantic analysis of the contextual information, wherein the semantic analysis of the contextual information yields a keyword string corresponding to the contextual information; and classify the at least one content into the at least one general web-based category based on the keyword string; translate the set of general web-based categories to a set of pre-defined categories, wherein one or more general web-based categories from the set of general web-based categories are translated to at least one pre-defined category in the set of pre-defined categories, wherein the at least one content belongs to at least one pre-defined category when translating the set of general web-based categories to the set of pre-defined categories. - View Dependent Claims (20)
-
Specification