Method for categorizing content published on internet
First Claim
1. A method for categorizing at least one content on Internet, the method comprising:
- gathering one or more feeds associated with the at least one content, wherein the at least one content is provided by at least one content provider;
extracting contextual information from the one or more feeds, wherein the contextual information is embedded into the one or more feeds;
categorizing the at least one content into at least one general web-based category, the at least one general web-based category belonging to a set of general web-based categories, the categorizing step comprising;
performing a semantic analysis of the contextual information,wherein the semantic analysis of the contextual information yields a keyword string corresponding to the contextual information; and
classifying the at least one content into the at least one general web-based category based on the keyword string;
translating the set of general web-based categories to a set of pre-defined categories, wherein one or more general web-based categories from the set of general web-based categories are translated to at least one pre-defined category in the set of pre-defined categories, wherein the at least one content belongs to at least one pre-defined category when translating the set of general web-based categories to the set of pre-defined categories;
wherein the categorizing step further comprises computing a first relevance percentage corresponding to the at least one content classified into each of the at least one general web-based category.
4 Assignments
0 Petitions
Accused Products
Abstract
The present invention provides method and system for categorizing a content published on Internet. The method comprising gathering one or more feeds associated with the content. The method further comprises extracting contextual information from the one or more feeds. Thereafter, the content is categorized into one or more general web-based categories belonging to a set of general web-based categories. The categorizing step further comprises performing a semantic analysis of the contextual information that yields a keyword string. The content is classified into the one or more general web-based category based on the keyword string. Finally, the set of general web-based categories is translated to a set of pre-defined categories, such that one or more general web-based category is translated to a pre-defined category that is relevant to an end user.
48 Citations
17 Claims
-
1. A method for categorizing at least one content on Internet, the method comprising:
-
gathering one or more feeds associated with the at least one content, wherein the at least one content is provided by at least one content provider;
extracting contextual information from the one or more feeds, wherein the contextual information is embedded into the one or more feeds;
categorizing the at least one content into at least one general web-based category, the at least one general web-based category belonging to a set of general web-based categories, the categorizing step comprising;performing a semantic analysis of the contextual information, wherein the semantic analysis of the contextual information yields a keyword string corresponding to the contextual information; and classifying the at least one content into the at least one general web-based category based on the keyword string; translating the set of general web-based categories to a set of pre-defined categories, wherein one or more general web-based categories from the set of general web-based categories are translated to at least one pre-defined category in the set of pre-defined categories, wherein the at least one content belongs to at least one pre-defined category when translating the set of general web-based categories to the set of pre-defined categories; wherein the categorizing step further comprises computing a first relevance percentage corresponding to the at least one content classified into each of the at least one general web-based category. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A computing system for categorizing at least one content on Internet, the computing system comprising:
- a processor implemented gathering module, the processor implemented gathering module gathering one or more feeds associated with the at least one content, the at least one content provided by at least one content provider;
a processor implemented extracting module, the processor implemented extracting module extracting contextual information from the one or more feeds, wherein the contextual information is embedded into the one or more feeds; a processor implemented categorizing module, the processor implemented categorizing module categorizing the at least one content into at least one general web-based category, the at least one general web-based category belonging to a set of general web-based categories, the processor implemented categorizing module comprising; a processor implemented analyzing module, the processor implemented analyzing module performing a semantic analysis of the contextual information, wherein the semantic analysis of the contextual information yields a keyword string corresponding to the contextual information; and a processor implemented classifying module, the processor implemented classifying module classifying the at least one content into the at least one general web-based category based on the keyword string; and a processor implemented translating module, the processor implemented translating module translating the set of general web-based categories to a set of pre-defined categories, wherein one or more general web-based categories from the set of general web-based categories are translated to at least one pre-defined category in the set of pre-defined categories, wherein the at least one content belongs to at least one pre-defined category when translating the set of general web-based categories to the set of pre-defined categories; and wherein the categorizing module further comprises a first computing module, the first computing module computing a first relevance percentage corresponding to the at least one content classified into each of the at least one general web-based category. - View Dependent Claims (13, 14, 15, 16)
- a processor implemented gathering module, the processor implemented gathering module gathering one or more feeds associated with the at least one content, the at least one content provided by at least one content provider;
-
17. A computer program product comprising a computer usable medium having a computer readable program for categorizing at least one content on Internet, wherein the computer readable program when executed on a computer causes the computer to:
-
gather one or more feeds associated with the at least one content, wherein the at least one content is provided by at least one content provider;
extract contextual information from the one or more feeds, wherein the contextual information is embedded into the one or more feeds;categorize the at least one content into at least one general web-based category, the at least one general web-based category belonging to a set of general web-based categories, the computer readable program further causes the computer to; perform a semantic analysis of the contextual information, wherein the semantic analysis of the contextual information yields a keyword string corresponding to the contextual information; and classify the at least one content into the at least one general web-based category based on the keyword string; translate the set of general web-based categories to a set of pre-defined categories, wherein one or more general web-based categories from the set of general web-based categories are translated to at least one pre-defined category in the set of pre-defined categories, wherein the at least one content belongs to at least one pre-defined category when translating the set of general web-based categories to the set of pre-defined categories; wherein the computer readable program when executed on the computer further causes the computer to;
compute a first relevance percentage corresponding to the at least one content classified into each of the at least one general web-based category; andcompute a second relevance percentage corresponding to the at least one content belonging to each of the at least one pre-defined category.
-
Specification