Identifying Content Relationship for Content Copied by a Content Identification Mechanism
First Claim
1. A method, in a data processing system comprising a processor and a memory configured to implement a natural language processing (NLP) system, for identifying content relationship for content copied by a content identification mechanism, the method comprising:
- identifying, by the content identification mechanism, the content from a website using natural language processing;
identifying, by the content identification mechanism, relationship content information associated with a current web page where the content is found on the website;
modifying, by the content identification mechanism, a file structure associated with the content with the relationship content information;
identifying, by the content identification mechanism, one or more classification identifiers in order to classify the content; and
transmitting, by the content identification mechanism, the content and the file structure to a specific corpus in the NLP system based on the one or more classification identifiers.
1 Assignment
0 Petitions
Accused Products
Abstract
A mechanism is provided, in a data processing system comprising a processor and a memory configured to implement a natural language processing (NLP) system, for identifying content relationship for content copied by a content identification mechanism. The content identification mechanism identifies content from a website and then identifies relationship content information associated with a current web page where the content is found. The content identification mechanism modifies a file structure associated with the content with the relationship content information. The content identification mechanism identifies one or more classification identifiers in order to classify the content. Finally, the content identification mechanism transmits the content and the file structure to a specific corpus based on the one or more classification identifiers.
30 Citations
20 Claims
-
1. A method, in a data processing system comprising a processor and a memory configured to implement a natural language processing (NLP) system, for identifying content relationship for content copied by a content identification mechanism, the method comprising:
-
identifying, by the content identification mechanism, the content from a website using natural language processing; identifying, by the content identification mechanism, relationship content information associated with a current web page where the content is found on the website; modifying, by the content identification mechanism, a file structure associated with the content with the relationship content information; identifying, by the content identification mechanism, one or more classification identifiers in order to classify the content; and transmitting, by the content identification mechanism, the content and the file structure to a specific corpus in the NLP system based on the one or more classification identifiers. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A computer program product comprising a computer readable storage medium having a computer readable program stored therein, wherein the computer readable program, when executed on a computing device, causes the computing device to:
-
identify content from a website using natural language processing (NLP); identify relationship content information associated with a current web page where the content is found on the website; modify a file structure associated with the content with the relationship content information; identify one or more classification identifiers in order to classify the content; and transmit the content and the file structure to a specific corpus in a QA system based on the one or more classification identifiers. - View Dependent Claims (10, 11, 12, 13, 14)
-
-
15. An apparatus comprising:
-
a processor; and a memory coupled to the processor, wherein the memory comprises instructions which, when executed by the processor, cause the processor to; identify content from a website using natural language processing (NLP); identify relationship content information associated with a current web page where the content is found on the website; modify a file structure associated with the content with the relationship content information; identify one or more classification identifiers in order to classify the content; and transmit the content and the file structure to a specific corpus in a QA system based on the one or more classification identifiers. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification