SVO-BASED TAXONOMY-DRIVEN TEXT ANALYTICS
1 Assignment
0 Petitions
Accused Products
Abstract
Organizing textual data into statement clusters. Sentences are extracted from textual data and parsed. A verb usage pattern is identified and an SVO triplet is determined. The SVO triplet is compared to a taxonomy associated with the domain of the data and a sentiment is derived. A statement cluster is constructed comprising a higher level SVO triplet sensitive to the taxonomy and verb usage pattern, as well as the derived sentiment. Accordingly, the statement clusters may be organized by grouping.
15 Citations
20 Claims
-
1. (canceled)
-
2. (canceled)
-
3. (canceled)
-
4. (canceled)
-
5. (canceled)
-
6. (canceled)
-
7. (canceled)
-
8. A computer program product for classifying data, the computer program product comprising a computer readable storage medium having program code embodied therewith, the program code being executable by a processor to:
-
receive textual data, and to analyze the received data, including the processor to extract at least one sentence from the received data; parse the at least one sentence, including the processor to extract and identify a subject, a verb, and an object, within the parsed sentence; identify a verb usage pattern in the parsed sentence; categorize the extracted and identified subject, verb, and object, the categorization of the verb responsive to the identified verb usage pattern; and classify the sentence based on the categorized subject, verb, and object. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A system comprising:
-
a processing unit in communication with data storage; a functional unit having memory and in communication with the processing unit, the functional unit having tools to support data classification, the tools comprising; an extraction manager in communication with data storage, the extraction manager to extract at least one sentence from textual data and to parse the extracted sentence, including extraction of a subject, verb, and object from the sentence; an identification manager in communication with the extraction manager, the identification manager to identify the subject, verb, object, and a verb usage pattern associated with the verb in the parsed sentence; an organization manager in communication with the identification manager, the organization manager to categorize the extracted and identified subject, verb, and object responsive to the identified verb usage pattern, and to classify the sentence based on the categorized subject, verb, and object. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification