SVO-based taxonomy-driven text analytics
First Claim
Patent Images
1. A method comprising:
- receiving textual data, and storing the received data in memory;
analyzing the stored data, wherein the analysis comprises;
identifying at least one sentence from the stored data;
parsing parts of speech of each word of the at least one identified sentence using a linguistic parser, including parsing a verb from the at least one parsed sentence, and identifying a verb usage pattern in the at least one identified sentence;
forming a low level subject-verb-object (SVO) triplet for the at least one parsed sentence, including identifying a subject, verb, and object of the at least one parsed sentence, wherein the identification of the subject, verb, and object comprises joining the identified verb usage pattern with a form of the identified verb to ascertain linguistic taxonomy;
forming a high level SVO triplet for the at least one parsed sentence, including determining a subject category for the subject, a verb category for the verb, and an object category for the object based on the taxonomy; and
classifying the at least one parsed sentence based on the high level SVO triplet; and
summarizing the analysis of the stored data, including;
producing an analysis report reflective of the analysis; and
converting the produced analysis report into a summary report reflective of the analysis report, including clustering the received textual data into one or more statement clusters, wherein the summary report comprises the statement clusters.
1 Assignment
0 Petitions
Accused Products
Abstract
Textual data is organized into statement clusters. Sentences are extracted from textual data and parsed. A verb usage pattern is identified and an SVO triplet is determined. The SVO triplet is compared to a taxonomy associated with the domain of the data and a sentiment is derived. A statement cluster is constructed comprising a higher level SVO triplet sensitive to the taxonomy and verb usage pattern, as well as the derived sentiment. Accordingly, the statement clusters may be organized by grouping.
13 Citations
8 Claims
-
1. A method comprising:
-
receiving textual data, and storing the received data in memory; analyzing the stored data, wherein the analysis comprises; identifying at least one sentence from the stored data; parsing parts of speech of each word of the at least one identified sentence using a linguistic parser, including parsing a verb from the at least one parsed sentence, and identifying a verb usage pattern in the at least one identified sentence; forming a low level subject-verb-object (SVO) triplet for the at least one parsed sentence, including identifying a subject, verb, and object of the at least one parsed sentence, wherein the identification of the subject, verb, and object comprises joining the identified verb usage pattern with a form of the identified verb to ascertain linguistic taxonomy; forming a high level SVO triplet for the at least one parsed sentence, including determining a subject category for the subject, a verb category for the verb, and an object category for the object based on the taxonomy; and classifying the at least one parsed sentence based on the high level SVO triplet; and summarizing the analysis of the stored data, including; producing an analysis report reflective of the analysis; and converting the produced analysis report into a summary report reflective of the analysis report, including clustering the received textual data into one or more statement clusters, wherein the summary report comprises the statement clusters. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
Specification