Generating and applying event data extraction templates
Abstract
Techniques are described herein for generating and applying event data extraction templates. In various implementations, a data extraction template may be applied to structured communications to extract, from each structured communication, event data associated with a transient markup language path indicated in the data extraction template. The data extraction template may include an event-related semantic data type assigned to the transient markup language path and a strength of association between the transient markup language path and the event-related semantic data type. Feedback may be obtained concerning event data extracted from one or more of the structured communications. Based on the feedback, the strength of association between the transient markup language path and the event-related semantic data type may be altered. The data extraction template may then be applied to a subsequent structured communication to extract new event data from the subsequent structured communication based on the altered strength of association.
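As a rough illustration of what such a template might look like in practice, the following Python sketch maps markup language paths to a semantic data type and a strength of association, then applies the template to two communications that share a layout. Everything named here is an assumption made for illustration and does not come from the patent: the class names `PathAssignment` and `ExtractionTemplate`, the labels `event_title` and `event_start`, the 0.0 to 1.0 strength scale, and the sample messages. It also assumes that a "transient" path is one whose text varies between communications, and that the markup is well-formed enough for Python's `xml.etree.ElementTree`.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class PathAssignment:
    """Semantic data type assigned to one path, plus its association strength."""
    semantic_type: str   # hypothetical label, e.g. "event_title"
    strength: float      # assumed here to be a score between 0.0 and 1.0


@dataclass
class ExtractionTemplate:
    """Maps each transient markup language path to its assignment."""
    assignments: Dict[str, PathAssignment] = field(default_factory=dict)

    def apply(self, markup: str) -> Dict[str, str]:
        """Return {semantic_type: text} for every template path found in the markup."""
        root = ET.fromstring(markup)
        extracted = {}
        for path, assignment in self.assignments.items():
            node = root.find(path)  # uses ElementTree's limited XPath subset
            if node is not None and node.text:
                extracted[assignment.semantic_type] = node.text.strip()
        return extracted


# Hypothetical template: both paths point at content that changes per message.
template = ExtractionTemplate({
    "body/p[1]": PathAssignment("event_title", 0.9),
    "body/p[2]": PathAssignment("event_start", 0.7),
})

# Two made-up communications with the same structure; only the <p> text differs.
emails = [
    "<html><body><h1>Your booking</h1><p>Jazz Night</p><p>2024-05-01 19:30</p></body></html>",
    "<html><body><h1>Your booking</h1><p>Poetry Slam</p><p>2024-06-12 20:00</p></body></html>",
]
results = [template.apply(email) for email in emails]
```

The fixed heading (`h1`) is deliberately left out of the template, since it carries no event data. Real HTML email is rarely well-formed XML, so a production system would likely need a more lenient parser than the one assumed here.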
14 Claims
1. A computer-implemented method, comprising:
providing a data extraction template that indicates one or more transient markup language paths, including a given transient markup language path, wherein the data extraction template includes an event-related semantic data type assigned to the given transient markup language path and a strength of association between the given transient structural path and the event-related semantic data type;
applying the data extraction template to a plurality of structured communications to extract, from each of the plurality of structured communications, event data associated with the given transient markup language path indicated in the data extraction template;
obtaining feedback concerning the event data extracted from one or more of the structured communications based on the given transient markup language path indicated in the data extraction template;
altering, in the data extraction template, the strength of association between the given transient markup language path and the event-related semantic data type assigned to the transient markup language path based on the feedback; and
subsequent to the altering, applying the data extraction template to a subsequent structured communication to extract new event data from the subsequent structured communication based on the given transient markup language path, wherein the new event data is extracted based on the altered strength of association.
Dependent claims: 2, 3, 4, 5, 6
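The feedback steps of the method can be sketched as follows. The claim does not say how the strength of association is altered or how it influences later extraction, so the choices below are assumptions made only for illustration: a fixed additive step, clamping to the range 0.0 to 1.0, and a threshold below which a path is skipped. The names `Association`, `alter_strength`, and `extract` are hypothetical, and each communication is represented as an already-parsed mapping from path to text to keep the sketch short.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass
class Association:
    semantic_type: str   # hypothetical label, e.g. "event_start"
    strength: float      # assumed range: 0.0 (no confidence) to 1.0 (full confidence)


def alter_strength(assoc: Association, feedback_is_positive: bool, step: float = 0.2) -> None:
    """Raise or lower the strength by a fixed step (an assumed update rule), clamped to [0, 1]."""
    delta = step if feedback_is_positive else -step
    assoc.strength = min(1.0, max(0.0, assoc.strength + delta))


def extract(template: Dict[str, Association],
            communication: Dict[str, str],
            threshold: float = 0.5) -> Dict[str, str]:
    """Extract text only for paths whose strength meets the (assumed) threshold."""
    return {
        assoc.semantic_type: communication[path]
        for path, assoc in template.items()
        if path in communication and assoc.strength >= threshold
    }


# Hypothetical template with a single path and an initial strength of 0.6.
template = {"body/p[2]": Association("event_start", 0.6)}

# Initial application: the strength is above the threshold, so data is extracted.
before = extract(template, {"body/p[2]": "2024-05-01 19:30"})

# Negative feedback on that extraction lowers the strength below the threshold.
alter_strength(template["body/p[2]"], feedback_is_positive=False)

# Subsequent communication: the same path is now skipped.
after = extract(template, {"body/p[2]": "2024-06-12 20:00"})
```

Under these assumptions, negative feedback eventually stops a path from contributing event data, while positive feedback keeps it in use. Other designs, such as attaching the strength to the extracted value as a confidence score instead of filtering, would fit the claim language equally well.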
7. A system including memory and one or more processors operable to execute instructions stored in the memory, comprising instructions to:
provide a data extraction template that indicates one or more transient markup language paths, including a given transient markup language path, wherein the data extraction template includes an event-related semantic data type assigned to the given transient markup language path and a strength of association between the given transient structural path and the event-related semantic data type;
apply the data extraction template to a plurality of structured communications to extract, from each of the plurality of structured communications, event data associated with the given transient markup language path indicated in the data extraction template;
obtain feedback concerning the event data extracted from one or more of the structured communications based on the transient markup language path indicated in the data extraction template;
alter, in the data extraction template, the strength of association between the given transient markup language path and the event-related semantic data type assigned to the transient markup language path based on the feedback; and
subsequent to the altering, apply the data extraction template to a subsequent structured communication to extract new event data from the subsequent structured communication, wherein the new event data is extracted based on the altered strength of association.
Dependent claims: 8, 9, 10, 11, 12
13. At least one non-transitory computer-readable medium comprising instructions that, in response to execution of the instructions by one or more processors, cause the one or more processors to perform the following operations:
providing a data extraction template that indicates one or more transient markup language paths, including a given transient markup language path, wherein the data extraction template includes an event-related semantic data type assigned to the given transient markup language path and a strength of association between the given transient structural path and the event-related semantic data type;
applying the data extraction template to a plurality of structured communications to extract, from each of the plurality of structured communications, event data associated with the given transient markup language path indicated in the data extraction template;
obtaining feedback concerning the event data extracted from one or more of the structured communications based on the transient markup language path indicated in the data extraction template;
altering, in the data extraction template, the strength of association between the given transient markup language path and the event-related semantic data type assigned to the transient markup language path based on the feedback; and
subsequent to the altering, applying the data extraction template to a subsequent structured communication to extract new event data from the subsequent structured communication based on the given transient markup language path, wherein the new event data is extracted based on the altered strength of association.
Dependent claims: 14
Specification