Method and apparatus for automatically structuring free form hetergeneous data
First Claim
1. A method, performed on a data processing system comprising a memory and a data processor coupled to the memory, of automatically structuring free form heterogeneous data, the method comprising the steps of:
- obtaining free form heterogeneous data;
segmenting the free form heterogeneous data into one or more units, wherein the one or more units includes a sentence;
automatically labeling the one or more units based on one or more machine learning techniques, wherein each unit is associated with a label indicating an information structure type, wherein automatically labeling one or more units includes labeling a sentence with a label that indicates a type of information provided by the sentence; and
structuring the one or more labeled units in a format to facilitate one or more operations that use at least a portion of the labeled units.
1 Assignment
0 Petitions
Accused Products
Abstract
Techniques are provided for automatically structuring free form heterogeneous data. In one aspect of the invention, the techniques include obtaining free form heterogeneous data, segmenting the free form heterogeneous data into one or more units, automatically labeling the one or more units based on one or more machine learning techniques, wherein each unit is associated with a label indicating an information type, and structuring the one or more labeled units in a format to facilitate one or more operations that use at least a portion of the labeled units, e.g., information technology (IT) operations.
40 Citations
14 Claims
-
1. A method, performed on a data processing system comprising a memory and a data processor coupled to the memory, of automatically structuring free form heterogeneous data, the method comprising the steps of:
-
obtaining free form heterogeneous data; segmenting the free form heterogeneous data into one or more units, wherein the one or more units includes a sentence; automatically labeling the one or more units based on one or more machine learning techniques, wherein each unit is associated with a label indicating an information structure type, wherein automatically labeling one or more units includes labeling a sentence with a label that indicates a type of information provided by the sentence; and structuring the one or more labeled units in a format to facilitate one or more operations that use at least a portion of the labeled units. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. An apparatus for automatically structuring free form heterogeneous data, comprising:
-
a memory; and at least one processor coupled to the memory and operative to; obtain free form heterogeneous data; segment the free form heterogeneous data into one or more units, wherein the one or more units includes a sentence; automatically label the one or more units based on one or more machine learning techniques, wherein each unit is associated with a label indicating an information structure type, wherein a sentence is automatically labeled with a label that indicates a type of information provided by the sentence; and structure the one or more labeled units in a format to facilitate one or more operations that use at least a portion of the labeled units. - View Dependent Claims (10, 11)
-
-
12. A computer program product comprising a computer useable storage medium having computer useable program code for automatically structuring free form heterogeneous data, the computer program product including:
-
computer useable program code for obtaining free form heterogeneous data; computer useable program code for segmenting the free form heterogeneous data into one or more units, wherein the one or more units includes a sentence; computer useable program code for automatically labeling the one or more units based on one or more machine learning techniques, wherein each unit is associated with a label indicating an information structure type, wherein automatically labeling one or more units includes labeling a sentence with a label that indicates a type of information provided by the sentence; and computer useable program code for structuring the one or more labeled units in a format to facilitate one or more operations that use at least a portion of the labeled units. - View Dependent Claims (13, 14)
-
Specification