SYSTEMS AND METHODS FOR TRAINING AND CLASSIFYING DATA
Abstract
A mechanism for training and classifying data is disclosed. A method includes receiving a data set having at least a first annotation and at least a second annotation, where the first annotation and the second annotation represent characteristics within the data set. The method also includes determining a first identifier from the first annotation and a second identifier from the second annotation, and associating the first identifier to the second identifier to generate a joined identifier. The method also includes computing feature weights and transition weights for the annotated data set based on at least the first identifier, at least the second identifier, and at least the joined identifier, and on transitions between each of the first, the second, and the joined identifiers. The method further includes receiving a second, un-annotated data set and classifying the second data set based on the computed feature weights and the transition weights.
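The training steps summarized in the abstract can be illustrated with a minimal sketch. The abstract does not say how the weights are estimated, so this sketch substitutes simple log relative frequencies as a stand-in. The sample sentences, the two annotation types (a part-of-speech tag and an entity tag), the `+` separator, and the function names `identifiers`, `log_normalize`, and `train` are all assumptions made for illustration.

```python
from collections import Counter, defaultdict
import math

# Hypothetical annotated data set: (token, first annotation, second annotation).
# The annotation types used here are assumptions, not taken from the patent.
ANNOTATED = [
    [("Alice", "NOUN", "PER"), ("visited", "VERB", "O"), ("Paris", "NOUN", "LOC")],
    [("Bob", "NOUN", "PER"), ("left", "VERB", "O"), ("Paris", "NOUN", "LOC")],
]

def identifiers(first_annotation, second_annotation):
    """Return the first, second, and joined identifiers for one token."""
    joined = f"{first_annotation}+{second_annotation}"
    return first_annotation, second_annotation, joined

def log_normalize(counts):
    """Convert nested counts into log relative frequencies (stand-in weights)."""
    weights = {}
    for key, inner in counts.items():
        total = sum(inner.values())
        weights[key] = {k: math.log(v / total) for k, v in inner.items()}
    return weights

def train(sequences):
    """Compute feature weights and transition weights from annotated sequences."""
    feature_counts = defaultdict(Counter)     # identifier -> token counts
    transition_counts = defaultdict(Counter)  # identifier -> next-identifier counts
    for sequence in sequences:
        previous = None
        for token, first, second in sequence:
            current = identifiers(first, second)
            for ident in current:
                feature_counts[ident][token] += 1
            if previous is not None:
                # Count transitions between each of the first, second,
                # and joined identifiers of adjacent tokens.
                for src in previous:
                    for dst in current:
                        transition_counts[src][dst] += 1
            previous = current
    return log_normalize(feature_counts), log_normalize(transition_counts)

feature_weights, transition_weights = train(ANNOTATED)
```

In this sketch the joined identifier `NOUN+LOC` is scored separately from `NOUN` and `LOC`, so a weight can capture how the two annotations behave together rather than only individually.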
16 Claims
1. A method comprising:
receiving, by a processing device, a data set, wherein the data set is annotated with at least a first annotation and at least a second annotation, wherein at least the first annotation and the second annotation represent characteristics within the data set;
determining, by the processing device, a first identifier from the first annotation and a second identifier from the second annotation;
associating, by the processing device, the first identifier to the second identifier to generate a joined identifier;
computing, by the processing device, feature weights and transition weights for the annotated data set based on at least the first identifier, at least the second identifier, and at least the joined identifier and transitions between each of the first, the second and the joined identifiers;
receiving, by the processing device, a second data set, wherein the second data set is un-annotated; and
classifying, by the processing device, the second data set based on the computed feature weights and the transition weights.
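The final classifying step of claim 1 could be sketched as follows. The claim does not name a decoding algorithm, so this sketch uses Viterbi decoding over joined identifiers as one possible choice. The weight tables, the `UNSEEN` penalty, and the function names `transition` and `classify` are hypothetical values and names chosen for illustration.

```python
import math

# Hypothetical weights of the kind a training step might produce.
# Keys are joined identifiers of the assumed form "first+second".
FEATURE_WEIGHTS = {
    "NOUN+PER": {"Alice": math.log(0.5), "Bob": math.log(0.5)},
    "VERB+O": {"visited": math.log(0.5), "left": math.log(0.5)},
    "NOUN+LOC": {"Paris": 0.0},
}
TRANSITION_WEIGHTS = {
    "NOUN+PER": {"VERB+O": 0.0},
    "VERB+O": {"NOUN+LOC": 0.0},
}
UNSEEN = -20.0  # assumed penalty for a token or transition with no weight

def transition(src, dst):
    """Look up a transition weight, falling back to the UNSEEN penalty."""
    return TRANSITION_WEIGHTS.get(src, {}).get(dst, UNSEEN)

def classify(tokens):
    """Viterbi decoding: pick the best-scoring joined identifier per token."""
    states = list(FEATURE_WEIGHTS)
    score = {s: FEATURE_WEIGHTS[s].get(tokens[0], UNSEEN) for s in states}
    backpointers = []
    for token in tokens[1:]:
        new_score, pointers = {}, {}
        for s in states:
            best_prev = max(states, key=lambda p: score[p] + transition(p, s))
            new_score[s] = (score[best_prev] + transition(best_prev, s)
                            + FEATURE_WEIGHTS[s].get(token, UNSEEN))
            pointers[s] = best_prev
        backpointers.append(pointers)
        score = new_score
    # Trace the best path backwards from the best final state.
    path = [max(states, key=score.get)]
    for pointers in reversed(backpointers):
        path.append(pointers[path[-1]])
    path.reverse()
    # Split each joined identifier back into its first and second parts.
    return [tuple(joined.split("+")) for joined in path]

labels = classify(["Bob", "visited", "Paris"])
```

Decoding over joined identifiers yields both annotations for each token of the un-annotated input in a single pass.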
6. A system comprising:
a memory;
a processing device coupled to the memory, the processing device configured to:
receive a data set, wherein the data set is annotated with at least a first annotation and at least a second annotation, wherein at least the first annotation and the second annotation represent characteristics within the data set;
determine a first identifier from the first annotation and a second identifier from the second annotation;
associate the first identifier to the second identifier to generate a joined identifier;
compute feature weights and transition weights for the annotated data set based on at least the first identifier, at least the second identifier, and at least the joined identifier and transitions between each of the first, the second and the joined identifiers;
receive a second data set, wherein the second data set is un-annotated; and
classify the second data set based on the computed feature weights and the transition weights.
11. A non-transitory machine-readable storage medium including data that, when accessed by a machine, cause the machine to perform a method comprising:
receiving, by a processing device, a data set, wherein the data set is annotated with at least a first annotation and at least a second annotation, wherein at least the first annotation and the second annotation represent characteristics within the data set;
determining, by the processing device, a first identifier from the first annotation and a second identifier from the second annotation;
associating, by the processing device, the first identifier to the second identifier to generate a joined identifier;
computing, by the processing device, feature weights and transition weights for the annotated data set based on at least the first identifier, at least the second identifier, and at least the joined identifier and transitions between each of the first, the second and the joined identifiers;
receiving, by the processing device, a second data set, wherein the second data set is un-annotated; and
classifying, by the processing device, the second data set based on the computed feature weights and the transition weights.
Specification