SYSTEMS AND METHODS FOR DISTRIBUTED DATA ANNOTATION
First Claim
1. A distributed data annotation server system, comprising:
- at least one storage device configured to store source data, one or more annotators, annotation tasks, and a distributed data annotation application; and
a processor;
wherein the distributed data annotation application configures the processor to;
receive source data, where the source data comprises one or more pieces of source data;
select one or more annotators for at least one piece of source data;
create one or more annotation tasks for the selected annotators and at least one piece of source data;
request one or more annotations for at least one piece of source data using the created annotation tasks;
receive annotations for at least one piece of source data;
determine source data metadata for at least one piece of source data using the received annotations, where the source data metadata includes source data characteristics;
generate annotator metadata for at least one annotator using the received annotations and at least one piece of source data, where the annotator metadata includes identified annotator characteristics; and
estimate the ground truth for at least one piece of source data using the source data metadata and the annotator metadata.
5 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for distributed data annotation in accordance embodiments of the invention are disclosed. In one embodiment of the invention, a distributed data annotation server system includes a storage device configured to store source data, one or more annotators, annotation tasks and a processor, wherein a distributed data annotation application configures the processor to receive source data including one or more pieces of source data, select one or more annotators, create one or more annotation tasks for the selected annotators and source data, request one or more annotations for the source data using the annotation tasks, receive annotations, determine source data metadata for at least one piece of source data using the received annotations, generate annotator metadata for at least one annotator using the received annotations and the source data, and estimate the ground truth for the source data using the source data metadata and the annotator metadata.
25 Citations
33 Claims
-
1. A distributed data annotation server system, comprising:
-
at least one storage device configured to store source data, one or more annotators, annotation tasks, and a distributed data annotation application; and a processor; wherein the distributed data annotation application configures the processor to; receive source data, where the source data comprises one or more pieces of source data; select one or more annotators for at least one piece of source data; create one or more annotation tasks for the selected annotators and at least one piece of source data; request one or more annotations for at least one piece of source data using the created annotation tasks; receive annotations for at least one piece of source data; determine source data metadata for at least one piece of source data using the received annotations, where the source data metadata includes source data characteristics; generate annotator metadata for at least one annotator using the received annotations and at least one piece of source data, where the annotator metadata includes identified annotator characteristics; and estimate the ground truth for at least one piece of source data using the source data metadata and the annotator metadata. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A method for distributed data annotation, comprising:
-
receiving source data using a distributed data annotation server system, where the source data comprises one or more pieces of source data; selecting one or more annotators for at least one piece of source data using the distributed data annotation server system; creating one or more annotation tasks for the selected annotators and at least one piece of source data using the distributed data annotation server system; requesting one or more annotations for at least one piece of source data using the created annotation tasks and the distributed data annotation server system; receiving annotations for at least one piece of source data using the distributed data annotation server system; determining source data metadata for at least one piece of source data using the received annotations and the distributed data annotation server system, where the source data metadata includes source data characteristics; generating annotator metadata for at least one annotator using the received annotations, at least one piece of source data, and the distributed data annotation server system, where the annotator metadata includes identified annotator characteristics; and estimating the ground truth for at least one piece of source data using the source data metadata, the annotator metadata, and the distributed data annotation server system. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32)
-
-
33. A distributed data annotation server system, comprising:
-
at least one storage device configured to store source data, one or more annotators, annotation tasks, and a distributed data annotation application; and a processor; wherein the distributed data annotation application configures the processor to; receive source data, where the source data comprises one or more pieces of source data; select one or more annotators for at least one piece of source data; create one or more annotation tasks for the selected annotators and at least one piece of source data; request one or more annotations for at least one piece of source data using the created annotation tasks; receive annotations for at least one piece of source data; and perform an iterative a posteriori estimation of the ground truth for at least one piece of source data, the difficulty of describing the at least one piece of source data and the competence of the selected annotators that annotated the at least one piece of source data based upon the received annotations.
-
Specification