MULTI-LABEL CONTENT RECATEGORIZATION
First Claim
1. A computing apparatus, comprising:
- one or more logic elements, including at least one hardware logic element, comprising a classification engine to;
receive a clean multi-labeled dataset comprising a plurality of document each assigned to one or more of a plurality of categories;
receive an unclean multi-labeled dataset; and
produce a recategorized and cleansed dataset from the unclean multi-labeled dataset, comprising predicting a number of labels {circumflex over (l)} for a document j, and comparing {circumflex over (l)} to an existing number of labels l.
13 Assignments
0 Petitions
Accused Products
Abstract
In an example, there is disclosed a computing apparatus, including one or more logic elements, including at least one hardware logic element, comprising a classification engine to: receive a clean multi-labeled dataset comprising a plurality of document each assigned to one or more of a plurality of categories; receive an unclean multi-labeled dataset; and produce a recategorized and cleansed dataset from the unclean multi-labeled dataset, comprising predicting a number of labels {circumflex over (l)} for a document j, and comparing {circumflex over (l)} to an existing number of labels l. There is also disclosed a method of providing a classification engine.
19 Citations
25 Claims
-
1. A computing apparatus, comprising:
one or more logic elements, including at least one hardware logic element, comprising a classification engine to; receive a clean multi-labeled dataset comprising a plurality of document each assigned to one or more of a plurality of categories; receive an unclean multi-labeled dataset; and produce a recategorized and cleansed dataset from the unclean multi-labeled dataset, comprising predicting a number of labels {circumflex over (l)} for a document j, and comparing {circumflex over (l)} to an existing number of labels l. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
13. One or more tangible, non-transitory computer-readable mediums having stored thereon executable instructions for providing a classification engine to:
-
receive a clean multi-labeled dataset comprising a plurality of document each assigned to one or more of a plurality of categories; receive an unclean multi-labeled dataset; and produce a recategorized and cleansed dataset from the unclean multi-labeled dataset, comprising predicting a number of labels {circumflex over (l)} for a document j, and comparing {circumflex over (l)} to an existing number of labels l. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A computer-implemented method of providing multi-label content recategorization, comprising:
-
receiving a clean multi-labeled dataset comprising a plurality of document each assigned to one or more of a plurality of categories; receiving an unclean multi-labeled dataset; and producing a recategorized and cleansed dataset from the unclean multi-labeled dataset, comprising predicting a number of labels {circumflex over (l)} for a document j, and comparing {circumflex over (l)} to an existing number of labels l. - View Dependent Claims (24, 25)
-
Specification