
JOINT APPROACH TO FEATURE AND DOCUMENT LABELING

  • US 20160203209A1
  • Filed: 01/12/2015
  • Published: 07/14/2016
  • Est. Priority Date: 01/12/2015
  • Status: Active Grant
First Claim

1. A document labeling system comprising:

  • an electronic data processing device configured to label documents comprising text of a set of documents by operations including:

    (i) receiving L labeled topics each labeled with a word list comprising words representative of the labeled topic;

    (ii) performing probabilistic classification of the documents of the set of documents to generate for each labeled topic of the L labeled topics a document vector whose elements store scores of the documents for the labeled topic and a word vector whose elements store scores of words of a vocabulary for the labeled topic; and

    (iii) performing non-negative matrix factorization (NMF) to generate a NMF model that clusters the set of documents into k topics where k > L and the performing NMF includes initializing NMF factors representing L topics of the k topics to the document and word vectors for the L labeled topics generated in the operation (ii).
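
Operations (i) through (iii) can be illustrated with a minimal NumPy sketch. It is not the patented implementation, and the claim does not name a specific classifier or NMF solver. Several parts are assumptions made for illustration: the toy vocabulary and document-term matrix `X`, the helper name `seeded_nmf`, a simple seed-word scoring step standing in for the probabilistic classification of operation (ii), and Lee-Seung multiplicative updates as the NMF solver. The part that follows the claim is the initialization, where the first L columns of `W` and the first L rows of `H` are set to the document and word vectors of the labeled topics, with k > L.

```python
import numpy as np

# Toy corpus (assumption): rows are documents, columns are vocabulary counts.
vocab = ["goal", "match", "team", "vote", "election", "party", "recipe", "oven"]
X = np.array([
    [3, 2, 1, 0, 0, 0, 0, 0],
    [2, 3, 2, 0, 0, 0, 0, 0],
    [0, 0, 0, 3, 2, 1, 0, 0],
    [0, 0, 1, 2, 3, 2, 0, 0],
    [0, 0, 0, 0, 0, 0, 3, 2],
    [0, 0, 0, 0, 0, 1, 2, 3],
], dtype=float)

# Operation (i): L labeled topics, each given as a word list.
labeled_topics = {"sports": ["goal", "match"], "politics": ["vote", "election"]}
L = len(labeled_topics)

# Operation (ii), stand-in only: smoothed seed-word indicators as word vectors,
# and documents scored by their overlap with each topic's word vector.
word_vecs = np.full((L, len(vocab)), 0.01)
for t, words in enumerate(labeled_topics.values()):
    for w in words:
        word_vecs[t, vocab.index(w)] = 1.0
word_vecs /= word_vecs.sum(axis=1, keepdims=True)  # each row sums to 1
doc_vecs = X @ word_vecs.T                         # shape (n_docs, L)


def seeded_nmf(X, doc_vecs, word_vecs, k, n_iter=200, eps=1e-9, seed=0):
    """Factor X ~ W @ H into k topics, seeding the first L from labeled topics."""
    n_docs, n_words = X.shape
    n_labeled = doc_vecs.shape[1]
    if k <= n_labeled:
        raise ValueError("k must exceed the number of labeled topics L")
    rng = np.random.default_rng(seed)
    # Unlabeled topics start from random positive values.
    W = rng.random((n_docs, k)) + eps
    H = rng.random((k, n_words)) + eps
    # Operation (iii): initialize L of the k topics to the labeled-topic vectors.
    W[:, :n_labeled] = doc_vecs + eps
    H[:n_labeled, :] = word_vecs + eps
    errors = []
    for _ in range(n_iter):
        # Multiplicative updates for the Frobenius-norm objective.
        H *= (W.T @ X) / (W.T @ W @ H + eps)
        W *= (X @ H.T) / (W @ H @ H.T + eps)
        errors.append(np.linalg.norm(X - W @ H))
    return W, H, errors


k = 3  # k > L leaves one topic free to capture unlabeled content
W, H, errors = seeded_nmf(X, doc_vecs, word_vecs, k)
doc_topics = W.argmax(axis=1)  # hard cluster assignment per document
```

Because multiplicative updates keep exact zeros at zero, the sketch adds a small `eps` to the seeded factors so the labeled topics can still adapt to the data during factorization. Any topic beyond the first L has no label and is discovered by the factorization itself.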
