Joint approach to feature and document labeling

  • US 10,055,479 B2
  • Filed: 01/12/2015
  • Issued: 08/21/2018
  • Est. Priority Date: 01/12/2015
  • Status: Expired due to Fees
First Claim

1. A system operating on a set of documents comprising text, the system comprising:

  • a computer;

    a user interfacing device including a display and at least one user input device; and

    a non-transitory storage medium storing instructions programming the computer to perform operations including:

    (i) receiving, via the user interfacing device, L labeled topics each labeled with a word list comprising words representative of the labeled topic;

    (ii) performing probabilistic classification of the documents of the set of documents to generate for each labeled topic of the L labeled topics a document vector whose elements store scores of the documents for the labeled topic and a word vector whose elements store scores of words of a vocabulary for the labeled topic;

    (iii) performing non-negative matrix factorization (NMF) to generate a NMF model that clusters the set of documents into k topics where k > L and the performing NMF includes initializing NMF factors representing L topics of the k topics to the document and word vectors for the L labeled topics generated in the operation (ii); and

    performing data mining using the NMF model.
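
The seeded initialization in operation (iii) can be illustrated with a small sketch. This is not the patented implementation: it assumes a non-negative document-by-word matrix X, uses the standard multiplicative-update rule for the Frobenius objective (the claim does not name an NMF solver), and takes the document and word vectors of operation (ii) as given inputs rather than computing them with a classifier. The names `seeded_nmf`, `doc_vecs`, `word_vecs`, and `frob_error` are hypothetical, as is the toy data.

```python
import math
import random


def matmul(A, B):
    """Multiply two matrices stored as lists of rows."""
    cols = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols] for row in A]


def transpose(A):
    return [list(r) for r in zip(*A)]


def frob_error(X, W, H):
    """Frobenius norm of X - W*H."""
    WH = matmul(W, H)
    return math.sqrt(sum((x - y) ** 2
                         for rx, ry in zip(X, WH) for x, y in zip(rx, ry)))


def seeded_nmf(X, doc_vecs, word_vecs, k, iters=200, eps=1e-9, seed=0):
    """Factor X (docs x words) into W (docs x k) and H (k x words).

    The first L columns of W and the first L rows of H start from the
    per-topic document and word score vectors; the remaining k - L
    topics start from random positive values (an assumption here).
    """
    L = len(doc_vecs)
    if not k > L:
        raise ValueError("k must be greater than the number of labeled topics L")
    rng = random.Random(seed)
    n_docs, n_words = len(X), len(X[0])
    # eps keeps seeded zeros from being frozen by the multiplicative updates.
    W = [[(doc_vecs[t][d] if t < L else rng.random()) + eps for t in range(k)]
         for d in range(n_docs)]
    H = [[(word_vecs[t][w] if t < L else rng.random()) + eps for w in range(n_words)]
         for t in range(k)]
    for _ in range(iters):
        # H <- H * (W^T X) / (W^T W H)
        Wt = transpose(W)
        num, den = matmul(Wt, X), matmul(matmul(Wt, W), H)
        H = [[H[i][j] * num[i][j] / (den[i][j] + eps) for j in range(n_words)]
             for i in range(k)]
        # W <- W * (X H^T) / (W H H^T)
        Ht = transpose(H)
        num, den = matmul(X, Ht), matmul(W, matmul(H, Ht))
        W = [[W[i][j] * num[i][j] / (den[i][j] + eps) for j in range(k)]
             for i in range(n_docs)]
    return W, H


# Toy data (hypothetical): 4 documents, 4 vocabulary words, L = 1, k = 2.
X = [[3, 2, 0, 0], [2, 3, 0, 0], [0, 0, 4, 1], [0, 0, 1, 4]]
doc_vecs = [[1, 1, 0, 0]]   # scores of each document for the labeled topic
word_vecs = [[1, 1, 0, 0]]  # scores of each vocabulary word for the labeled topic
W, H = seeded_nmf(X, doc_vecs, word_vecs, k=2)
# A simple "data mining" use of the model: dominant topic per document.
dominant = [max(range(2), key=lambda t: row[t]) for row in W]
print(dominant)
```

Seeding only L of the k factors leaves the other k - L topics free to capture structure that the user-supplied word lists do not describe, which is one reading of why the claim requires k > L.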
