×

Methods and systems for analyzing discrete-valued datasets

  • US 10,394,898 B1
  • Filed: 09/15/2015
  • Issued: 08/27/2019
  • Est. Priority Date: 09/15/2014
  • Status: Active Grant
First Claim
Patent Images

1. A system for determining the structure of an electronic dataset, the system comprising:

  • one or more processors configured to perform the steps of;

    receiving a matrix with a first dimension corresponding to items, a second dimension corresponding to features, and discrete-valued elements indicating a presence, absence, or frequency of the features in the items;

    generating an engineered features set and a weights set for the matrix, the engineered features set and the weights set corresponding to latent structures in the matrix, generating the engineered features set and the weights set comprising;

    generating a first engineered feature and a first weights vector corresponding to a first latent structure in the matrix, generating the first engineered feature and the first weights vector comprising;

    updating the first engineered feature of the engineered features set using the matrix and the first weights vector of the weights set, andupdating the first weights vector of the weights set using a mutual information of the matrix and the first engineered feature; and

    generating a second engineered feature and a second weights vector corresponding to a second latent structure using a subset of the matrix associated with the first latent structure, generating the second engineered feature and the second weights vector comprising;

    determining the subset of the matrix using at least one of the first engineered feature and the first weights vector;

    updating the second engineered feature of the engineered features set using the subset and the second weights vector of the weights set; and

    updating the second weights vector of the weights set using a mutual information of the subset and the second engineered feature;

    receiving a request indicating at least one of the engineered features set;

    identifying items based on the matrix and the indicated at least one of the engineered features set; and

    providing a response based on the identified items.

View all claims
  • 0 Assignments
Timeline View
Assignment View
    ×
    ×