×

Consumer insights analysis using word embeddings

  • US 10,685,183 B1
  • Filed: 01/04/2018
  • Issued: 06/16/2020
  • Est. Priority Date: 01/04/2018
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • by a computing device in an online social network, receiving a request to generate a visualization of public sentiments regarding a particular subject by a plurality of clusters, wherein each cluster comprises a plurality of words semantically close to each other, wherein the request comprises one or more conditions characterizing a group of users, and wherein the request comprises an input n-gram representing the particular subject;

    by the computing device, identifying users of the online social network who satisfy the one or more conditions;

    by the computing device, constructing a first corpus of text by collecting text containing the input n-gram from a plurality of content objects in the online social network created by the identified users;

    by the computing device, identifying a list of unique n-grams appearing in the first corpus of text;

    by the computing device, generating, using a word embedding model, a table comprising unique n-grams in the list and their corresponding word vectors, wherein the word embedding model was trained using a second corpus of text collected from a plurality of user-created content objects in the online social network as training data, wherein each of the word vectors represents a semantic context of a corresponding n-gram as a point in a d-dimensional embedding space;

    by the computing device, classifying word vectors in the table into a plurality of clusters based on semantic similarities of the word vectors;

    by the computing device, calculating, for each of the plurality of clusters, a Term Frequency-Inverse Document Frequency (TF-IDF) score associated with the cluster;

    by the computing device, determining k most relevant clusters to the particular subject based on the calculated TF-IDF scores; and

    by the computing device, sending, as a response to the request, instructions to display n-grams associated with the determined k clusters in a two-dimensional display space, wherein n-grams corresponding to word vectors that belong to a cluster are displayed together.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×