×

Generating a domain ontology using word embeddings

  • US 10,248,718 B2
  • Filed: 06/22/2016
  • Issued: 04/02/2019
  • Est. Priority Date: 07/04/2015
  • Status: Active Grant
First Claim
Patent Images

1. A device, comprising:

  • one or more processors to;

    generate a set of distributed word vectors from a list of terms determined from a text using a vector model associated with generating the set of distributed word vectors,the set of distributed word vectors representing a plurality of real numbers for each term in the list of terms;

    determine a quantity of term clusters, to be generated to form an ontology of terms in the text, based on the set of distributed word vectors and using a statistical technique;

    generate term clusters, representing concepts of the ontology of terms, based on the quantity of term clusters and using a recursive divisive clustering technique;

    perform a frequency analysis for terms included in the ontology of terms;

    determine non-hierarchical relationships or attributes for relationships between the terms included in the ontology of terms based on the frequency analysis; and

    output the term clusters, and data identifying the non-hierarchical relationships or attributes for relationships, to permit another device to analyze a set of documents using the term clusters.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×