×

Ontology mapper

  • US 8,856,156 B1
  • Filed: 10/05/2012
  • Issued: 10/07/2014
  • Est. Priority Date: 10/07/2011
  • Status: Active Grant
First Claim
Patent Images

1. A non-transitory Computer-readable media having computer-executable instructions embodied thereon that when executed provide a method for facilitating decision support by determining nomenclature linkages between variables in databases that have different ontologies, the method comprising:

  • identifying a first set of documents from a first record system having a first ontology;

    identifying a second set of documents from a second record system having a second ontology that is different than the first ontology;

    determining a use-case present in the first and second sets of documents;

    determining a set of variables relevant to the use-case;

    receiving from the first set of documents, a first document containing at least one first-document variable from the set of variables;

    wherein each first-document variable has a first-document value associated with it;

    receiving from the second set of documents, a second document containing at least one second-document variable from the set of variables;

    (1) wherein the second-document variable has a second-document value associated with it, and(2) wherein the second-document variable is also contained in the first document;

    based on the determined use-case and set of variables, generating a decision-tree classifier;

    for each first-document variable contained in the first document, applying the decision tree classifier to transform the first-document value associated with the first-document variable to a categorical datatype;

    for each second-document variable contained in the second document, applying the decision tree classifier to transform the second-document value associated with the second-document variable to a categorical datatype;

    based on the categorical datatypes of the first document and the categorical datatypes of the second document, generating a set of textmatrices;

    applying latent semantic analysis to the set of textmatrices to determine a latent semantic space associated with the at least one first-document variable and the at least one second document variable;

    specifying a threshold of similarity;

    for a first comparison-variable, from the at least one first-document variables associated with the latent semantic space;

    determining a measure of similarity to a second-comparison variable from the at least one second-document variables associated with the latent semantic space;

    performing a comparison of the measure similarity to the threshold; and

    based on the comparison, determining that the measure similarity satisfies the threshold, associating the first comparison variable with the second comparison variable, and designating the association as a synonymy, wherein the threshold is satisfied if the measure of similarity is greater than the threshold.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×