×

Clinically relevant medical concept clustering

  • US 10,839,947 B2
  • Filed: 01/06/2016
  • Issued: 11/17/2020
  • Est. Priority Date: 01/06/2016
  • Status: Active Grant
First Claim
Patent Images

1. A computer program product for clustering concepts extracted from electronic content, the computer program product comprising one or more non-transitory computer readable storage media collectively having program instructions embodied therewith, the program instructions executable by a processor to cause the processor to:

  • identify, within a plurality of different taxonomies, relations between taxonomy categories of the different taxonomies and the concepts extracted from the electronic content, wherein the electronic content is from a medical record, the concepts include medical concepts extracted from the medical record, and the plurality of different taxonomies includes medical taxonomies, and wherein the relations represent semantic similarities for the concepts and the identifying relations further includes;

    mapping the concepts to each of the plurality of different taxonomies, wherein mapping the concepts includes;

    determining a first concept extracted from the electronic content not found in a selected taxonomy of the plurality of different taxonomies;

    identifying one or more other taxonomies of the plurality of different taxonomies containing the first concept and determining a second concept that resides in the selected taxonomy and the identified one or more other taxonomies; and

    mapping the first concept to the second concept within the selected taxonomy when the second concept is closest to the first concept in the identified one or more other taxonomies and within a distance limit of the first concept, wherein the first concept remains unmapped to the selected taxonomy in response to the second concept not satisfying the distance limit;

    generating concept vectors relating each of the concepts to one or more corresponding taxonomy categories of the different taxonomies, wherein each concept vector is associated with a concept and includes a plurality of values with each value indicating a relationship between that associated concept and a corresponding taxonomy category, and wherein at least one concept has relations to taxonomy categories in two or more taxonomies; and

    determining a similarity measure between each of the concept vectors of the concepts based on distances between the concept vectors;

    cluster the concepts based on the determined similarity measure between the concept vectors; and

    generate a visualization of the electronic content with information arranged according to the clustered concepts to identify information within the electronic content relevant to a situation.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×