Methods and apparatus for formatting text for clinical fact extraction

  • US 9,905,229 B2
  • Filed: 06/05/2012
  • Issued: 02/27/2018
  • Est. Priority Date: 02/18/2011
  • Status: Active Grant
First Claim

1. A method comprising:

  • receiving an original free-form text narrative regarding a patient encounter provided by a clinician;

    re-formatting the original free-form text narrative, using at least one processor, at least in part by adding, removing, and/or correcting sentence boundaries and/or section boundaries with respect to the original free-form text narrative to produce a formatted text including the added and/or corrected sentence boundaries and/or section boundaries, the re-formatting comprising

      applying at least one statistical model to the original free-form text narrative to generate, for a word or a sequence of words in the original free-form text narrative, a probability that the word or the sequence of words would be followed by a sentence boundary and/or a section boundary, wherein the at least one statistical model is trained at least in part with other free-form text narratives having correct sentence boundaries and/or section boundaries, and

      in response to determining that the probability satisfies one or more criteria, adding, removing, and/or correcting a sentence boundary and/or a section boundary following the word or the sequence of words, with respect to the original free-form text narrative;

    extracting one or more clinical facts from the formatted text, wherein a first fact of the one or more clinical facts is extracted from a first portion of the formatted text, wherein the first portion of the formatted text is a formatted version of a first portion of the original free-form text narrative, the extracting comprising

      analyzing the formatted text to identify a set of one or more features of at least the first portion of the formatted text,

      correlating the set of features to one or more abstract semantic concepts, and

      generating computer-readable data that expresses the one or more abstract semantic concepts as the one or more clinical facts extracted from the formatted text; and

    providing to a user an indicator that distinguishes the first portion of the original free-form text narrative that resulted in extraction of the first fact, from other portions of the original free-form text narrative that did not result in the extraction of the first fact.
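
For illustration only, the three steps of claim 1 can be sketched in Python. This is a minimal sketch under stated assumptions, not the patented implementation: the per-word relative-frequency model, the 0.5 threshold, the `CONCEPTS` lexicon, the `[[...]]` marker, and every function name below are hypothetical and do not come from the patent. The sketch handles sentence boundaries only and discards the original punctuation.

```python
import re
from collections import Counter

PUNCT = {".", "!", "?"}

def train_boundary_model(narratives):
    """Estimate, per word, how often a sentence boundary follows it
    in correctly punctuated training narratives (assumed model)."""
    followed, total = Counter(), Counter()
    for text in narratives:
        tokens = re.findall(r"[A-Za-z0-9']+|[.!?]", text)
        for i, tok in enumerate(tokens):
            if tok in PUNCT:
                continue
            word = tok.lower()
            total[word] += 1
            if i + 1 < len(tokens) and tokens[i + 1] in PUNCT:
                followed[word] += 1

    def prob(word):
        w = word.lower()
        return followed[w] / total[w] if total[w] else 0.0

    return prob

def reformat(original, prob, threshold=0.5):
    """Add a boundary after each word whose probability meets the
    threshold; keep each word's character span in the original."""
    pieces, spans = [], []
    for m in re.finditer(r"[A-Za-z0-9']+", original):
        piece = m.group()
        if prob(piece) >= threshold:
            piece += "."
        pieces.append(piece)
        spans.append(m.span())
    return " ".join(pieces), list(zip(pieces, spans))

# Hypothetical lexicon mapping a surface feature to an abstract concept.
CONCEPTS = {
    "pneumonia": "DISORDER:pneumonia",
    "aspirin": "MEDICATION:aspirin",
}

def extract_facts(aligned):
    """Correlate token features with concepts and emit structured facts
    that remember which span of the original text produced them."""
    facts = []
    for piece, span in aligned:
        feature = piece.rstrip(".").lower()
        if feature in CONCEPTS:
            facts.append({"concept": CONCEPTS[feature], "source_span": span})
    return facts

def highlight(original, span):
    """Mark the portion of the original text that yielded a fact."""
    start, end = span
    return original[:start] + "[[" + original[start:end] + "]]" + original[end:]

training = [
    "Patient denies chest pain. Started aspirin today.",
    "No fever today. Lungs clear.",
]
original = "patient has pneumonia today started aspirin"

prob = train_boundary_model(training)
formatted, aligned = reformat(original, prob)
facts = extract_facts(aligned)

print(formatted)
for fact in facts:
    print(fact["concept"], "<-", highlight(original, fact["source_span"]))
```

Carrying the original character span alongside each formatted token is one simple way to point back at the unformatted narrative after boundaries have changed; a real system would more plausibly condition on word sequences rather than single words, as the claim allows.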
