×

Enhancing reading accuracy, efficiency and retention

  • US 10,515,138 B2
  • Filed: 04/17/2015
  • Issued: 12/24/2019
  • Est. Priority Date: 04/25/2014
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method for extracting, from a machine-readable natural language text, nested structures of sentences with embedded clauses, the method comprising:

  • accessing, by a computing system, computer readable memory storing one or more files containing machine-readable natural language text;

    enriching, by the computing system, the machine-readable natural language text by adding word-specific attribute tags to the text, wherein each word-specific attribute tag comprises a computer readable code associated with a particular location within the machine-readable natural language text;

    accessing, by the computing system, one or more databases containing domain-specific multi-word terms developed through a process of statistical frequency analysis of a plurality of texts in the domain;

    identifying, by the computing system and within the one or more databases, a plurality of domain-specific multi-word binding tags that are applicable to the machine-readable natural language text;

    adding the identified of domain-specific multi-word binding tags to the machine-readable natural language text;

    adding, to the machine readable natural language text, clustering tags that identify groups of individual words as discrete multi-word clusters,adding, to the machine readable natural language text, phrase-bracketing tags, using recursive, context-specific rules that assemble words and multi-word clusters into phrases, based on the attribute tags and the multi-word binding tags;

    adding, to the machine readable natural language text, clause-encapsulating tags, using context-specific rules that examine the phrase-bracketing tags for specific patterns of phrase-segments that meet clause pattern criteria;

    adding, to the machine readable natural language text, envelope tags that denote open and closed states of phrases that contain other phrases and clauses, using context-specific rules that examine attributes of phrases, including open, closed and inter-phrase touching states of adjacent phrases, to allow or disallow absorption of phrases by other phrases and absorption of closed clauses by phrases, and to determine the closure of phrases when criteria for sets of inter-phrase attributes and sentence-concluding punctuation boundaries are met;

    storing, by the computing system and on one or more computer readable memory devices, the machine-readable natural language text along with the word-specific attribute tags, the domain-specific multi-word binding tags, the clustering tags, the phrase-bracketing tags, the clause-encapsulating tags, and the envelope tags such that each of the word-specific attribute tags, the domain-specific multi-word binding tags, the clustering tags, the phrase-bracketing tags, the clause-encapsulating tags, and the envelope tags is associated with a particular portion of the machine-readable natural language text within the one or more computer readable memory devices; and

    displaying, on a display screen of the computing system, the machine readable natural language text with indications of at least some of the word-specific attribute tags, domain-specific multi-word binding tags, clustering tags, phrase-bracketing tags, clause-encapsulating tags, or envelope tags.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×