Enhancing reading accuracy, efficiency and retention
First Claim
1. A computer-implemented method for extracting, from a machine-readable natural language text, nested structures of sentences with embedded clauses, the method comprising:
- accessing, by a computing system, computer readable memory storing one or more files containing machine-readable natural language text;
enriching, by the computing system, the machine-readable natural language text by adding word-specific attribute tags to the text, wherein each word-specific attribute tag comprises a computer readable code associated with a particular location within the machine-readable natural language text;
accessing, by the computing system, one or more databases containing domain-specific multi-word terms developed through a process of statistical frequency analysis of a plurality of texts in the domain;
identifying, by the computing system and within the one or more databases, a plurality of domain-specific multi-word binding tags that are applicable to the machine-readable natural language text;
adding the identified of domain-specific multi-word binding tags to the machine-readable natural language text;
adding, to the machine readable natural language text, clustering tags that identify groups of individual words as discrete multi-word clusters,adding, to the machine readable natural language text, phrase-bracketing tags, using recursive, context-specific rules that assemble words and multi-word clusters into phrases, based on the attribute tags and the multi-word binding tags;
adding, to the machine readable natural language text, clause-encapsulating tags, using context-specific rules that examine the phrase-bracketing tags for specific patterns of phrase-segments that meet clause pattern criteria;
adding, to the machine readable natural language text, envelope tags that denote open and closed states of phrases that contain other phrases and clauses, using context-specific rules that examine attributes of phrases, including open, closed and inter-phrase touching states of adjacent phrases, to allow or disallow absorption of phrases by other phrases and absorption of closed clauses by phrases, and to determine the closure of phrases when criteria for sets of inter-phrase attributes and sentence-concluding punctuation boundaries are met;
storing, by the computing system and on one or more computer readable memory devices, the machine-readable natural language text along with the word-specific attribute tags, the domain-specific multi-word binding tags, the clustering tags, the phrase-bracketing tags, the clause-encapsulating tags, and the envelope tags such that each of the word-specific attribute tags, the domain-specific multi-word binding tags, the clustering tags, the phrase-bracketing tags, the clause-encapsulating tags, and the envelope tags is associated with a particular portion of the machine-readable natural language text within the one or more computer readable memory devices; and
displaying, on a display screen of the computing system, the machine readable natural language text with indications of at least some of the word-specific attribute tags, domain-specific multi-word binding tags, clustering tags, phrase-bracketing tags, clause-encapsulating tags, or envelope tags.
1 Assignment
0 Petitions
Accused Products
Abstract
This document provides systems and methods for altering text presentation to increase reading accuracy, efficiency, and retention. This can include identification text specific attributes from machine readable text (through parsing of the text), varying the text presentation in accordance with the attributes, and creating an enhanced visual product for enhancing the reading experience. For example, a computer system can extract attributes such as parts of speech from an input sentence and display that sentence in cascading text segments down and across a display screen. The system can further use domain-specific dictionaries derived from domain-specific texts to identify domain-specific compound noun phrases and verb phrases that require specific linguistic tagging to be usable in other linguistic analysis steps.
48 Citations
15 Claims
-
1. A computer-implemented method for extracting, from a machine-readable natural language text, nested structures of sentences with embedded clauses, the method comprising:
-
accessing, by a computing system, computer readable memory storing one or more files containing machine-readable natural language text; enriching, by the computing system, the machine-readable natural language text by adding word-specific attribute tags to the text, wherein each word-specific attribute tag comprises a computer readable code associated with a particular location within the machine-readable natural language text; accessing, by the computing system, one or more databases containing domain-specific multi-word terms developed through a process of statistical frequency analysis of a plurality of texts in the domain; identifying, by the computing system and within the one or more databases, a plurality of domain-specific multi-word binding tags that are applicable to the machine-readable natural language text; adding the identified of domain-specific multi-word binding tags to the machine-readable natural language text; adding, to the machine readable natural language text, clustering tags that identify groups of individual words as discrete multi-word clusters, adding, to the machine readable natural language text, phrase-bracketing tags, using recursive, context-specific rules that assemble words and multi-word clusters into phrases, based on the attribute tags and the multi-word binding tags; adding, to the machine readable natural language text, clause-encapsulating tags, using context-specific rules that examine the phrase-bracketing tags for specific patterns of phrase-segments that meet clause pattern criteria; adding, to the machine readable natural language text, envelope tags that denote open and closed states of phrases that contain other phrases and clauses, using context-specific rules that examine attributes of phrases, including open, closed and inter-phrase touching states of adjacent phrases, to allow or disallow absorption of phrases by other phrases and absorption of closed clauses by phrases, and to determine the closure of phrases when criteria for sets of inter-phrase attributes and sentence-concluding punctuation boundaries are met; storing, by the computing system and on one or more computer readable memory devices, the machine-readable natural language text along with the word-specific attribute tags, the domain-specific multi-word binding tags, the clustering tags, the phrase-bracketing tags, the clause-encapsulating tags, and the envelope tags such that each of the word-specific attribute tags, the domain-specific multi-word binding tags, the clustering tags, the phrase-bracketing tags, the clause-encapsulating tags, and the envelope tags is associated with a particular portion of the machine-readable natural language text within the one or more computer readable memory devices; and displaying, on a display screen of the computing system, the machine readable natural language text with indications of at least some of the word-specific attribute tags, domain-specific multi-word binding tags, clustering tags, phrase-bracketing tags, clause-encapsulating tags, or envelope tags. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
Specification