GENERAL PURPOSE CORRECTION OF GRAMMATICAL AND WORD USAGE ERRORS
First Claim
1. A computer-implemented error correction system, comprising:
- an error detection component that collects scores based on word usage in a sequence of words intended to form a fluent language expression, generates a signal for presence of an error in a subsequence of words of the sequence based on the scores, and detects an erroneous subsequence of words in the sequence;
a potential candidate generation component that generates potential candidate strings of words related to the erroneous subsequence of words from a corpus;
a candidate selection component that selects and presents one or more of the potential candidate strings as candidate strings for correction of language fluency introduced by the erroneous subsequence of words; and
a processor that executes computer-executable instructions associated with at least one of the error detection component, potential candidate generation component, or candidate selection component.
2 Assignments
0 Petitions
Accused Products
Abstract
Architecture that detects and corrects writing errors in a human language based on the utilization of three different stages: error detection, correction candidate generation, and correction candidate ranking. The architecture is a generic framework for generating fluent alternatives to non-grammatical word sequences in a written sample. Error detection is addressed by a suite of language model related scores and other scores such as parse scores that can identify a particularly unlikely sequence of words. Correction candidate generation is addressed by a lookup in a very large corpus of “correct” English that looks for alternative arrangements of the same or similar words or subsequences of these words in the same context. Correction candidate ranking is addressed by a language model ranker.
64 Citations
20 Claims
-
1. A computer-implemented error correction system, comprising:
-
an error detection component that collects scores based on word usage in a sequence of words intended to form a fluent language expression, generates a signal for presence of an error in a subsequence of words of the sequence based on the scores, and detects an erroneous subsequence of words in the sequence; a potential candidate generation component that generates potential candidate strings of words related to the erroneous subsequence of words from a corpus; a candidate selection component that selects and presents one or more of the potential candidate strings as candidate strings for correction of language fluency introduced by the erroneous subsequence of words; and a processor that executes computer-executable instructions associated with at least one of the error detection component, potential candidate generation component, or candidate selection component. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computer-implemented error correction method, comprising acts of:
-
receiving a sequence of words intended to form a fluent language expression; detecting an erroneous subsequence of words in the sequence defined by a sliding window over word subsequences; generating potential candidate strings of words related to the erroneous subsequence of words from a corpus of alternative arrangements of same or similar words, or subsequences of words in a same context; selecting candidate strings from one or more of the potential candidate strings for correction of dysfluency of the language expression by the erroneous subsequence of words and presenting potential corrections to a user; and utilizing a processor that executes instructions stored in memory to perform at least one of the acts of detecting, generating, or selecting. - View Dependent Claims (11, 12, 13, 14, 15)
-
-
16. A computer-implemented error correction method, comprising acts of:
-
receiving a sequence of words intended to form a fluent language expression; collecting scores from one or more language models based on subsequences of words in the sequence defined by a sliding window; detecting an erroneous subsequence of words based on a signal generated in accordance with presence of the erroneous subsequence of words in the sequence; generating potential candidate strings of words related to the erroneous subsequence of words from a corpus of alternative arrangements of same or similar words or subsequences of words, based on context-related criteria; selecting candidate strings from the one or more of the potential candidate strings for correction of dysfluency of the language expression by the erroneous subsequence of words and presenting the candidate strings to a user as corrections to the dysfluency; and utilizing a processor that executes instructions stored in memory to perform at least one of the acts of collecting, detecting, generating, or selecting. - View Dependent Claims (17, 18, 19, 20)
-
Specification