AUTOMATIC CONTEXT SENSITIVE LANGUAGE CORRECTION AND ENHANCEMENT USING AN INTERNET CORPUS
First Claim
Patent Images
1. A computer-assisted language correction system comprising:
- an alternatives generator, generating on the basis of an input sentence a text-based representation providing multiple alternatives for each of a plurality of words in the sentence;
a selector for selecting among at least said multiple alternatives for each of said plurality of words in the sentence, based at least partly on an internet corpus; and
a correction generator operative to provide a correction output based on selections made by said selector.
1 Assignment
0 Petitions
Accused Products
Abstract
A computer-assisted language correction system including spelling correction functionality, misused word correction functionality, grammar correction functionality and vocabulary enhancement functionality utilizing contextual feature-sequence functionality employing an internet corpus.
280 Citations
66 Claims
-
1. A computer-assisted language correction system comprising:
-
an alternatives generator, generating on the basis of an input sentence a text-based representation providing multiple alternatives for each of a plurality of words in the sentence; a selector for selecting among at least said multiple alternatives for each of said plurality of words in the sentence, based at least partly on an internet corpus; and a correction generator operative to provide a correction output based on selections made by said selector. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer-assisted language correction system comprising:
-
at least one of; spelling correction functionality; misused word correction functionality; grammar correction functionality; and vocabulary enhancement functionality; and contextual feature-sequence functionality cooperating with at least one of said spelling correction functionality;
said misused word correction functionality, grammar correction functionality and said vocabulary enhancement functionality and employing an internet corpus. - View Dependent Claims (12, 13, 14, 15, 16, 17)
-
-
18. A computer-assisted language correction system comprising:
-
an alternatives generator, generating on the basis of a language input a text-based representation providing multiple alternatives for each of a plurality of words in the sentence; a selector for selecting among at least said multiple alternatives for each of said plurality of words in the language input, based at least partly on a relationship between selected ones of said multiple alternatives for at least some of said plurality of words in said language input; and a correction generator operative to provide a correction output based on selections made by said selector. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26, 27)
-
-
28. A computer-assisted language correction system comprising:
-
a misused-word suspector evaluating at least most of the words in an language input on the basis of their fit within a context of the language input; and a correction generator operative to provide a correction output based at least partially on an evaluation performed by said suspector. - View Dependent Claims (29, 30, 31, 32)
-
-
33. A computer-assisted language correction system comprising:
-
a misused-word suspector evaluating words in an language input; an alternatives generator, generating multiple alternatives for at least some of the words in the language input evaluated as suspect words by said suspector, at least one of said multiple alternatives for a word in the language input being consistent with a contextual feature of said word in the language input in an internet corpus; a selector for selecting among at least said multiple alternatives; and a correction generator operative to provide a correction output based at least partially on a selection made by said selector. - View Dependent Claims (39, 40, 41, 42, 43, 44, 45)
-
-
34. A computer-assisted language correction system comprising:
-
a misused-word suspector evaluating words in an language input and identifying suspect words; an alternatives generator, generating multiple alternatives for said suspect words; a selector, grading each said suspect word as well as ones of said multiple alternatives therefor generated by said alternatives generator according to multiple selection criteria, and applying a bias in favor of said suspect word vis-à
-vis ones of said multiple alternatives therefor generated by said alternatives generator; anda correction generator operative to provide a correction output based at least partially on a selection made by said selector.
-
-
35. A computer-assisted language correction system comprising:
-
an alternatives generator, generating on the basis of an input multiple alternatives for at least one word in the input; a selector, grading each said at least one word as well as ones of said multiple alternatives therefor generated by said alternatives generator according to multiple selection criteria, and applying a bias in favor of said at least one word vis-à
-vis ones of said multiple alternatives therefor generated by said alternatives generator, said bias being a function of an input uncertainty metric indicating uncertainty of a person providing said input; anda correction generator operative to provide a correction output based on a selection made by said selector.
-
-
36. A computer-assisted language correction system comprising:
-
an incorrect word suspector evaluating at least most of the words in a language input, said suspector being at least partially responsive to an input uncertainty metric indicating uncertainty of a person providing said input, said suspector providing a suspected incorrect word output; and an alternatives generator, generating a plurality of alternatives for suspected incorrect words identified by said suspected incorrect word output; a selector for selecting among each suspected incorrect word and the plurality of alternatives generated by said alternatives generator; and a correction generator operative to provide a correction output based on a selection made by said selector.
-
-
37. A computer-assisted language correction system comprising:
-
at least one of a spelling correction module, a misused-word correction module, a grammar correction module and a vocabulary enhancement module receiving a multi-word input and providing a correction output, each of said at least one of a spelling correction module, a misused-word correction module, a grammar correction module and a vocabulary enhancement module including; an alternative word candidate generator including; phonetic similarity functionality operative to propose alternative words based on phonetic similarity to a word in said input and to indicate a metric of phonetic similarity; and character string similarity functionality operative to propose alternative words based on character string similarity to a word in said input and to indicate a metric of character string similarity for each alternative word; and a selector operative to select either a word in said output or an alternative word candidate proposed by said alternative word candidate generator by employing said phonetic similarity and character string similarity metrics together with context-based selection functionality.
-
-
38. A computer-assisted language correction system comprising:
-
suspect word identification functionality, receiving a multi-word language input and providing a suspect word output which indicates suspect words; feature identification functionality operative to identify features including said suspect words; an alternative selector identifying alternatives to said suspect words; feature occurrence functionality employing a corpus and providing an occurrence output, ranking various features including said alternatives as to their frequency of use in the corpus; and a selector employing said occurrence output to provide a correction output, said feature identification functionality comprising feature filtration functionality including at least one of; functionality for eliminating features containing suspected errors; functionality for negatively biasing features which contain words introduced in an earlier correction iteration of said multi-word input and which have a confidence level below a confidence level predetermined threshold; and functionality for eliminating features which are contained in another feature having an frequency of occurrence above a predetermined frequency threshold.
-
-
46. A computer-assisted language correction system comprising:
-
suspect word identification functionality, receiving a multi-word language input and providing a suspect word output which indicates suspect words; feature identification functionality operative to identify features including said suspect words; an alternative selector identifying alternatives to said suspect words; occurrence functionality employing a corpus and providing an occurrence output, ranking features including said alternatives as to their frequency of use in the corpus; and a correction output generator, employing said occurrence output to provide a correction output, said feature identification functionality comprising; at least one of; N-gram identification functionality; and co-occurrence identification functionality; and at least one of; skip-gram identification functionality; switch-gram identification functionality; and previously used by user feature identification functionality.
-
-
47. A computer-assisted language correction system comprising:
-
a grammatical error suspector evaluating at least most of the words in an language input on the basis of their fit within a context of the language input; and a correction generator operative to provide a correction output based at least partially on an evaluation performed by said suspector. - View Dependent Claims (48, 49, 50)
-
-
51. A computer-assisted language correction system comprising:
-
a grammatical error suspector evaluating words in a language input; an alternatives generator, generating multiple alternatives for at least some of the words in the language input evaluated as suspect words by said suspector, at least one of said multiple alternatives for a word in the language input being consistent with a contextual feature of said word in the language input; a selector for selecting among at least said multiple alternatives; and a correction generator operative to provide a correction output based at least partially on a selection made by said selector. - View Dependent Claims (53)
-
-
52. A computer-assisted language correction system comprising:
-
a grammatical error suspector evaluating words in an language input and identifying suspect words; an alternatives generator, generating multiple alternatives for said suspect words; a selector, grading each said suspect word as well as ones of said multiple alternatives therefor generated by said alternatives generator according to multiple selection criteria, and applying a bias in favor of said suspect word vis-à
-vis ones of said multiple alternatives therefor generated by said alternatives generator; anda correction generator operative to provide a correction output based at least partially on a selection made by said selector.
-
- 54. A computer-assisted language correction system comprising context based scoring of various alternative corrections, based at least partially on contextual feature-sequence (CFS) frequencies of occurrences in an internet corpus.
-
59. A computer-assisted language correction system comprising vocabulary enhancement functionality including:
-
vocabulary-challenged words identification functionality; alternative vocabulary enhancements generation functionality; and context based scoring functionality, based at least partially on contextual feature-sequence (CFS) frequencies of occurrences in an internet corpus, said alternative vocabulary enhancements generation functionality including thesaurus pre-processing functionality operative to generate alternative vocabulary enhancements.
-
-
60. A computer-assisted language correction system comprising:
-
an alternatives generator, generating on the basis of an input sentence a text-based representation providing multiple alternatives for each of a plurality of words in the sentence; a selector for selecting among at least said multiple alternatives for each of said plurality of words in the sentence; a confidence level assigner operative to assign a confidence level to a selected alternative from said multiple alternatives; and a correction generator operative to provide a correction output based on selections made by said selector and at least partially on said confidence level. - View Dependent Claims (61)
-
-
62. A computer-assisted language correction system comprising:
-
a punctuation error suspector evaluating at least some of the words and punctuation in a language input on the basis of their fit within a context of the language input based on frequency of occurrence of feature-grams of said language input in an internet corpus; and a correction generator operative to provide a correction output based at least partially on an evaluation performed by said suspector. - View Dependent Claims (63)
-
-
64. A computer-assisted language correction system comprising:
-
a grammatical element error suspector evaluating at least some of the words in a language input on the basis of their fit within a context of the language input based on frequency of occurrence of feature-grams of said language input in an internet corpus; and a correction generator operative to provide a correction output based at least partially on an evaluation performed by said suspector. - View Dependent Claims (65, 66)
-
Specification