Automated evaluation of overly repetitive word use in an essay
First Claim
Patent Images
1. A method for automatically evaluating an essay to detect at least one writing style error, comprising:
- electronically receiving an essay on a computer system;
assigning a feature value for each of one or more features for one or more text segments in the essay, wherein the feature values are automatically calculated by the computer system;
storing the feature values for the one or more text segments on a data storage device accessible by the computer system;
comparing the feature values for each one or more text segments with a model configured to identify at least one writing style error, wherein the model is based on at least one human evaluated essay; and
using the results of the comparison to the model to identify writing style errors in the essay.
2 Assignments
0 Petitions
Accused Products
Abstract
To automatically evaluate an essay for overly repetitive word usage, a word is identified in the essay and at least one feature associated with the word is determined. In addition, a probability of the word being used in an overly repetitive manner is determined by mapping the feature to a model. The model having been generated by a machine learning application based on at least one evaluated essay. Furthermore, the essay is annotated to indicate the word is used in an overly repetitive manner in response to the probability exceeding a threshold probability.
-
Citations
54 Claims
-
1. A method for automatically evaluating an essay to detect at least one writing style error, comprising:
-
electronically receiving an essay on a computer system;
assigning a feature value for each of one or more features for one or more text segments in the essay, wherein the feature values are automatically calculated by the computer system;
storing the feature values for the one or more text segments on a data storage device accessible by the computer system;
comparing the feature values for each one or more text segments with a model configured to identify at least one writing style error, wherein the model is based on at least one human evaluated essay; and
using the results of the comparison to the model to identify writing style errors in the essay. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A system for automatically evaluating an essay to detect at least one writing style error, comprising:
-
a computer system configured to electronically receive an essay;
a feature extractor configured to assign a feature value for each of one or more features for one or more text segments in the essay;
a data storage device, connected to the computer system, configured to store the feature values for the one or more text segments;
a feature analyzer configured to evaluate the essay for at least one writing style error by comparing the feature values for each one or more text segments with a model; and
a display for presenting the evaluated essay. - View Dependent Claims (17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30)
-
-
31. A method for generating a model for determining overly repetitive text segment use, comprising:
-
electronically receiving training data on a computer system wherein the training data comprises an essay annotated to identify one or more text segments used in an overly repetitive manner;
assigning a feature value for each of one or more features for each text segment in the essay, wherein the feature values are automatically calculated by the computer system;
assigning an indicator value for each text segment in the essay, wherein the indicator value is set at a first value and if the text segment has been used in an overly repetitive manner;
storing the feature values and the indicator value for each text segment in the essay in a data storage device accessible by the computer system; and
creating a model for overly repetitive use of the one or more text segments in the essay by identifying patterns in the feature values wherein the patterns are identified by a machine learning tool. - View Dependent Claims (32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43)
-
-
44. A system for generating a model useful in determining overly repetitive text segment use, comprising:
-
a computer system configured to receive training data, wherein the training data comprises an essay annotated to identify one or more text segments used in an overly repetitive manner;
a feature extractor configured to calculate a feature value for each of one or more features for each text segment in the essay and to assign an indicator value for each text segment in the annotated essay, wherein the indicator value indicates whether the text segment has been used in an overly repetitive manner;
a data storage device configured to store the feature values and the indicator value for each text segment in the essay;
a machine learning tool configured to analyze the features to identify patterns; and
a model builder to create a model for overly repetitive use of the text segments, wherein the model is constructed from the identified patterns. - View Dependent Claims (45, 46, 47, 48, 49, 50, 51, 52, 53, 54)
-
Specification