Automatic text skimming using lexical chains
First Claim
Patent Images
1. A method for filtering adjectives from a lexical chain, the method comprising:
- receiving, at a processor, the lexical chain comprising an adjective, the lexical chain being a component of an input document;
calculating a non-characteristic-ness score based on the adjective'"'"'s usage within the input document according to linguistic tests, the non-characteristic-ness score being a function of at least a frequency of the adjective'"'"'s usage in the input document and a gradability non-characteristic-ness score for the adjective;
testing, by the processor, the adjective to determine if the adjective is at least one of the following;
a characteristic adjective and a non-characteristic adjective, wherein testing the adjective comprises comparing the non-characteristic-ness score to a threshold score;
removing, by the processor, the adjective from the lexical chain when the non-characteristic-ness score is above the threshold score; and
leaving, by the processor, the adjective in the lexical chain when the non-characteristic-ness score is below the threshold score.
0 Assignments
0 Petitions
Accused Products
Abstract
Automatic text skimming using lexical chains may be provided. First, at least one lexical chain may be created from an electronic document. Next, a list of positions within the electronic document may be created. The positions may include where at least one concept represented by one of the at least one lexical chain is mentioned. In addition, a list of the position where the at least one concept is mentioned may be assembled. A selection of at least one concept may be received from the list.
-
Citations
15 Claims
-
1. A method for filtering adjectives from a lexical chain, the method comprising:
-
receiving, at a processor, the lexical chain comprising an adjective, the lexical chain being a component of an input document; calculating a non-characteristic-ness score based on the adjective'"'"'s usage within the input document according to linguistic tests, the non-characteristic-ness score being a function of at least a frequency of the adjective'"'"'s usage in the input document and a gradability non-characteristic-ness score for the adjective; testing, by the processor, the adjective to determine if the adjective is at least one of the following;
a characteristic adjective and a non-characteristic adjective, wherein testing the adjective comprises comparing the non-characteristic-ness score to a threshold score;removing, by the processor, the adjective from the lexical chain when the non-characteristic-ness score is above the threshold score; and leaving, by the processor, the adjective in the lexical chain when the non-characteristic-ness score is below the threshold score. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A method for filtering adjectives before or during formation of a lexical chain, the method comprising:
-
receiving, at a processor, an adjective; calculating, by the processor, a non-characteristic-ness score for the adjective, wherein calculating the non-characteristic-ness score comprises assigning a point value to the adjective based on linguistic tests and the adjective'"'"'s usage within at least one input document, the non-characteristic-ness score being a function of a predication non-characteristic-ness score and a gradability non-characteristic-ness score for the adjective; testing, by the processor, the adjective to determine if the adjective is at least one of the following;
a characteristic adjective and a non-characteristic adjective, wherein testing the adjective comprises comparing the non-characteristic-ness score to a threshold score;when the adjective is a non-characteristic adjective, forming, by the processor, the lexical chain without using the adjective; when the adjective is a characteristic adjective, forming, by the processor, the lexical chain, wherein the lexical chain comprises the adjective. - View Dependent Claims (13)
-
-
14. A method for filtering adjectives from a lexical chain, the method comprising:
-
receiving, at a processor, the lexical chain comprising an adjective, the lexical chain being a component of an input document; calculating a non-characteristic-ness score based on the adjective'"'"'s usage within the input document according to linguistic tests, the non-characteristic-ness score being a function of at least a frequency of the adjective'"'"'s usage in the input document and a predication non-characteristic-ness score for the adjective; testing, by the processor, the adjective to determine if the adjective is at least one of the following;
a characteristic adjective and a non-characteristic adjective, wherein testing the adjective comprises comparing the non-characteristic-ness score to a threshold score;removing, by the processor, the adjective from the lexical chain when the non-characteristic-ness score is above the threshold score; and leaving, by the processor, the adjective in the lexical chain when the non-characteristic-ness score is below the threshold score. - View Dependent Claims (15)
-
Specification