Method of identifying topic of text using nouns
First Claim
1. A method of identifying a topic of a text, comprising the steps ofa) receiving the text, where the text includes words;
- b) identifying in a hardware device each unique word in the text that is a noun;
c) determining in the hardware device a singular form of each noun identified in step (b);
d) creating in the hardware device combinations of the singular forms of the nouns determined in step (c), where the number of singular forms of the nouns in each combination is a user-definable;
e) determining in the hardware device a frequency of occurrence in the text of each noun identified in step (b);
f) assigning in the hardware device a score to each singular form noun, where the score of each singular form noun is the frequency of occurrence of the corresponding noun determined in step (e);
g) assigning in the hardware device a score to each combination of singular form nouns, where the score of each combination of singular form nouns is a sum of the scores of the singular form nouns in the combination;
h) selecting in the hardware device a user-definable number of scores of singular form nouns that are greatest in value;
i) selecting in the hardware device a user-definable number of scores of combinations of singular form nouns that are greatest in value; and
j) returning from the hardware device the singular forms of nouns and the combinations of singular forms nouns that correspond to the scores selected in step (h) and step (i) as the topic of the text.
1 Assignment
0 Petitions
Accused Products
Abstract
A method of identifying a topic of a text. Text is received. Then, the nouns in the text are identified. The singular form of each identified noun is determined. Combinations are created of the singular form of the identified nouns, where the number of singular forms of the nouns in the combinations is user-definable. The frequency of occurrence in the text of each noun that corresponds to its singular form is determined. Each frequency of occurrence is assigned as a score to its corresponding singular form noun. Each combination of singular form nouns is assigned a score that is equal to the sum of the scores of its constituent singular form nouns. The user-definable number of top scoring singular form nouns and combinations of singular form nouns are selected as the topic of the text.
11 Citations
2 Claims
-
1. A method of identifying a topic of a text, comprising the steps of
a) receiving the text, where the text includes words; -
b) identifying in a hardware device each unique word in the text that is a noun; c) determining in the hardware device a singular form of each noun identified in step (b); d) creating in the hardware device combinations of the singular forms of the nouns determined in step (c), where the number of singular forms of the nouns in each combination is a user-definable; e) determining in the hardware device a frequency of occurrence in the text of each noun identified in step (b); f) assigning in the hardware device a score to each singular form noun, where the score of each singular form noun is the frequency of occurrence of the corresponding noun determined in step (e); g) assigning in the hardware device a score to each combination of singular form nouns, where the score of each combination of singular form nouns is a sum of the scores of the singular form nouns in the combination; h) selecting in the hardware device a user-definable number of scores of singular form nouns that are greatest in value; i) selecting in the hardware device a user-definable number of scores of combinations of singular form nouns that are greatest in value; and j) returning from the hardware device the singular forms of nouns and the combinations of singular forms nouns that correspond to the scores selected in step (h) and step (i) as the topic of the text. - View Dependent Claims (2)
-
Specification