Information abstracting method, information abstracting apparatus, and weighting method
Abstract
An information abstracting method and apparatus for extracting and displaying keywords as an information abstract. Given a large number of character string data sets divided into prescribed units, the extracted keywords are significant and effective in describing a topic common to the plurality of units. The information abstracting apparatus comprises an input section for accepting an input of character string data divided into prescribed units, with each individual character represented by a character code, and an output section for displaying the result of information abstracting. Keywords contained in each of the prescribed units are extracted by a keyword extracting section from the character string input data from the input section. A score is calculated for each keyword by a score calculating section, so that a higher score is given to a keyword extracted from a larger number of units. On the basis of the calculated scores, keywords are selected by an abstracting section and are outputted as an information abstract by the output section.
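The scoring rule in the abstract (a keyword extracted from more units receives a higher score) can be illustrated with a minimal Python sketch. The tokenizer `extract_keywords`, the score definition (number of units containing the keyword), and the `top_n` cutoff are assumptions made for illustration; the text above does not prescribe them.

```python
import re
from collections import Counter


def extract_keywords(unit):
    """Hypothetical extractor: lowercase alphabetic words longer than 3 characters."""
    return {w.lower() for w in re.findall(r"[A-Za-z]+", unit) if len(w) > 3}


def score_keywords(units):
    """Score each keyword by the number of units it was extracted from."""
    scores = Counter()
    for unit in units:
        # extract_keywords returns a set, so each unit counts at most once per keyword
        scores.update(extract_keywords(unit))
    return scores


def make_abstract(units, top_n=3):
    """Select the top_n keywords by score, breaking ties alphabetically."""
    ranked = sorted(score_keywords(units).items(), key=lambda kv: (-kv[1], kv[0]))
    return [keyword for keyword, _ in ranked[:top_n]]


units = [
    "Tokyo election results announced",
    "Election turnout in Osaka",
    "Weather: rain in Tokyo, election delayed",
]
print(make_abstract(units, top_n=2))
```

In this sketch a keyword repeated many times inside a single unit gains nothing; only its spread across units raises the score, which is one possible reading of "a state of occurrence in the other prescribed units".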
105 Citations
25 Claims
1. An information abstracting method comprising the steps of:
accepting an input of character string data divided into prescribed units, with each individual character represented by a character code;
extracting a keyword for each of said prescribed units from said input character string data;
weighting said extracted keyword by taking into account a state of occurrence, in the other prescribed units, of keywords that are identical to said extracted keyword;
selecting at least one keyword from said extracted keywords on the basis of the weighted result; and
outputting said selected keyword as an information abstract relating to said character string data.
Dependent claims: 2, 18, 19, 20
3. An information abstracting apparatus comprising:
input means for accepting an input of character string data divided into prescribed units, with each individual character represented by a character code;
keyword extracting means for extracting a keyword for each of said prescribed units from the character string data input from said input means;
weighting means for weighting said extracted keyword by taking into account a state of occurrence, in the other prescribed units, of keywords that are identical to said extracted keyword;
keyword selecting means for selecting at least one keyword from said extracted keywords on the basis of the weighted result; and
output means for outputting said selected keyword as an information abstract relating to said character string data.
Dependent claims: 4
5. An information abstracting method comprising the steps of:
accepting an input of character string data divided into prescribed units each subdivided into prescribed paragraphs, with each individual character represented by a character code;
extracting a keyword for each paragraph in each of said prescribed units from said input character string data;
generating a keyword association by associating one keyword with another among keywords obtained from the same paragraph;
weighting said extracted keyword by taking into account a state of occurrence, in the other prescribed units, of keywords that are identical to said extracted keyword, and also weighting said generated keyword association by taking into account a state of occurrence, in the other prescribed paragraphs, of keyword associations that are identical to said generated keyword association;
selecting keywords and keyword associations from said extracted keywords and said generated keyword associations on the basis of the weighted results; and
outputting said selected keywords and keyword associations as an information abstract relating to said character string data.
Dependent claims: 6, 7, 8
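The two-level weighting recited in claim 5 (keywords counted across units, keyword associations counted across paragraphs) might be sketched as follows. Treating an association as an unordered pair of keywords from the same paragraph, and using plain occurrence counts as weights, are assumptions of this sketch rather than details stated in the claim; `extract_keywords` is a hypothetical tokenizer.

```python
import re
from collections import Counter
from itertools import combinations


def extract_keywords(paragraph):
    """Hypothetical extractor: lowercase alphabetic words longer than 3 characters."""
    return {w.lower() for w in re.findall(r"[A-Za-z]+", paragraph) if len(w) > 3}


def score_keywords_and_associations(units):
    """units is a list of units; each unit is a list of paragraph strings.

    Returns (keyword_scores, association_scores):
    - keyword_scores[k] is the number of units containing k
    - association_scores[(a, b)] is the number of paragraphs containing both a and b
    """
    keyword_scores = Counter()
    association_scores = Counter()
    for paragraphs in units:
        unit_keywords = set()
        for paragraph in paragraphs:
            keywords = extract_keywords(paragraph)
            unit_keywords |= keywords
            # Sorting makes (a, b) and (b, a) the same key (unordered-pair assumption)
            association_scores.update(combinations(sorted(keywords), 2))
        keyword_scores.update(unit_keywords)
    return keyword_scores, association_scores


units = [
    ["Tokyo election", "Osaka weather"],
    ["Tokyo election results"],
    ["Osaka election"],
]
keyword_scores, association_scores = score_keywords_and_associations(units)
print(keyword_scores.most_common(2))
print(association_scores.most_common(1))
```

Selection could then take the highest-scoring entries from each counter, for example with `Counter.most_common`.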
9. An information abstracting apparatus comprising:
input means for accepting an input of character string data divided into prescribed units each subdivided into prescribed paragraphs, with each individual character represented by a character code;
keyword extracting means for extracting a keyword for each paragraph in each of said prescribed units from the character string data input from said input means;
keyword associating means for generating a keyword association by associating one keyword with another among keywords obtained from the same paragraph;
weighting means for weighting said extracted keyword by taking into account a state of occurrence, in the other prescribed units, of keywords that are identical to said extracted keyword, and for weighting said generated keyword association by taking into account a state of occurrence, in the other prescribed paragraphs, of keyword associations that are identical to said generated keyword association;
selecting means for selecting keywords and keyword associations from said extracted keywords and said generated keyword associations on the basis of the weighted results; and
output means for outputting said selected keywords and keyword associations as an information abstract relating to said character string data.
Dependent claims: 10, 21, 22
11. An information abstracting apparatus comprising:
input means for accepting an input of character string data divided into prescribed units, with each individual character represented by a character code;
keyword extracting means for extracting a keyword for each of said prescribed units from said input character string data;
similarity calculating means for calculating similarity between keywords thus extracted;
weighting means for weighting said extracted keyword by taking into account a state of occurrence, in the other prescribed units, of keywords that are identical or similar to said extracted keyword;
keyword selecting means for selecting keywords from said extracted keywords on the basis of the weighted result; and
output means for outputting said selected keywords as an information abstract relating to said character string data.
Dependent claims: 12, 23
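Claim 11 extends the weighting to keywords that are "identical or similar". The sketch below is one way this could look, with loud assumptions: similarity is approximated by the character-level ratio from Python's standard `difflib.SequenceMatcher`, the `threshold` of 0.8 is arbitrary, and the weight is the number of units holding an identical or similar keyword. None of these choices comes from the claim itself.

```python
from difflib import SequenceMatcher


def similarity(a, b):
    """Stand-in similarity measure (assumption): character-level match ratio in [0, 1]."""
    return SequenceMatcher(None, a, b).ratio()


def score_with_similarity(unit_keywords, threshold=0.8):
    """unit_keywords is a list of keyword sets, one set per unit.

    A keyword's score is 1 (its own unit) plus the number of other units that
    contain a keyword whose similarity to it is at least `threshold`.
    """
    scores = {}
    for i, keywords in enumerate(unit_keywords):
        for keyword in keywords:
            support = 1 + sum(
                any(similarity(keyword, other) >= threshold for other in others)
                for j, others in enumerate(unit_keywords)
                if j != i
            )
            # If the same keyword appears in several units, keep its best score
            scores[keyword] = max(scores.get(keyword, 0), support)
    return scores


unit_keywords = [{"election", "tokyo"}, {"elections", "osaka"}, {"weather"}]
scores = score_with_similarity(unit_keywords)
print(scores["election"], scores["tokyo"])
```

Here "election" and "elections" support each other across units even though they are not identical strings, while "tokyo" finds no similar keyword elsewhere.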
13. An information abstracting apparatus comprising:
input means for accepting an input of character string data divided into prescribed units each subdivided into prescribed paragraphs, with each individual character represented by a character code;
keyword extracting means for extracting a keyword for each paragraph in each of said prescribed units from said character string data input from said input means;
keyword associating means for generating a keyword association by associating one keyword with another among keywords obtained from the same paragraph;
similarity calculating means for calculating similarity between keywords thus extracted, on the basis of a plurality of factors including said keyword association;
weighting means for weighting said extracted keyword by taking into account a state of occurrence, in the other prescribed units, of keywords that are identical or similar to said extracted keyword, and for weighting said generated keyword association by taking into account a state of occurrence, in the other prescribed paragraphs, of keyword associations that are identical to said generated keyword association;
selecting means for selecting keywords and keyword associations from said extracted keywords and said generated keyword associations on the basis of the weighted results; and
outputting said selected keywords and keyword associations as an information abstract relating to said character string data.
Dependent claims: 14, 15, 24, 25
16. An information abstracting apparatus comprising:
input means for accepting an input of character string data divided into prescribed units each subdivided into prescribed paragraphs, with each individual character represented by a character code;
keyword extracting means for extracting a keyword for each paragraph in each of said prescribed units from the character string data input from said input means;
keyword associating means for generating a keyword association by associating one keyword with another among keywords obtained from the same paragraph;
weighting means for weighting said generated keyword association by taking into account a state of occurrence, in the other prescribed paragraphs, of keyword associations that are identical to said generated keyword association;
selecting means for selecting keyword associations from said generated keyword associations on the basis of the weighted result; and
output means for outputting an information abstract, generated based on the selection results, relating to said character string data.
17. An information abstracting apparatus comprising:
input means for accepting an input of character string data divided into prescribed units each subdivided into prescribed paragraphs, with each individual character represented by a character code;
keyword extracting means for extracting a keyword for each paragraph in each of said prescribed units from said character string data input from said input means;
keyword associating means for generating a keyword association by associating one keyword with another among keywords obtained from the same paragraph;
similarity calculating means for calculating similarity between keywords thus extracted;
keyword association/similarity calculating means for, by using said similarity calculated between keywords constituting said generated keyword association and keywords constituting another keyword association, calculating similarity between the keyword associations;
weighting means for weighting said generated keyword association by taking into account a state of occurrence, in the other prescribed paragraphs, of keyword associations that are identical or similar to said generated keyword association;
selecting means for selecting keyword associations from said generated keyword associations on the basis of the weighted result; and
outputting said selected keyword associations as an information abstract relating to said character string data.
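Claim 17 derives the similarity between two keyword associations from the similarities of their constituent keywords, without fixing how the values are combined. A purely illustrative sketch follows. It assumes associations are unordered keyword pairs, uses `difflib.SequenceMatcher` as a stand-in keyword similarity, and combines values by taking the better of the two possible pairings, each limited by its weaker keyword match. The function names and the combination rule are hypothetical.

```python
from difflib import SequenceMatcher


def keyword_similarity(a, b):
    """Stand-in keyword similarity (assumption): character-level match ratio."""
    return SequenceMatcher(None, a, b).ratio()


def association_similarity(first, second):
    """Similarity between two keyword pairs (hypothetical combination rule).

    Each pairing of keywords is only as strong as its weaker match (min);
    the association similarity is the better of the two pairings (max).
    """
    a, b = first
    c, d = second
    straight = min(keyword_similarity(a, c), keyword_similarity(b, d))
    crossed = min(keyword_similarity(a, d), keyword_similarity(b, c))
    return max(straight, crossed)


print(association_similarity(("election", "tokyo"), ("tokyo", "elections")))
print(association_similarity(("election", "tokyo"), ("osaka", "weather")))
```

An association's weight could then count the other paragraphs that contain an association whose similarity to it exceeds some chosen threshold, mirroring the keyword-level weighting.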
Specification