System and method for classifying and searching existing document information to identify related information
First Claim
1. An information processing apparatus for presenting related information about existing document information and about specific document information, the information processing apparatus comprising:
- a processor for processing the existing document information;
a memory for storing the existing document information;
extracting means for extracting a first characteristic word from the existing document information and a second characteristic word from the specific document information;
attribute information extracting means for extracting attribute information from the existing document information and the specific document information;
classifying means for classifying the existing document information into one or more groups based on the attribute information;
weight calculating means for calculating, for each group, a first weight of the first characteristic word and a second weight of the second characteristic word following the extraction by the extracting means;
weight modifying means for modifying, for each group, the calculated first weight based on the extracted attribute information and based on frequency of either transmissions to a destination or receptions from the destination, wherein the calculated first weight of the first characteristic word in a group associated with a first frequency of transmission is increased if the same or similar words appear in another group having a second frequency of transmission, the first frequency of transmission being lower than the second frequency of transmission;
acquiring means for acquiring the related information corresponding to the existing document information based on the first characteristic word and based on the first weight modified by the weight modifying means;
searching means for searching for the existing document information related to the specific document information, based on the second characteristic word; and
display controlling means for controlling display of the related information corresponding to the existing document information searched for by the searching means.
1 Assignment
0 Petitions
Accused Products
Abstract
An information processing apparatus for processing related information about existing document information and specific document information is provided. The apparatus includes: an acquiring element for acquiring the related information corresponding to the existing document information. The acquiring is based on firstly, a first characteristic word extracted by an extracting element, and secondly a first weight modified by a weight modifying element. The apparatus further includes: a display controlling element for controlling display of the related information corresponding to the existing document information searched for by a searching element; and an attribute information extracting element for extracting attribute information from the existing document information and the specific document information; wherein the weight modifying element modifies the first weight based on the attribute information extracted byte attribute information extracting element.
-
Citations
61 Claims
-
1. An information processing apparatus for presenting related information about existing document information and about specific document information, the information processing apparatus comprising:
-
a processor for processing the existing document information; a memory for storing the existing document information; extracting means for extracting a first characteristic word from the existing document information and a second characteristic word from the specific document information; attribute information extracting means for extracting attribute information from the existing document information and the specific document information; classifying means for classifying the existing document information into one or more groups based on the attribute information; weight calculating means for calculating, for each group, a first weight of the first characteristic word and a second weight of the second characteristic word following the extraction by the extracting means; weight modifying means for modifying, for each group, the calculated first weight based on the extracted attribute information and based on frequency of either transmissions to a destination or receptions from the destination, wherein the calculated first weight of the first characteristic word in a group associated with a first frequency of transmission is increased if the same or similar words appear in another group having a second frequency of transmission, the first frequency of transmission being lower than the second frequency of transmission; acquiring means for acquiring the related information corresponding to the existing document information based on the first characteristic word and based on the first weight modified by the weight modifying means; searching means for searching for the existing document information related to the specific document information, based on the second characteristic word; and display controlling means for controlling display of the related information corresponding to the existing document information searched for by the searching means. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. An information processing method for use with an information processing apparatus for presenting related information about existing document information and about specific document information, the information processing method comprising the steps of:
-
extracting a first characteristic word from the existing document information and a second characteristic word from the specific document information; extracting attribute information from the existing document information and the specific document information; classifying the existing document information into one or more groups based on the attribute information; calculating, for each group, a first weight of the first characteristic word and a second weight of the second characteristic word following the extraction in the extracting step; modifying, for each group, the calculated first weight based on the extracted attribute information and based on frequency of either transmission to a destination or receptions from the destination, wherein the calculated first weight of the first characteristic word in a group associated with a first frequency of transmission is increased if the same or similar words appear in another group having a second frequency of transmission, the first frequency of transmission being lower than the second frequency of transmission; controlling acquisition of the related information corresponding to the existing document information based on the first characteristic word and based on the first weight modified in the weight modifying step; searching for the existing document information related to the specific document information, based on the second characteristic word; and displaying related information corresponding to the existing document information searched for in the searching step. - View Dependent Claims (21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33)
-
-
34. A storage medium that stores a computer-readable program for controlling an information processing apparatus for presenting related information about existing document information and about specific document information, the program comprising the steps of:
-
extracting a first characteristic word from the existing document information and a second characteristic word from the specific document information; extracting attribute information from the existing document information and the specific document information; classifying the existing document information into one or more groups based on the attribute information; calculating, for each group, a first weight of the first characteristic word and a second weight of the second characteristic word following the extraction in the extracting step; modifying, for each group, the calculated first weight based on the extracted attribute information and based on frequency of either transmissions to a destination or receptions from the destination, wherein the calculated first weight of the first characteristic word in a group associated with a first frequency of transmission is increased if the same or similar words appear in another group having a second frequency of transmission, the first frequency of transmission being lower than the second frequency of transmission; controlling acquisition of the related information corresponding to the existing document information based on the first characteristic word and based on the first weight modified in the weight modifying step; searching for the existing document information related to the specific document information, based on the second characteristic word; and controlling display of the related information corresponding to the existing document information searched for in the searching step. - View Dependent Claims (35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47)
-
-
48. A storage medium that stores a computer-readable program for controlling an information processing apparatus for presenting related information about existing document information and about specific document information, the program causing a computer to execute the steps of:
-
extracting a first characteristic word from the existing document information and a second characteristic word from the specific document information; extracting attribute information from the existing document information and the specific document information; classifying the existing document information into one or more groups; calculating, for each group, a first weight of the first characteristic word and a second weight of the second characteristic word following the extraction in the extracting step; modifying, for each group, the calculated first weight based on the extracted attribute information and based on frequency of either transmissions to a destination or receptions from the destination, wherein the calculated first weight of the first characteristic word in a topic associated with a frequency of transmission is increased if the same or similar words appear in another group having a second frequency of transmission, the first frequency of transmission being lower than the second frequency of transmission; controlling acquisition of the related information corresponding to the existing document information based on the first characteristic word and based on the first weight modified in the weight modifying step; searching for the existing document information related to the specific document information, based on the second characteristic word; controlling display, using an animated desktop agent, of the related information corresponding to the existing document information searched for in the searching step. - View Dependent Claims (49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61)
-
Specification