×

Dictionary creation device and dictionary creation method

  • US 7,921,113 B2
  • Filed: 08/02/2006
  • Issued: 04/05/2011
  • Est. Priority Date: 12/26/2003
  • Status: Active Grant
First Claim
Patent Images

1. A dictionary creation device that creates a dictionary which is used for searching, classifying, or filtering information written as text and in which keywords are registered per category, the dictionary creation device comprising:

  • a classification information acquisition unit that acquires classification information regarding categories and text information from at least a first information source and a second information source which differ from an information source for information written as text and searched;

    a keyword extraction unit that extracts a keyword from the acquired text information;

    a dictionary registration and deletion unit that registers or deletes the extracted keyword in dictionaries corresponding to the first information source and the second information source, in accordance with a category of the first information source and a category of the second information source, respectively, based upon the classification information acquired by said classification information acquisition unit and the keyword extracted by said keyword extraction unit;

    a keyword database that stores the extracted keyword, said keyword database being a non-transitory computer-readable storage medium; and

    a dictionary combining and editing unit that edits the category of the first information source in the dictionary corresponding to the first information source and the category of the second information source in the dictionary corresponding to the second information source to create, as a category level structure of a combined dictionary, a new category level structure including the category of the first information source and the category of the second information source, based on a degree of overlap between characteristic keywords that are keywords characterizing classification information regarding the category of the first information source and characteristic keywords that are keywords characterizing classification information regarding the category of the second information source,wherein said dictionary combining and editing unit (i) compares a first set, which is a set of characteristic keywords in a first category included in the first information source, with a second set, which is a set of characteristic keywords in a second category included in the second information source, and (ii) edits and combines the dictionaries corresponding to the first information source and the second information source such that the second category is placed in a lower level subordinate to the first category as an intersecting set of the first set and the second set is less common to the first set and more common to the second set.

View all claims
  • 0 Assignments
Timeline View
Assignment View
    ×
    ×