×

SEARCH APPARATUS, SEARCH METHOD, AND RECORDING MEDIUM STORING PROGRAM

  • US 20110219000A1
  • Filed: 11/06/2009
  • Published: 09/08/2011
  • Est. Priority Date: 11/26/2008
  • Status: Active Grant
First Claim
Patent Images

1. A search apparatus comprising:

  • an abstract matrix storage unit that, when information which is created from a plurality of regions obtained by dividing a matrix representing a co-occurrence relationship between a word set and a document set and which also represents a subset included in the document set is provided, stores information which enables calculation or estimation of a frequency of a word in each of the to plurality of regions as abstract information;

    a region upper limit calculation unit that, when the information representing the subset is input, examines a relationship between the information representing the subset and the plurality of regions, refers to the abstract information for each of the plurality of regions from the obtained result, and calculates, for each of the plurality of regions, an upper limit of the frequency of the word included in each of the plurality of regions for the subset;

    a word frequency calculation unit that adds the upper limit of the frequency for each of the plurality of regions by each region with a common word, and specifies the obtained added value as the upper limit of the frequency of the word for each region with the common word; and

    a document frequency reference unit that obtains a region to be searched according to the upper limit of the frequency of the word for each region with the common word, further specifies a specified number of words in order of higher frequency according to the obtained region to be searched, and outputs the specified word as a word characteristic to the subset.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×