×

Information analysing apparatus

  • US 20040088308A1
  • Filed: 08/13/2003
  • Published: 05/06/2004
  • Est. Priority Date: 08/16/2002
  • Status: Abandoned Application
First Claim
Patent Images

1. Information analysing apparatus for clustering information elements in items of information into groups of related information elements, the apparatus comprising:

  • a count data provider for providing count data representing the number of occurrences of elements in each item of information;

    an initial model parameter determiner for determining first model parameters representing a probability distribution for the groups, second model parameters representing for each element the probability for each group of that element being associated with that group, and third model parameters representing for each item the probability for each group of that item being associated with that group;

    a user input receiver for enabling a user to input prior information relating to the relationship between at least some of the groups and at least some of the elements;

    a prior data determiner for determining from prior information input by a user using the user input receiver prior probability data for at least some of the second model parameters;

    an expected probability calculator for receiving the first, second and third model parameters and the prior probability data and for calculating, for each item of information and for each information element of that item, the expected probability of that item and that element being associated with each group using the first, second and third model parameters and the prior probability data determined by the prior data determiner;

    a model parameter updater for updating the first, second and third model parameters in accordance with the expected probabilities calculated by the expected probability calculator and the count data stored by the count data provider;

    a likelihood calculator for calculating a likelihood on the basis of the expected probabilities and the count data stored by the count data provider; and

    a controller for causing for causing the expected probability calculator, the model parameter updater and the likelihood calculator to recalculate the expected probabilities using the prior probability data and updated model parameters, to update the model parameters and to recalculate the likelihood, respectively, until the likelihood meets a given criterion.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×