System and method for efficiently generating cluster groupings in a multi-dimensional concept space

  • US 6,778,995 B1
  • Filed: 08/31/2001
  • Issued: 08/17/2004
  • Est. Priority Date: 08/31/2001
  • Status: Active Grant
First Claim

1. A system for building a multi-dimensional semantic concept space over a stored document collection, comprising:

  • an extraction module identifying a plurality of documents within a stored document collection containing substantially correlated terms reflecting syntactic content, comprising;

    an extractor extracting the terms in literal form from the documents;

    a selector selecting the terms having frequencies of occurrence falling within a predefined threshold as being substantially correlated;

    a vector module generating a vector reflecting latent semantic similarities discovered between substantially correlated documents logically projected at an angle θ from a common axis in a concept space;

    a cluster module forming one or more arbitrary clusters at an angle σ from the common axis in the concept space, each cluster comprising documents having such an angle θ falling within a predefined variance of the angle σ for the cluster, and constructing a new arbitrary cluster at an angle σ from the common axis in the concept space, each new cluster comprising documents having such an angle θ falling outside the predefined variance of the angle σ for the remaining clusters.
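
The clustering step recited in the claim can be illustrated with a minimal sketch. It is not the patented implementation. The function names (angle_from_axis, cluster_by_angle), the use of degrees, the first-match assignment rule, and the choice to seed each new cluster's angle σ with the angle θ of the document that triggered it are all assumptions made for illustration; the claim text does not specify them.

```python
import math


def angle_from_axis(vector, axis):
    """Angle theta (degrees) between a document vector and a common axis.

    Assumes both inputs are non-zero and of equal length.
    """
    dot = sum(v * a for v, a in zip(vector, axis))
    norm = math.sqrt(sum(v * v for v in vector)) * math.sqrt(sum(a * a for a in axis))
    # Clamp to [-1, 1] to guard against floating-point drift before acos.
    cosine = max(-1.0, min(1.0, dot / norm))
    return math.degrees(math.acos(cosine))


def cluster_by_angle(thetas, variance):
    """Group documents by their angle theta from the common axis.

    A document joins the first existing cluster whose angle sigma is within
    `variance` of its theta; otherwise a new cluster is constructed.
    Assumption: the new cluster's sigma is set to that document's theta.
    """
    clusters = []  # each cluster: {"sigma": float, "members": [document indices]}
    for index, theta in enumerate(thetas):
        for cluster in clusters:
            if abs(theta - cluster["sigma"]) <= variance:
                cluster["members"].append(index)
                break
        else:
            # theta falls outside the variance of every existing cluster.
            clusters.append({"sigma": theta, "members": [index]})
    return clusters


if __name__ == "__main__":
    # Hypothetical document angles, in degrees.
    thetas = [10.0, 12.0, 40.0, 43.0, 11.0]
    for cluster in cluster_by_angle(thetas, variance=5.0):
        print(cluster)
```

Because assignment is first-match and order-dependent in this sketch, presenting the documents in a different order can yield different cluster angles; a production system would likely need a more deliberate rule for choosing σ.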
