Automatic clustering of tokens from a corpus for grammar acquisition

  • US 7,356,462 B2
  • Filed: 09/15/2003
  • Issued: 04/08/2008
  • Est. Priority Date: 07/26/2001
  • Status: Expired due to Fees
First Claim

1. A machine-readable medium having stored thereon executable instructions that when executed by a processor, cause the processor to:

  • generate frequency vectors for each non-context token in a corpus based upon counted occurrences of a predetermined relationship of the non-context tokens to context tokens; and

  • cluster the non-context tokens into a cluster tree based upon the frequency vectors according to a lexical correlation among the non-context tokens, wherein the cluster tree is used in a pattern recognition system.
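
The two steps recited in the claim can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names (`frequency_vectors`, `cluster_tree`, `leaves`) are hypothetical, the "predetermined relationship" is assumed to be adjacency (a context token directly before or after the non-context token), and cosine distance with average linkage is an assumed stand-in for the "lexical correlation" that the claim leaves unspecified.

```python
import math


def frequency_vectors(tokens, context_tokens, offsets=(-1, 1)):
    """Count, per non-context token, how often each context token
    occurs at each relative offset (assumed relationship: adjacency)."""
    context = sorted(set(context_tokens))
    dims = [(o, c) for o in offsets for c in context]
    index = {d: i for i, d in enumerate(dims)}
    context_set = set(context)
    vectors = {}
    for pos, tok in enumerate(tokens):
        if tok in context_set:
            continue
        vec = vectors.setdefault(tok, [0] * len(dims))
        for o in offsets:
            j = pos + o
            if 0 <= j < len(tokens) and tokens[j] in context_set:
                vec[index[(o, tokens[j])]] += 1
    return vectors


def cosine_distance(u, v):
    """1 - cosine similarity; an assumed measure, not taken from the claim."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    if nu == 0 or nv == 0:
        return 1.0
    return 1.0 - dot / (nu * nv)


def cluster_tree(vectors):
    """Agglomerative clustering (average linkage) into a nested-tuple tree."""
    # Each entry pairs a subtree with the list of tokens it contains.
    clusters = [(tok, [tok]) for tok in sorted(vectors)]
    if not clusters:
        return None

    def linkage(a, b):
        total = sum(cosine_distance(vectors[x], vectors[y]) for x in a for y in b)
        return total / (len(a) * len(b))

    while len(clusters) > 1:
        pairs = ((i, j) for i in range(len(clusters))
                 for j in range(i + 1, len(clusters)))
        i, j = min(pairs, key=lambda p: linkage(clusters[p[0]][1],
                                                clusters[p[1]][1]))
        merged = ((clusters[i][0], clusters[j][0]),
                  clusters[i][1] + clusters[j][1])
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return clusters[0][0]


def leaves(tree):
    """Flatten a nested-tuple tree into its list of tokens."""
    if isinstance(tree, tuple):
        return leaves(tree[0]) + leaves(tree[1])
    return [tree]


# Toy corpus, invented for illustration only.
corpus = ("from boston to denver from denver to chicago "
          "from chicago to boston fly on monday fly on friday").split()
vectors = frequency_vectors(corpus, {"from", "to", "on"})
tree = cluster_tree(vectors)
```

In this toy corpus, tokens that share the same neighbouring context tokens (the city names after "from" and "to", the day names after "on") receive similar vectors and are merged early, so they end up under a common subtree. A real system would need a larger corpus and a principled choice of context tokens and distance measure.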

  • 4 Assignments