×

Automatic clustering of tokens from a corpus for grammar acquisition

  • US 20040064303A1
  • Filed: 09/15/2003
  • Published: 04/01/2004
  • Est. Priority Date: 07/26/2001
  • Status: Active Grant
First Claim
Patent Images

1. A machine-readable medium having stored thereon executable instructions that when executed by a processor, cause the processor to:

  • generate frequency vectors for each non-context token in a corpus based upon counted occurrences of a predetermined relationship of the non-context tokens to context tokens; and

    cluster the non-context tokens into a cluster tree based upon the frequency vectors according to a lexical correlation among the non-context tokens.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×