Automatic clustering of tokens from a corpus for grammar acquisition

  • US 7,966,174 B1
  • Filed: 02/14/2008
  • Issued: 06/21/2011
  • Est. Priority Date: 12/07/1998
  • Status: Expired due to Fees
First Claim

1. A system that recognizes patterns, the system comprising:

    a first module configured to control a processor to generate frequency vectors for each non-context token in a corpus based upon counted occurrences of a predetermined relationship of the non-context tokens to context tokens;

    a second module configured to control the processor to cluster the non-context tokens into a cluster tree based upon the frequency vectors according to a lexical correlation among the non-context tokens; and

    a third module configured to control the processor to use the cluster tree for pattern recognition.
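
The first two elements of the claim can be illustrated with a minimal Python sketch. It is not the patented implementation. The claim does not say what the "predetermined relationship" or the "lexical correlation" is, so the sketch assumes adjacency at offsets -1 and +1 for the former and cosine distance between count vectors for the latter. All function names, the pooled-count merge rule, and the toy corpus are hypothetical. The third element (using the tree for pattern recognition) is not covered.

```python
import math
from itertools import combinations


def frequency_vectors(tokens, context_tokens, offsets=(-1, 1)):
    """Build one count vector per non-context token.

    Assumption: the "predetermined relationship" is adjacency, so each
    vector dimension counts one (context token, offset) pair.
    """
    dims = [(c, d) for c in sorted(context_tokens) for d in offsets]
    index = {dim: i for i, dim in enumerate(dims)}
    vectors = {}
    for pos, tok in enumerate(tokens):
        if tok in context_tokens:
            continue
        vec = vectors.setdefault(tok, [0] * len(dims))
        for d in offsets:
            j = pos + d
            if 0 <= j < len(tokens) and tokens[j] in context_tokens:
                vec[index[(tokens[j], d)]] += 1
    return vectors


def cosine_distance(u, v):
    """Assumed stand-in for the claim's 'lexical correlation' measure."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    if nu == 0 or nv == 0:
        return 1.0
    return 1.0 - dot / (nu * nv)


def cluster_tree(vectors):
    """Greedy agglomerative clustering; returns a tree of nested 2-tuples."""
    if not vectors:
        raise ValueError("no non-context tokens to cluster")
    clusters = [(tok, list(vec)) for tok, vec in sorted(vectors.items())]
    while len(clusters) > 1:
        # Merge the closest pair; the merged vector pools both count vectors.
        i, j = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: cosine_distance(clusters[p[0]][1], clusters[p[1]][1]),
        )
        (ta, va), (tb, vb) = clusters[i], clusters[j]
        merged = ((ta, tb), [a + b for a, b in zip(va, vb)])
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return clusters[0][0]


def leaves(tree):
    """Flatten a cluster tree back into its list of tokens."""
    if isinstance(tree, tuple):
        return [tok for child in tree for tok in leaves(child)]
    return [tree]


# Toy corpus; the choice of context tokens is an assumption for illustration.
tokens = "the cat sat on the mat and the dog sat on the rug".split()
context = {"the", "on", "and"}
vecs = frequency_vectors(tokens, context)
tree = cluster_tree(vecs)
print(vecs)
print(tree)
```

In this toy corpus the nouns all follow "the" and so end up with similar vectors, while "sat" shares no context dimension with them and joins the tree only at the final merge. A real system would use a far larger corpus and could substitute any other distance or linkage rule.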
