×

Automatic clustering of tokens from a corpus for grammar acquisition

  • US 20020002454A1
  • Filed: 07/26/2001
  • Published: 01/03/2002
  • Est. Priority Date: 12/07/1998
  • Status: Active Grant
First Claim
Patent Images

1. A grammar learning method from a corpus, comprising:

  • identify context tokens within the corpus, for each non-context token in the corpus, counting occurrence of predetermined relationship of the non-context token to a context token, generating frequency vectors for each non-context token based upon the counted occurrences, and clustering non-context tokens based upon the frequency vectors.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×