×

Method and apparatus for tokenizing text

  • US 5,721,939 A
  • Filed: 08/03/1995
  • Issued: 02/24/1998
  • Est. Priority Date: 08/03/1995
  • Status: Expired due to Term
First Claim
Patent Images

1. A method for tokenizing text with a tokenizing transducer comprising the steps of:

  • (a) storing all current configurations including at least one current configuration in a configuration storage unit, the at least one current configuration including an extension state and an output node;

    (b) selecting one said at least one current configuration that has not been processed;

    (c) processing said selected current configuration wherein processing includes creating any next configurations;

    (d) repeating steps b and c until all of said current configurations have been processed;

    (e) freeing all of said current configurations;

    (f) redefining all of the any next configurations as current configurations and the next text position as the current text position;

    (g) counting said current configurations; and

    (h) providing output when exactly one next configuration exists.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×