×

Method and apparatus for automatic identification of word boundaries in continuous text and computation of word boundary scores

  • US 6,185,524 B1
  • Filed: 12/31/1998
  • Issued: 02/06/2001
  • Est. Priority Date: 12/31/1998
  • Status: Expired due to Fees
First Claim
Patent Images

1. A computerized method for identifying word boundaries in a continuous text input, the method comprising the following digital processes:

  • (a) comparing the continuous text to a set of varying length strings to identify candidate word-initial boundaries and candidate word-final boundaries in the continuous text, each candidate word-initial boundary and candidate word-final boundary being a character in the continuous text and having an associated probability value;

    (b) identifying each candidate word boundary in the continuous text by calculating a word boundary score for such candidate word boundary using the probability values associated with the candidate word-initial boundaries and candidate word-final boundaries identified in step (a), the candidate word boundaries defining segments of the continuous text; and

    (c) verifying each segment defined by the candidate word boundaries identified in step (b) against a string database.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×