×

MULTI-LINGUAL WORD HYPHENATION USING INDUCTIVE MACHINE LEARNING ON TRAINING DATA

  • US 20090182550A1
  • Filed: 01/16/2008
  • Published: 07/16/2009
  • Est. Priority Date: 01/16/2008
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • receiving training data that include a plurality of hyphenated words;

    inductively generating hyphenation patterns that represent substrings occurring within the words, wherein the hyphenation patterns include at least the substrings and include hyphenation codes associated respectively with characters occurring in the substrings, wherein the hyphenation codes identify hyphenation points within the patterns;

    receiving at least one induction parameter applicable to generating the hyphenation patterns; and

    storing at least the substrings and the hyphenation codes into a language-specific lexicon file.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×