Character string dividing or separating method and related system for segmenting agglutinative text or document into words

  • US 20010009009A1
  • Filed: 12/26/2000
  • Published: 07/19/2001
  • Est. Priority Date: 12/28/1999
  • Status: Abandoned Application
First Claim

1. A character string dividing system for segmenting a character string into a plurality of words, comprising:

  • input means for receiving a document;

  • document data storing means serving as a document database for storing a received document;

  • character joint probability calculating means for calculating a joint probability of two neighboring characters appearing in said document database;

  • probability table storing means for storing a table of calculated joint probabilities;

  • character string dividing means for segmenting an objective character string into a plurality of words with reference to said table of calculated joint probabilities; and

  • output means for outputting a division result of said objective character string.
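
The elements recited in the claim can be illustrated with a minimal sketch. The claim does not say how the joint probability is estimated or how the table drives the division, so two assumptions are made here: the joint probability of a character pair is taken to be its relative frequency among all adjacent pairs in the stored documents, and a split is placed wherever that probability falls below a fixed threshold. The function names and the threshold parameter are hypothetical and do not come from the application.

```python
from collections import Counter


def build_joint_probability_table(documents):
    """Estimate the joint probability of each adjacent character pair.

    Assumption: probability = pair count / total adjacent pairs in the
    document database. The claim does not fix the estimator.
    """
    pair_counts = Counter()
    for doc in documents:
        # zip(doc, doc[1:]) yields every pair of neighboring characters.
        pair_counts.update(zip(doc, doc[1:]))
    total = sum(pair_counts.values())
    if total == 0:
        return {}
    return {pair: count / total for pair, count in pair_counts.items()}


def divide_string(text, table, threshold):
    """Split text between neighbors whose joint probability is below threshold.

    Assumption: a simple threshold rule; pairs missing from the table
    are treated as having probability 0.0.
    """
    if not text:
        return []
    words = []
    start = 0
    for i in range(1, len(text)):
        if table.get((text[i - 1], text[i]), 0.0) < threshold:
            words.append(text[start:i])
            start = i
    words.append(text[start:])
    return words


# Toy document database standing in for the stored documents.
table = build_joint_probability_table(["abab", "cdcd"])
print(divide_string("abcd", table, threshold=0.2))  # ['ab', 'cd']
```

In this toy run the pair (b, c) never occurs in the database, so its probability is 0.0 and the string is divided between "ab" and "cd". A real agglutinative-language corpus would need a much larger database and a tuned threshold.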
