×

System and method for detecting and correcting mismatched Chinese character

  • US 8,725,497 B2
  • Filed: 10/05/2011
  • Issued: 05/13/2014
  • Est. Priority Date: 10/05/2011
  • Status: Active Grant
First Claim
Patent Images

1. A non-transitory computer-readable medium on which is stored a computer program for detecting and correcting mismatched Chinese characters in a phrase, the computer program comprising computer instructions that when executed by a computing device performs the steps of:

  • (S1) inputting an input phrase;

    (S2) a mismatched character detection module comparing the input phrase with Chinese phrases stored in a database;

    (S3) in step (S2), if there is an exact match, the mismatched character detection module determining the input phrase as constructed correctly;

    (S4) in step (S2), if there is no exact match and an edit distance is greater than two, the mismatched character detection module determining the input phrase as an unknown phrase;

    (S5) in step (S2), if there is no exact match, the edit distance is two and the phrase length is two, the mismatched character detection module determining the input phrase as an unknown phrase;

    (S6) in step (S2), if there is no exact match, the edit distance is one, or the edit distance is two and the phrase length is greater than two, the mismatched character detection module looking up a database to retrieve the associated pronunciation and radical composition for each of the mismatched Chinese characters in the input phrase and sending the retrieved information to an mismatched character correction module;

    (S7) the mismatched character correction module performing an adjacency match by swapping two adjacent and mismatched characters in the input phrase and see if the input phrase would match any of the partially matched phrases, if a match being found, the mismatched character correction module adding the partially matched phrase to a completely matched phrase list;

    (S8) the mismatched character correction module comparing the pronunciation of the mismatched character(s) with that of the character(s) in the same position of the partially matched phrase, if the pronunciation is identical, the mismatched character correction module adding the partially matched phrase to the completely matched phrase list;

    (S9) the mismatched character correction module comparing the pronunciation of the mismatched character(s) with that of the character(s) in the same position of the partially matched phrase, if the pronunciation is similar, the mismatched character correction module adding the partially matched phrase to the completely matched phrase list;

    (S10) the mismatched character correction module compare the radical composition of the mismatched character(s) with that of the character(s) in the same position of the partially matched phrase, if there is a radical match, the mismatched character correction module adding the partially matched phrase to the completely matched phrase list.

View all claims
  • 0 Assignments
Timeline View
Assignment View
    ×
    ×