×

Method and system for processing text

  • US 8,566,080 B2
  • Filed: 04/29/2010
  • Issued: 10/22/2013
  • Est. Priority Date: 04/30/2009
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for text processing, comprising:

  • determining a plurality of characters in a text, wherein the text comprises double-byte coded characters;

    determining whether a number of bytes included in each text segment is even or odd;

    detecting which of the plurality of characters represent punctuations;

    dividing the text into a plurality of different text segments using the detected punctuations as separators between the different text segments; and

    performing a plurality of discrete decoding operations, one for each of the plurality of different text segments, wherein one or more of the plurality of different text segments comprise at least one occurrence of unrecognizable codes that are unable to be successfully decoded as comprehensible characters without inferences being made, wherein decoding operations on text segments lacking unrecognizable codes are unaffected by other decoding operations on text segments including unrecognizable codes; and

    when performing the plurality of discrete decoding operations and only when the number of word segments included in one of the text segments is odd, decoding from a head of the text segment rearward, as a first decoding result of the text segment, and decoding from a tail of the text segment frontward, as a second decoding result of the text segment.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×