×

Script-agnostic text reflow for document images

  • US 8,542,926 B2
  • Filed: 11/19/2010
  • Issued: 09/24/2013
  • Est. Priority Date: 11/19/2010
  • Status: Expired due to Fees
First Claim
Patent Images

1. A computer-implemented process for reflowing a binarized region of text from a document image into a specified-width display area on an electronic display in a script-agnostic manner, comprising:

  • using a computer to perform the following process actions;

    segmenting the text region into candidate lines of text, said candidate lines of text comprising at least one line of text comprising substantially only accent or diacritic marks, or both;

    merging candidate lines of text comprising substantially only accent or diacritic marks, or both, into the closest adjacent candidate text line;

    designating each remaining candidate text line as a final text line;

    segmenting each final text line into candidate text words;

    identifying inter-word punctuation and diacritic marks, if any, and merging each identified mark into the closest adjacent candidate text word;

    designating final text words based on the remaining candidate text words;

    segmenting the final text lines into paragraphs; and

    for each paragraph, reflowing the final text words found therein so as to fit into said specified-width display area while maintaining the original sequential order.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×