×

Detection and reconstruction of east asian layout features in a fixed format document

  • US 9,330,070 B2
  • Filed: 03/11/2013
  • Issued: 05/03/2016
  • Est. Priority Date: 03/11/2013
  • Status: Active Grant
First Claim
Patent Images

1. A method for detecting Chinese, Japanese, or Korean text in a fixed format document, method comprising:

  • receiving a fixed format document, the fixed document comprising one or more text runs on one or more pages;

    analyzing the one or more text runs on a page for finding at least one Chinese, Japanese, or Korean character;

    if at least one Chinese, Japanese, or Korean character is found on the page, analyzing the one or more text runs on the page for determining a text direction for the page, comprising;

    analyzing the one or more text runs in a horizontal line and in a vertical line;

    for each text run, determining if the text run fits a horizontal or a vertical sequence of text runs;

    counting a number of characters in each horizontal text run and each vertical text run; and

    if more characters are in the vertical text runs than in the horizontal text runs, determining the page comprises vertical text;

    if the page comprises vertical text, rotating the page 90°

    counterclockwise for layout analysis for reconstruction in a flow format document; and

    reconstructing the fixed format document to a flow format document.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×