Detection and reconstruction of East Asian layout features in a fixed format document

  • US 10,127,221 B2
  • Filed: 05/02/2016
  • Issued: 11/13/2018
  • Est. Priority Date: 03/11/2013
  • Status: Active Grant
First Claim

1. A method for detecting ruby text in a fixed format document, the method comprising:

    receiving, at a parser, a fixed format document containing one or more lines of text on one or more pages;

    detecting, by a line detection engine, one or more lines in the fixed format document containing one or more attributes of a ruby line;

    retaining the one or more lines in the fixed format document containing one or more attributes of a ruby line as ruby line candidates and a line successive to the one or more lines as ruby base line candidates;

    analyzing, by a document processor, the ruby line candidate for finding one or more ruby texts contained in the ruby line candidate;

    matching the one or more ruby texts with a corresponding ruby base text in a successive ruby base line candidate for reconstruction in a flow format document; and

    reconstructing, by a serializer, the fixed format document to a flow format document containing the matched one or more ruby texts and corresponding ruby base text.
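
The steps recited in claim 1 can be illustrated with a minimal sketch. Everything named here is hypothetical and not taken from the patent: the `Run` and `Line` types, the functions `is_ruby_candidate`, `match_ruby`, and `serialize`, the use of a font-size ratio as the "attribute of a ruby line", horizontal overlap as the matching rule, and HTML `<ruby>`/`<rt>` markup as the flow format. The claim itself does not specify any of these choices.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Run:
    """A piece of text with its horizontal extent on the page (hypothetical type)."""
    text: str
    x0: float  # left edge
    x1: float  # right edge


@dataclass
class Line:
    """A detected line of text (hypothetical type)."""
    runs: List[Run]
    font_size: float


def is_ruby_candidate(line: Line, next_line: Line, ratio: float = 0.6) -> bool:
    # Assumed attribute: a ruby line is set noticeably smaller than the line after it.
    return line.font_size <= ratio * next_line.font_size


def overlap(a: Run, b: Run) -> float:
    # Length of the horizontal intersection of two runs (0 if they do not intersect).
    return max(0.0, min(a.x1, b.x1) - max(a.x0, b.x0))


def match_ruby(ruby_line: Line, base_line: Line) -> List[Tuple[Run, str]]:
    # Attach each ruby run to the base run it overlaps most (assumed matching rule).
    if not base_line.runs:
        return []
    annotations = {i: [] for i in range(len(base_line.runs))}
    for ruby in ruby_line.runs:
        best = max(range(len(base_line.runs)),
                   key=lambda i: overlap(ruby, base_line.runs[i]))
        if overlap(ruby, base_line.runs[best]) > 0:
            annotations[best].append(ruby.text)
    return [(base, "".join(annotations[i])) for i, base in enumerate(base_line.runs)]


def serialize(lines: List[Line]) -> str:
    # Walk the lines in order; a ruby candidate consumes the successive line as its base.
    out = []
    i = 0
    while i < len(lines):
        if i + 1 < len(lines) and is_ruby_candidate(lines[i], lines[i + 1]):
            parts = []
            for base, rt in match_ruby(lines[i], lines[i + 1]):
                if rt:
                    parts.append(f"<ruby>{base.text}<rt>{rt}</rt></ruby>")
                else:
                    parts.append(base.text)
            out.append("".join(parts))
            i += 2
        else:
            out.append("".join(run.text for run in lines[i].runs))
            i += 1
    return "\n".join(out)


page = [
    Line([Run("かんじ", 0, 20)], font_size=5.0),
    Line([Run("漢字", 0, 20), Run("を読む", 20, 50)], font_size=10.0),
]
print(serialize(page))  # <ruby>漢字<rt>かんじ</rt></ruby>を読む
```

In this sketch the small line is retained as a ruby line candidate, the line after it becomes the ruby base line candidate, and the serializer emits the pair as a single flow-format line. A real detector would likely combine several attributes (spacing, script, position) rather than font size alone.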
