×

Formula detection engine

  • US 9,928,225 B2
  • Filed: 01/23/2012
  • Issued: 03/27/2018
  • Est. Priority Date: 01/23/2012
  • Status: Active Grant
First Claim
Patent Images

1. A method for converting a fixed format document containing a formula into a flow format document, the method comprising the acts of:

  • detecting a formula seed of a formula in data parsed from a page of a fixed format document, wherein the formula seed comprises a text element that carries an indication of being part of a formula;

    creating a formula area bounding the detected formula seed;

    expanding the formula area to include one or more mathematical elements that are detected on the page based on a proximity to the detected formula seed, wherein together the detected formula seed and the included one or more mathematical elements comprise a plurality of captured elements;

    placing the plurality of captured elements into one or more groups based on a vertical position of the plurality of captured elements relative to a line of normal text that overlaps the formula area;

    creating a new formula area around each of the one or more groups;

    splitting each new formula area based on a horizontal spacing between the captured elements in the new formula area;

    selecting a set of split formula areas based on an overlap of the split formula areas; and

    merging the split formula areas within the set into a single formula for display on a single line of a page of a flow format document.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×