Method for inset detection in document layout analysis

  • US 6,377,704 B1
  • Filed: 04/30/1998
  • Issued: 04/23/2002
  • Est. Priority Date: 04/30/1998
  • Status: Expired due to Term
First Claim

1. A document layout analysis method for determining document structure data from input data including the content and characteristics of regions of a portion of at least one page forming the document, the method comprising the steps of:

  • segmenting the regions within the page to identify regions characterized as text and graphics;

  • analyzing text regions to identify and characterize certain text regions as insets comprising the steps of:

      a) finding a pair of horizontal rulings in general vertical alignment with one another; and

      b) identifying text present between said horizontal rulings;

  • producing an output of the recomposed text regions of the image in reading order, wherein the reading order is a function of the column boundaries; and

  • performing optical character recognition on the text regions.
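
The inset-detection sub-steps a) and b) can be illustrated with a minimal sketch. This is not the patented implementation. The claim does not define "general vertical alignment" numerically, so the sketch assumes that two rulings are aligned when their left and right ends agree within a pixel tolerance `tol`. The names `Ruling`, `TextRegion`, `vertically_aligned`, and `find_insets` are hypothetical, and segmentation is assumed to have already produced the rulings and text bounding boxes.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Ruling:
    """A horizontal ruling spanning [left, right] at vertical position y."""
    left: float
    right: float
    y: float


@dataclass(frozen=True)
class TextRegion:
    """Bounding box of a text region; y grows downward, as in image coordinates."""
    left: float
    top: float
    right: float
    bottom: float
    text: str = ""


def vertically_aligned(a: Ruling, b: Ruling, tol: float = 5.0) -> bool:
    """Assumed alignment test: both ends of the two rulings agree within tol."""
    return abs(a.left - b.left) <= tol and abs(a.right - b.right) <= tol


def find_insets(rulings, regions, tol=5.0):
    """Return (upper, lower, enclosed) for each aligned ruling pair
    that has at least one text region lying between the two rulings."""
    insets = []
    for a, b in combinations(rulings, 2):
        # Step a): keep only pairs at different heights whose ends line up.
        if a.y == b.y or not vertically_aligned(a, b, tol):
            continue
        upper, lower = (a, b) if a.y < b.y else (b, a)
        left = min(upper.left, lower.left) - tol
        right = max(upper.right, lower.right) + tol
        # Step b): collect text regions bounded by the two rulings.
        enclosed = [
            r for r in regions
            if r.top >= upper.y and r.bottom <= lower.y
            and r.left >= left and r.right <= right
        ]
        if enclosed:
            insets.append((upper, lower, enclosed))
    return insets


if __name__ == "__main__":
    rulings = [Ruling(100, 300, 200), Ruling(102, 298, 400), Ruling(350, 600, 50)]
    regions = [
        TextRegion(110, 220, 290, 380, "pull quote"),
        TextRegion(350, 60, 600, 700, "body column"),
    ]
    for upper, lower, enclosed in find_insets(rulings, regions):
        print(upper.y, lower.y, [r.text for r in enclosed])
```

The sketch checks every pair of rulings, so three or more stacked rulings would yield overlapping candidates. A fuller system would need a rule for choosing among them, which the claim text does not specify.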
