×

System and methods for extracting document images from images featuring multiple documents

  • US 10,621,676 B2
  • Filed: 02/02/2016
  • Issued: 04/14/2020
  • Est. Priority Date: 02/04/2015
  • Status: Active Grant
First Claim
Patent Images

1. A method for extracting document images from images featuring multiple documents, comprising:

  • receiving a multiple-document image including a plurality of document images, wherein each document image is associated with a document;

    extracting a plurality of visual identifiers from the multiple-document image, wherein each visual identifier is text indicating information related to one of the plurality of document images;

    analyzing the plurality of visual identifiers to identify each document image, wherein each document image is identified based on at least one threshold visual identifier requirement representing a portion of the plurality of visual identifiers that need to be included in each of the identified document image;

    identifying, for each identified document image that meets the at least one threshold visual identifier requirement, a boundary based on the analysis, the boundary occupying a textless border around the respective identified document image and enclosing all of the plurality of visual identifiers that need to be included within the document image as represented by the at least one threshold visual identifier requirement;

    determining, based on the analysis, an image area of each document image, wherein the image area of the document image is defined by the boundary; and

    extracting each document image based on its image area, wherein extracting each document image further comprises generating a file including the document image.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×