
Document analysis system for integration of paper records into a searchable electronic database

  • US 20070168382A1
  • Filed: 01/03/2007
  • Published: 07/19/2007
  • Est. Priority Date: 01/03/2006
  • Status: Abandoned Application
First Claim

1. A computer-readable medium, the medium being characterized in that:

  • the computer-readable medium contains code which, when executed in a processor, performs document analysis by the steps of:

    electronically receiving at least one input scan containing at least one field for containing data;

    analyzing the input scan to identify lines and fields within the input scan, by the steps of:

    locating at least one shaded region or line segment;

    filtering any shaded region found;

    detecting and filling in any gaps in any located line segment;

    clustering any line segments co-located within a specified shift distance; and

    determining a length and a location for each line segment or line segment cluster;

    comparing the analyzed input scan against a library of form templates;

    identifying the form template that best matches the input scan;

    based on the identified form template, identifying at least one field or line within the input scan; and

    extracting data from the identified field or line.
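
The line-analysis and template-matching steps recited above can be illustrated with a minimal Python sketch. It is not taken from the application: the `Segment` type, the function names, the overlap rule used for clustering, and the hit-ratio score used for matching are all assumptions chosen for illustration. It handles only horizontal segments and omits shaded-region filtering and data extraction.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    """Hypothetical horizontal line segment found in a scan (pixel units)."""
    y: int   # row of the segment
    x0: int  # first column, inclusive
    x1: int  # last column, inclusive


def length(seg):
    """Length of a segment in pixels."""
    return seg.x1 - seg.x0 + 1


def fill_gaps(segments, max_gap):
    """Join segments on the same row separated by at most max_gap pixels."""
    merged = []
    for seg in sorted(segments, key=lambda s: (s.y, s.x0)):
        last = merged[-1] if merged else None
        if last and last.y == seg.y and seg.x0 - last.x1 - 1 <= max_gap:
            merged[-1] = Segment(last.y, last.x0, max(last.x1, seg.x1))
        else:
            merged.append(seg)
    return merged


def cluster(segments, shift):
    """Merge segments whose rows differ by at most shift and whose
    column ranges overlap (an assumed reading of "co-located")."""
    clusters = []
    for seg in sorted(segments, key=lambda s: (s.y, s.x0)):
        for i, c in enumerate(clusters):
            overlaps = seg.x0 <= c.x1 and c.x0 <= seg.x1
            if abs(seg.y - c.y) <= shift and overlaps:
                clusters[i] = Segment(c.y, min(c.x0, seg.x0), max(c.x1, seg.x1))
                break
        else:
            clusters.append(seg)
    return clusters


def match_score(lines, template, tol):
    """Fraction of template lines matched by a detected line within tol pixels
    in position and length (an assumed scoring rule)."""
    def close(a, b):
        return (abs(a.y - b.y) <= tol
                and abs(a.x0 - b.x0) <= tol
                and abs(length(a) - length(b)) <= tol)

    hits = sum(any(close(t, ln) for ln in lines) for t in template)
    return hits / max(len(template), len(lines), 1)


def best_template(lines, library, tol=3):
    """Return the name of the library template with the highest match score."""
    return max(library, key=lambda name: match_score(lines, library[name], tol))


# Illustrative data only: a broken line on row 10, a shifted duplicate on row 11,
# and a second line on row 50.
raw = [Segment(10, 0, 40), Segment(10, 43, 100), Segment(11, 5, 98), Segment(50, 0, 60)]
lines = cluster(fill_gaps(raw, max_gap=3), shift=2)
library = {
    "intake": [Segment(10, 0, 100), Segment(50, 0, 60)],
    "invoice": [Segment(20, 0, 100), Segment(80, 0, 30), Segment(90, 0, 30)],
}
chosen = best_template(lines, library)
```

In this sketch the length and location of each clustered line are simply `length(seg)` and `(seg.y, seg.x0)`. A real system would likely also handle vertical segments, skew, and scale differences between the scan and the templates.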
