PAGE CLASSIFIER ENGINE

  • US 20090144605A1
  • Filed: 12/03/2007
  • Published: 06/04/2009
  • Est. Priority Date: 12/03/2007
  • Status: Active Grant
First Claim

1. One or more computer-storage media having computer-executable instructions embodied thereon that, when executed, perform a method for classifying a page type of a portion of an electronic document, the method comprising:

    receiving an OCR file associated with the portion of the electronic document, wherein the OCR file includes semantic information about text in the portion of the electronic document;

    applying one or more features to the semantic information;

    based on the application of the one or more features to the semantic information, determining the page type of the portion of the electronic document; and

    storing an indication of the page type.
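
The four steps recited in the claim (receive, apply features, determine, store) can be illustrated with a minimal sketch. Nothing below comes from the patent itself: the JSON layout of the OCR file, the three features, the thresholds, the page-type labels, and every function name are assumptions chosen only to make the flow concrete.

```python
import json
import re


def load_ocr(raw: str) -> list:
    # Assumed OCR file format: {"lines": [{"text": str, "font_size": float}, ...]}.
    return json.loads(raw)["lines"]


def toc_line_ratio(lines: list) -> float:
    # Hypothetical feature: share of lines ending in a number,
    # as table-of-contents entries often do.
    if not lines:
        return 0.0
    hits = sum(1 for ln in lines if re.search(r"\d+\s*$", ln["text"]))
    return hits / len(lines)


def word_count(lines: list) -> int:
    # Hypothetical feature: total number of words on the page.
    return sum(len(ln["text"].split()) for ln in lines)


def max_font_size(lines: list) -> float:
    # Hypothetical feature: largest font size reported by the OCR engine.
    return max((ln["font_size"] for ln in lines), default=0.0)


FEATURES = {
    "toc_line_ratio": toc_line_ratio,
    "word_count": word_count,
    "max_font_size": max_font_size,
}


def apply_features(lines: list) -> dict:
    # Step 2: apply each feature to the semantic information.
    return {name: fn(lines) for name, fn in FEATURES.items()}


def determine_page_type(scores: dict) -> str:
    # Step 3: illustrative hand-written rules; the thresholds are arbitrary.
    if scores["toc_line_ratio"] >= 0.5:
        return "table_of_contents"
    if scores["word_count"] <= 20 and scores["max_font_size"] >= 20:
        return "title_page"
    return "body"


def classify_and_store(page_id: str, raw_ocr: str, store: dict) -> str:
    lines = load_ocr(raw_ocr)                 # step 1: receive the OCR file
    scores = apply_features(lines)            # step 2: apply features
    page_type = determine_page_type(scores)   # step 3: determine page type
    store[page_id] = page_type                # step 4: store an indication
    return page_type
```

Here `store` is a plain dictionary standing in for whatever persistence layer an actual system would use, and the rules in `determine_page_type` could equally be replaced by a trained model; the claim language does not specify either choice.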
