×

SYSTEM AND METHOD FOR IDENTIFYING DOCUMENT GENRES

  • US 20100284623A1
  • Filed: 05/07/2009
  • Published: 11/11/2010
  • Est. Priority Date: 05/07/2009
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method for generating genre models used to identify genres of a document, comprising:

  • on a computer system having one or more processors executing one or more programs stored on memory of the computer system;

    for each document image in a set of document images that are associated with one or more genres,segmenting the document image into a plurality of tiles, wherein the tiles in the plurality of tiles are sized so that document page features are identifiable; and

    computing features of the document image and the plurality of tiles; and

    training at least one genre classifier to classify document images as being associated with one or more genres based on the features of the document images in the set of document images, the features of the plurality of tiles of the set of documents images, and the one or more genres associated with each document image in the set of documents images.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×