×

Capture of stylized TV table data via OCR

  • US 8,763,038 B2
  • Filed: 01/26/2009
  • Issued: 06/24/2014
  • Est. Priority Date: 01/26/2009
  • Status: Active Grant
First Claim
Patent Images

1. A method of detecting text in a television video display table, comprising:

  • saving a frame of video representing an image to a memory device;

    determining that the frame of video contains a table having cells containing text;

    storing a working copy of the frame of video to a memory;

    isolating text in the table by;

    removing any table boundaries from the image;

    removing any cell boundaries from the image;

    determining if the image has three dimensional or shadow attributes in the table boundaries or cell boundaries and removing any three dimensional or shadow attributes identified, wherein determining if the image has three dimensional or shadow attributes in the table boundaries or cell boundaries is carried out by finding line patterns adjacent and outside table or cell boundaries that track a table or cell boundary;

    where determining if a cell has three dimensional or shadow attributes is carried out by subtracting a variable rectangular band of pixels from other cell values to see what size band maximizes cancellation in order to distinguish the cell from the text area;

    thereby producing text isolated against a contrasting color background; and

    processing the isolated text using an optical character recognition (OCR) engine to extract the text as data.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×