×

Methods and systems for analyzing data in media material having layout

  • US 7,801,358 B2
  • Filed: 11/03/2006
  • Issued: 09/21/2010
  • Est. Priority Date: 11/03/2006
  • Status: Expired due to Fees
First Claim
Patent Images

1. A computer-implemented method for analyzing data representative of media material having a layout, comprising:

  • identifying block segments associated with columnar body text in the media material; and

    determining which of the identified block segments belong to one or more articles in the media material based on language statistics information and layout information,wherein the data representative of media material comprises pixel data of an image of the media material, and the block segment identifying includes analyzing the pixel data to identify regions having similar pixel value change complexity,wherein the data representative of media material further includes text data representing text in the media material, and the block segment identifying includes a step of associating the text data with corresponding image regions identified as having similar pixel value change complexity based on the location of the text data and the corresponding regions in the media material, andwherein the text data associating step includes;

    mapping words found in the text data to an initial set of the corresponding image regions identified as having similar pixel value change complexity; and

    adjusting the initial set of image regions to obtain a final set of image regions to the regions based on the distribution of words in the word mapping.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×