×

Automatic classification of segmented portions of web pages

  • US 9,514,216 B2
  • Filed: 09/08/2014
  • Issued: 12/06/2016
  • Est. Priority Date: 08/10/2009
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method comprising:

  • with one or more special purpose computing devices coupled to a memory;

    accessing a plurality of segmented portions of at least one of a plurality of displayable web pages represented by one or more digital signals of one or more files stored in a memory, wherein a particular displayable web page of the plurality of displayable web pages comprises at least two of the plurality of segmented portions;

    using one or more machine learned models for;

    identifying one or more feature properties of the plurality of segmented portions within the one or more files, or otherwise inferable from the one or more files,classifying the at least two of the plurality of segmented portions as being at least one of a plurality of segment types based, at least in part, on the one or more identified feature properties, the one or more identified feature properties comprising at least language feature properties of a language model of content to be displayed in one or more of the at least two of the plurality of segmented portions, anddetermining content quality scores for at least two of the plurality of segmented portions of at least the particular displayable web page; and

    storing one or more digital signals in the memory as part of an index for the plurality of segmented portions, the index being based, at least in part, on the segment type, the index indicating the content quality scores.

View all claims
  • 9 Assignments
Timeline View
Assignment View
    ×
    ×