×

Content identification

  • US 8,788,503 B1
  • Filed: 09/25/2013
  • Issued: 07/22/2014
  • Est. Priority Date: 10/17/2007
  • Status: Active Grant
First Claim
Patent Images

1. A method, comprising:

  • generating, by a device comprising a processor, a plurality of distinct clusters from training content, wherein the plurality of clusters represent features of content items in the training content;

    identifying one or more conjunctions of the plurality of distinct clusters based on a respective probability of observing a feature of a cluster of the one or more conjunctions in a collection of the content items;

    scoring an identified one or more conjunctions based on a conditional probability that the identified one or more conjunctions is associated with a label;

    selecting, as a current conjunction, one of the scored identified one or more conjunctions that has a score that meets a defined condition; and

    generating one or more higher-order child conjunctions for the current conjunction, wherein at least one of the one or more higher-order child conjunctions is a conjunction of the conjoined clusters of the current conjunction with one or more additional clusters not included in the conjoined clusters, and wherein the generating the one or more higher-order child conjunctions is performed if a stopping condition has not been reached.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×