Method and system for characterizing web content

  • US 9,213,767 B2
  • Filed: 08/10/2009
  • Issued: 12/15/2015
  • Est. Priority Date: 08/10/2009
  • Status: Active Grant
First Claim

1. A method of processing Web activity data, comprising:

  • obtaining, by a computer processor, a database of Website organizational data at a networked computer from a plurality of Websites, the Website organizational data comprising a plurality of Website-specific category names and corresponding item identifiers, wherein each of the plurality of Website-specific category names is associated with one or more of the corresponding item identifiers and each item identifier uniquely identifying a particular item within the plurality of Websites;

    generating, by the computer processor, a data structure in the networked computer from the database of Website organizational data, the data structure comprising a matrix including a plurality of entries, wherein each entry of the matrix comprising one of the corresponding item identifiers on a first axis of the matrix and one of the plurality of Website-specific category names on a second axis of the matrix, and each entry in the matrix indicates that the one of the corresponding item identifiers is organized under the one of the plurality of Website-specific category names;

    generating, by the computer processor, a reduced-rank classification structure in the networked computer from the data structure via a matrix decomposition including at least a singular-value decomposition of the data structure, wherein the reduced-rank classification structure combines the plurality of the Website-specific category names from multiple Websites of the plurality of Websites into a single combined category-name grouping used to categorize content of the multiple Websites; and

    storing, by the computer processor, the reduced-rank classification structure at the networked computer.
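
The steps recited in the claim (an item-by-category matrix, a singular-value decomposition, and a reduced-rank structure that merges category names from different Websites) can be illustrated with a small sketch. This is not the patented implementation: the sample records, the function names `build_matrix`, `reduced_rank_embeddings`, and `group_categories`, the choice of rank `k`, and the use of cosine similarity with a 0.9 threshold to form groupings are all assumptions made for illustration. The claim itself only requires that the decomposition include an SVD.

```python
import numpy as np

# Hypothetical organizational data: (website, category name, item identifier).
# Item identifiers are assumed to be shared across Websites (e.g. a product code).
records = [
    ("siteA", "Laptops", "item1"),
    ("siteA", "Laptops", "item2"),
    ("siteB", "Notebook Computers", "item1"),
    ("siteB", "Notebook Computers", "item2"),
    ("siteA", "Garden", "item3"),
    ("siteA", "Garden", "item4"),
    ("siteB", "Outdoor & Patio", "item3"),
    ("siteB", "Outdoor & Patio", "item4"),
]


def build_matrix(records):
    """Build the item-by-category matrix: rows are items, columns are categories."""
    items = sorted({item for _, _, item in records})
    cats = sorted({(site, cat) for site, cat, _ in records})
    row = {item: i for i, item in enumerate(items)}
    col = {cat: j for j, cat in enumerate(cats)}
    m = np.zeros((len(items), len(cats)))
    for site, cat, item in records:
        # A nonzero entry means the item is organized under that category.
        m[row[item], col[(site, cat)]] = 1.0
    return m, items, cats


def reduced_rank_embeddings(m, k):
    """Return one k-dimensional vector per category from a truncated SVD."""
    _, s, vt = np.linalg.svd(m, full_matrices=False)
    # Keep the k largest singular values; scale the right singular vectors by them.
    return vt[:k].T * s[:k]


def group_categories(emb, cats, threshold=0.9):
    """Greedily group categories whose reduced-rank vectors point the same way."""
    norms = np.linalg.norm(emb, axis=1, keepdims=True)
    unit = emb / np.where(norms == 0, 1.0, norms)
    sim = unit @ unit.T  # pairwise cosine similarity
    groups, assigned = [], set()
    for j in range(len(cats)):
        if j in assigned:
            continue
        members = [i for i in range(len(cats))
                   if i not in assigned and sim[j, i] >= threshold]
        assigned.update(members)
        groups.append([cats[i] for i in members])
    return groups


if __name__ == "__main__":
    m, items, cats = build_matrix(records)
    emb = reduced_rank_embeddings(m, k=2)
    for group in group_categories(emb, cats):
        print(group)
```

In this toy data the two Websites file the same items under differently named categories, so the sketch groups "Laptops" with "Notebook Computers" and "Garden" with "Outdoor & Patio". Real data would be sparse and noisy, and both the rank and the grouping rule would need tuning.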

  • 8 Assignments