Method and system for characterizing web content
First Claim
Patent Images
1. A method of processing Web activity data, comprising:
- obtaining, by a computer processor, a database of Website organizational data at a networked computer from a plurality of Websites, the Website organizational data comprising a plurality of Website-specific category names and corresponding item identifiers, wherein each of the plurality of Website-specific category names is associated with one or more of the corresponding item identifiers and each item identifier uniquely identifying a particular item within the plurality of Websites;
generating, by the computer processor, a data structure in the networked computer from the database of Website organizational data, the data structure comprising a matrix including a plurality of entries, whereineach entry of the matrix comprising one of the corresponding item identifiers on a first axis of the matrix and one of the plurality of Website-specific category names on a second axis of the matrix, andeach entry in the matrix indicates that the one of the corresponding item identifiers is organized under the one of the plurality of Website-specific category names;
generating, by the computer processor, a reduced-rank classification structure in the networked computer from the data structure via a matrix decomposition including at least a singular-value decomposition of the data structure, wherein the reduced-rank classification structure combines the plurality of the Website-specific category names from multiple Websites of the plurality Websites into a single combined category-name grouping used to categorize content of the multiple Websites; and
storing, by the computer processor, the reduced-rank classification structure at the networked computer.
8 Assignments
0 Petitions
Accused Products
Abstract
An exemplary embodiment of the present invention provides a method of processing Web activity data. The method includes obtaining a database of Website organizational data. The method also includes generating a data structure from the database of Website organizational data comprising an Item identifier and a Website category corresponding to the item identifier. The method also includes generating a reduced-rank classification structure from the data structure, the reduced-rank classification structure including a category grouping corresponding to one or more of the Website categories.
-
Citations
17 Claims
-
1. A method of processing Web activity data, comprising:
-
obtaining, by a computer processor, a database of Website organizational data at a networked computer from a plurality of Websites, the Website organizational data comprising a plurality of Website-specific category names and corresponding item identifiers, wherein each of the plurality of Website-specific category names is associated with one or more of the corresponding item identifiers and each item identifier uniquely identifying a particular item within the plurality of Websites; generating, by the computer processor, a data structure in the networked computer from the database of Website organizational data, the data structure comprising a matrix including a plurality of entries, wherein each entry of the matrix comprising one of the corresponding item identifiers on a first axis of the matrix and one of the plurality of Website-specific category names on a second axis of the matrix, and each entry in the matrix indicates that the one of the corresponding item identifiers is organized under the one of the plurality of Website-specific category names; generating, by the computer processor, a reduced-rank classification structure in the networked computer from the data structure via a matrix decomposition including at least a singular-value decomposition of the data structure, wherein the reduced-rank classification structure combines the plurality of the Website-specific category names from multiple Websites of the plurality Websites into a single combined category-name grouping used to categorize content of the multiple Websites; and storing, by the computer processor, the reduced-rank classification structure at the networked computer. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A computer system, comprising:
-
a computer processor to execute computer-readable instructions; a storage device to store a database of Website organizational data received from a plurality of Websites, the Website organizational data comprising a plurality of Website-specific category names and corresponding item identifiers, wherein each of the plurality of Website-specific category names is associated with one or more of the corresponding item identifiers and each item identifier uniquely identifying a particular item within the plurality of Websites; and a memory device that stores instructions, executed by the computer processor, to direct the computer processor to; generate a data structure from the database of Website organizational data, the data structure comprising a matrix including a plurality of entries, wherein each entry of the matrix comprising one of the corresponding item identifiers on a first axis of the matrix and one of the plurality of Website-specific category names on a second axis of the matrix, and each entry in the matrix indicates that the one of the corresponding item identifiers is organized under the one of the plurality of Website-specific category names; and generate a reduced-rank classification structure from the data structure via a matrix decomposition including at least a singular-value decomposition of the data structure, wherein the reduced-rank classification structure combines the plurality of the Website-specific category names from multiple Websites of the plurality Websites into a single combined category-name grouping used to categorize content of the multiple Websites; and store the reduced-rank classification structure at the storage device. - View Dependent Claims (10, 11, 12, 13, 14, 15)
-
-
16. A non-transitory computer-readable medium, comprising code configured to direct a computer processor to:
-
obtain a database of Website organizational data at a networked computer from a plurality of Websites, the Website organizational data comprising a plurality of Website-specific category names and corresponding item identifiers, wherein each of the plurality of Website-specific category names is associated with one or more of the corresponding item identifiers and each item identifier uniquely identifying a particular item within the plurality of Websites; generate a data structure in the networked computer from the database of Website organizational data, the data structure comprising a matrix including a plurality of entries, wherein each entry of the matrix comprising one of the corresponding item identifiers on a first axis of the matrix and one of the plurality of Website-specific category names on a second axis of the matrix, and each entry in the matrix indicates that the one of the corresponding item identifiers is organized under the one of the plurality Website-specific category names; and generate a reduced-rank classification structure in the networked computer from the data structure via a matrix decomposition including at least a singular-value decomposition of the data structure, wherein the reduced-rank classification structure combines the plurality of the Website specific category names from multiple Websites of the plurality Websites into a single combined category-name grouping used to categorize content of the multiple Websites; and storing, by the computer processor, the reduced-rank classification structure at the networked computer. - View Dependent Claims (17)
-
Specification