Personalization engine for characterizing a document
First Claim
1. A method executed by one or more computing devices for characterizing a document, the method comprising:
generating, by at least one of the one or more computing devices, one or more sets of taxonomic nouns corresponding to a document, wherein the one or more sets of taxonomic nouns comprise one or more of:
a set of first taxonomic nouns based upon classification information generated by an author of the document;
a set of second taxonomic nouns based upon one or more user-generated tags characterizing at least a portion of the document;
a set of third taxonomic nouns based upon one or more search terms utilized to access the document;
or a set of fourth taxonomic nouns based upon the attributes related to a method of access of the document;
generating, by at least one of the one or more computing devices, a set of fifth taxonomic nouns by processing the document based upon one or more pattern rules and a dictionary of known terms, wherein the one or more pattern rules specify a method of extracting terms from the document; and
aggregating, by at least one of the one or more computing devices, at least one of the one or more sets of taxonomic nouns with the set of fifth taxonomic nouns into a composite set of taxonomic nouns; and
characterizing, by at least one of the one or more computing devices, the document based on the composite set of taxonomic nouns.
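The steps of the claim can be illustrated with a minimal Python sketch. It is not the patented implementation: the pattern rule, the dictionary contents, the function names, and the choice to characterize a document by its sorted composite set are all assumptions made for illustration.

```python
import re

# Hypothetical pattern rule: treat each alphabetic word as a candidate term.
PATTERN_RULES = [re.compile(r"[A-Za-z]+")]

# Hypothetical dictionary of known terms, mapping a term to a taxonomic noun.
KNOWN_TERMS = {"guitar": "music", "jazz": "music", "marathon": "running"}


def extract_fifth_set(text, pattern_rules=PATTERN_RULES, known_terms=KNOWN_TERMS):
    """Apply each pattern rule, then keep only candidates found in the dictionary."""
    nouns = set()
    for rule in pattern_rules:
        for candidate in rule.findall(text):
            noun = known_terms.get(candidate.lower())
            if noun is not None:
                nouns.add(noun)
    return nouns


def aggregate(other_sets, fifth_set):
    """Union the supplied sets with the fifth set into a composite set."""
    composite = set(fifth_set)
    for nouns in other_sets:
        composite |= set(nouns)
    return composite


def characterize(composite):
    """Assumed characterization: the sorted composite set serves as the labels."""
    return sorted(composite)


author_nouns = {"music"}      # first set: author classification information
tag_nouns = {"concert"}       # second set: user-generated tags
search_nouns = {"festival"}   # third set: search terms used to reach the document
access_nouns = {"mobile"}     # fourth set: attributes of the method of access

fifth = extract_fifth_set("A jazz guitar trio played after the marathon.")
labels = characterize(
    aggregate([author_nouns, tag_nouns, search_nouns, access_nouns], fifth)
)
print(labels)  # ['concert', 'festival', 'mobile', 'music', 'running']
```

The claim only requires that at least one of the first four sets be aggregated with the fifth, so `aggregate` accepts any number of them.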
Abstract
A dynamic classification dictionary is built for use in profiling and targeting users for additional relevant content. Behavioral data is gathered from user activity, and user documents and actions are categorized. Author-generated document classification information is analyzed and assigned a first taxonomic noun to characterize the document. User-generated tags characterizing a portion of the document are assigned a second taxonomic noun. Search terms that resulted in the user accessing the document are identified and assigned a third taxonomic noun. Attributes related to the manner in which the document was accessed are evaluated and assigned a fourth taxonomic noun. The document is processed using pattern rules to extract a fifth taxonomic noun. The taxonomic nouns are aggregated into a composite set of taxonomic nouns, and the dynamic classification dictionary is built by storing the composite set of taxonomic nouns.
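As a rough illustration of how storing composite sets could build up such a dictionary, the sketch below keeps a mapping from each taxonomic noun to the documents it characterizes. The class name, its methods, and the count-based user profile are hypothetical. The abstract does not specify a storage layout or a profiling formula.

```python
from collections import Counter, defaultdict


class DynamicClassificationDictionary:
    """Hypothetical store mapping each taxonomic noun to the documents it characterizes."""

    def __init__(self):
        self.noun_to_docs = defaultdict(set)

    def store(self, doc_id, composite_nouns):
        """Record a document's composite set of taxonomic nouns."""
        for noun in composite_nouns:
            self.noun_to_docs[noun].add(doc_id)

    def nouns_for(self, doc_id):
        """Return every noun stored for the given document."""
        return {n for n, docs in self.noun_to_docs.items() if doc_id in docs}


def profile_user(dictionary, accessed_doc_ids):
    """Assumed profiling step: count nouns over the documents a user accessed."""
    counts = Counter()
    for doc_id in accessed_doc_ids:
        counts.update(dictionary.nouns_for(doc_id))
    return counts


d = DynamicClassificationDictionary()
d.store("doc-1", {"music", "festival"})
d.store("doc-2", {"music", "running"})
profile = profile_user(d, ["doc-1", "doc-2"])
print(profile.most_common(1))  # [('music', 2)]
```

The dictionary grows as each new composite set is stored, which is one plausible reading of "dynamic" here.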
18 Claims
1. A method executed by one or more computing devices for characterizing a document, the method comprising:
generating, by at least one of the one or more computing devices, one or more sets of taxonomic nouns corresponding to a document, wherein the one or more sets of taxonomic nouns comprise one or more of:
a set of first taxonomic nouns based upon classification information generated by an author of the document;
a set of second taxonomic nouns based upon one or more user-generated tags characterizing at least a portion of the document;
a set of third taxonomic nouns based upon one or more search terms utilized to access the document;
or a set of fourth taxonomic nouns based upon the attributes related to a method of access of the document;
generating, by at least one of the one or more computing devices, a set of fifth taxonomic nouns by processing the document based upon one or more pattern rules and a dictionary of known terms, wherein the one or more pattern rules specify a method of extracting terms from the document; and
aggregating, by at least one of the one or more computing devices, at least one of the one or more sets of taxonomic nouns with the set of fifth taxonomic nouns into a composite set of taxonomic nouns; and
characterizing, by at least one of the one or more computing devices, the document based on the composite set of taxonomic nouns.
Dependent claims: 2, 3, 4, 5, 6
7. A system for characterizing a document, the system comprising:
one or more processors; and
one or more memories operatively coupled to at least one of the one or more processors and having instructions stored thereon that, when executed by at least one of the one or more processors, cause at least one of the one or more processors to:
generate one or more sets of taxonomic nouns corresponding to a document, wherein the one or more sets of taxonomic nouns comprise one or more of:
a set of first taxonomic nouns based upon classification information generated by an author of the document;
a set of second taxonomic nouns based upon one or more user-generated tags characterizing at least a portion of the document;
a set of third taxonomic nouns based upon one or more search terms utilized to access the document;
or a set of fourth taxonomic nouns based upon the attributes related to a method of access of the document;
generate a set of fifth taxonomic nouns by processing the document based upon one or more pattern rules and a dictionary of known terms, wherein the one or more pattern rules specify a method of extracting terms from the document; and
aggregate at least one of the one or more sets of taxonomic nouns with the set of fifth taxonomic nouns into a composite set of taxonomic nouns; and
characterize the document based on the composite set of taxonomic nouns.
Dependent claims: 8, 9, 10, 11, 12
13. At least one non-transitory computer-readable medium storing computer-readable instructions that, when executed by one or more computing devices, cause at least one of the one or more computing devices to:
generate one or more sets of taxonomic nouns corresponding to a document, wherein the one or more sets of taxonomic nouns comprise one or more of:
a set of first taxonomic nouns based upon classification information generated by an author of the document;
a set of second taxonomic nouns based upon one or more user-generated tags characterizing at least a portion of the document;
a set of third taxonomic nouns based upon one or more search terms utilized to access the document;
or a set of fourth taxonomic nouns based upon the attributes related to a method of access of the document;
generate a set of fifth taxonomic nouns by processing the document based upon one or more pattern rules and a dictionary of known terms, wherein the one or more pattern rules specify a method of extracting terms from the document; and
aggregate at least one of the one or more sets of taxonomic nouns with the set of fifth taxonomic nouns into a composite set of taxonomic nouns; and
characterize the document based on the composite set of taxonomic nouns.
Dependent claims: 14, 15, 16, 17, 18
Specification