Systems and methods for inferring concepts for association with content
First Claim
1. A method, performed by one or more server devices, comprising:
- retrieving, by a processor of the one or more server devices, a document;
- indirectly inferring, by a processor of the one or more server devices, concepts associated with the document, where the concepts are not directly inferred from characteristics of the document and where indirectly inferring concepts associated with the document comprises:
  - identifying, by a processor of the one or more server devices, a plurality of second documents that each include a link pointing to the document,
  - determining, by a processor of the one or more server devices, which second document of the plurality of the second documents has been frequently used to access the document,
  - extracting, by a processor of the one or more server devices, concepts associated with the second document that has been frequently used to access the document,
  - determining, by a processor of the one or more devices, which advertisements contained in the document have been accessed from the document more than a threshold number of times,
  - extracting, by a processor of the one or more devices, a uniform resource locator (URL) associated with each of the advertisements that have been accessed from the document more than the threshold number of times,
  - retrieving, by a processor of the one or more devices, third documents corresponding to each of the URLs, and
  - extracting, by a processor of the one or more devices, concepts from each of the third documents; and
- associating, by a processor of the one or more server devices, the inferred concepts with the document.
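The steps of claim 1 can be sketched roughly as follows. This is a minimal illustration in Python, not the patented implementation. The `Page` record, the `referrer_hits` and `ad_clicks` mappings, the `fetch` callback, the word-length heuristic in `extract_concepts`, and both threshold values are assumptions introduced here; the claim does not define how "frequently used" is measured, so a simple minimum hit count stands in for it.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Set


@dataclass
class Page:
    """Hypothetical record for a document (not defined in the claim)."""
    url: str
    text: str
    links: List[str] = field(default_factory=list)


def extract_concepts(text: str) -> Set[str]:
    """Placeholder extractor: every word longer than four letters is a concept."""
    words = (w.lower().strip(".,;:") for w in text.split())
    return {w for w in words if len(w) > 4}


def infer_concepts(
    doc: Page,
    corpus: List[Page],
    referrer_hits: Dict[str, int],   # assumed: referrer URL -> visits sent to doc
    ad_clicks: Dict[str, int],       # assumed: ad target URL -> clicks from doc
    fetch: Callable[[str], Page],    # assumed: retrieves a page by URL
    referrer_min_hits: int = 100,    # assumed cutoff for "frequently used"
    ad_click_threshold: int = 50,    # assumed "threshold number of times"
) -> Set[str]:
    concepts: Set[str] = set()

    # Identify second documents that include a link pointing to the document.
    referrers = [p for p in corpus if doc.url in p.links]

    # Extract concepts from referrers frequently used to access the document.
    for ref in referrers:
        if referrer_hits.get(ref.url, 0) >= referrer_min_hits:
            concepts |= extract_concepts(ref.text)

    # For ads accessed more than the threshold, retrieve the third document
    # behind each ad URL and extract its concepts.
    for ad_url, clicks in ad_clicks.items():
        if clicks > ad_click_threshold:
            concepts |= extract_concepts(fetch(ad_url).text)

    # doc.text is never read: the inference is indirect by construction.
    return concepts
```

The final associating step could then be as simple as storing the result in a mapping keyed by `doc.url`.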
Abstract
A system indirectly infers concepts associated with a document. The concepts may be indirectly inferred based on information that does not include characteristics of the document, such as the characteristics that include a textual content of the document not associated with links included in the document, a domain of the document, and the document's Uniform Resource Locator (URL). The system may label the inferred concepts as useful to an audience of the document.
11 Claims
1. A method, performed by one or more server devices (full text under First Claim above). Dependent claims: 2, 3, 4, 5, 6, 7, 8.
9. A computer-readable memory device containing instructions for controlling at least one processor to perform a method of inferring concepts associated with a document, the method comprising:
- retrieving the document;
- indirectly inferring concepts associated with the document, where indirectly inferring concepts associated with the document comprises:
  - identifying a plurality of second documents that each include a link pointing to the document,
  - determining which second document of the plurality of second documents has been most frequently used to access the document,
  - extracting concepts associated with the second document that has been most frequently used to access the documents,
  - determining which advertisements contained in the document have been accessed from the document more than a threshold number of times,
  - extracting a uniform resource locator (URL) associated with each of the advertisements that have been accessed from the document more than the threshold number of times,
  - retrieving third documents corresponding to each of the URLs, and
  - extracting concepts from each of the third documents; and
- associating the inferred concepts with the document.

Dependent claims: 10, 11.
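Claim 9 selects the single second document that has been most frequently used to access the document. One way to picture that selection is sketched below, assuming accesses are recorded as (referrer URL, visited URL) pairs. The log format and the function name `most_used_referrer` are hypothetical and do not come from the claim.

```python
from collections import Counter
from typing import Iterable, Optional, Set, Tuple


def most_used_referrer(
    access_log: Iterable[Tuple[str, str]],
    target_url: str,
    linking_urls: Set[str],
) -> Optional[str]:
    """Return the linking document most often used to reach target_url.

    access_log holds (referrer_url, visited_url) pairs; this log format is
    an assumption, not something the claim specifies.
    """
    hits = Counter(
        referrer
        for referrer, visited in access_log
        # Count only visits to the target that came from a linking document.
        if visited == target_url and referrer in linking_urls
    )
    if not hits:
        return None
    return hits.most_common(1)[0][0]
```

Concepts would then be extracted from the returned document only, rather than from every referrer that clears a cutoff.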
Specification