Embedded communication of link information
First Claim
1. A computer-implemented method of processing documents, performed by a computer system having one or more processors and memory storing one or more programs for execution by the one or more processors, the method comprising:
- receiving a document in a search engine crawler, the document having a first link tag embedded in the document, the first link tag including a location value and one or more information pairs that are distinct from the location value, wherein a respective information pair has a respective parameter and a corresponding parameter value;
selecting a method of processing content, wherein the content is specified by the location value of the first link tag and the selected method of processing is in accordance with one or more of the one or more information pairs of the first link tag;
retrieving the content specified by the location value of the first link tag; and
processing the retrieved content specified by the first link tag in accordance with the selected method.
1 Assignment
0 Petitions
Accused Products
Abstract
A method of processing documents is described. The method includes the operation of receiving a document in a search engine crawler. The document includes an embedded first link tag. The first link tag includes one or more information pairs. A respective information pair includes a respective parameter and a corresponding value. The parameters in the one or more information pairs may correspond to content at one or more content locations or one or more document locations. The method also includes selecting a method of processing content associated with the first link tag in accordance with one or more of the information pairs.
-
Citations
24 Claims
-
1. A computer-implemented method of processing documents, performed by a computer system having one or more processors and memory storing one or more programs for execution by the one or more processors, the method comprising:
-
receiving a document in a search engine crawler, the document having a first link tag embedded in the document, the first link tag including a location value and one or more information pairs that are distinct from the location value, wherein a respective information pair has a respective parameter and a corresponding parameter value; selecting a method of processing content, wherein the content is specified by the location value of the first link tag and the selected method of processing is in accordance with one or more of the one or more information pairs of the first link tag; retrieving the content specified by the location value of the first link tag; and processing the retrieved content specified by the first link tag in accordance with the selected method. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A non-transitory computer readable storage medium storing one or more programs to be executed by a computer system, the one or more programs comprising:
-
instructions for receiving a document in a search engine crawler, the document having a first link tag embedded in the document, the first link tag including a location value and one or more information pairs that are distinct from the location value, wherein a respective information pair has a respective parameter and a corresponding parameter value; instructions for selecting a method of processing content, wherein the content is specified by the location value of the first link tag and the selected method of processing is in accordance with one or more of the one or more information pairs of the first link tag; instructions for retrieving the content specified by the location value of the first link tag; and instructions for processing the retrieved content specified by the first link tag in accordance with the selected method. - View Dependent Claims (14, 15)
-
-
16. A non-transitory computer readable storage medium storing one or more programs to be executed by a computer system, the one or more programs comprising:
-
web crawling instructions to identify a set of documents to be retrieved and processed, wherein a document of the set of documents has an embedded first link tag, the first link tag including a location value and one or more information pairs that are distinct from the location value, a respective information pair having a respective parameter and a corresponding parameter value, and instructions to process content specified by the first link tag, including instructions to select a method of processing the content, wherein the content is specified by the location value of the first link tag and the selected method of processing is in accordance with one or more of the one or more information pairs of the first link tag, and instructions to process the content in accordance with the selected method. - View Dependent Claims (17, 18)
-
-
19. A computer system, comprising:
-
memory; one or more processors; and one or more programs, stored in the memory and executed by the one or more processors, the one or more programs including; web crawling instructions to identify a set of documents to be retrieved and processed, wherein at least one document has an embedded first link tag, the first link tag including a location value and one or more information pairs that are distinct from the location value, a respective information pair having a respective parameter and a corresponding parameter value, and instructions to process content specified by the first link tag, including instructions to select a method of processing the content, wherein the content is specified by the location value of the first link tag and the selected method of processing is in accordance with one or more of the one or more information pairs of the first link tag, and instructions to process the content in accordance with the selected method. - View Dependent Claims (20, 21)
-
-
22. A non-transitory computer readable storage medium storing one or more programs to be executed by a computer system, the one or more programs comprising:
-
instructions to generate a link tag, the link tag including a location value and one or more information pairs that are distinct from the location value, wherein a respective information pair has a respective parameter and a corresponding parameter value; and instructions to embed the link tag in the document; wherein the value in the embedded link tag specifies a method of processing content by a web crawler so as to modify information associated with the content, wherein the content to be processed is specified by the location value of the embedded link tag and the method of processing is in accordance with the respective parameter value in one or more of the one or more information pairs of the embedded link tag. - View Dependent Claims (23, 24)
-
Specification