Proxy server using a statistical model
First Claim
1. A computer-implemented method for selectively accessing a document in response to a current retrieval request, the document being identified by a document address specification, the document having been retrieved during a previous retrieval request, the method comprising:
- determining whether to access the document during the current retrieval request by identifying with the aid of a statistical model whether the document is likely to have changed since a previous retrieval request; and
accessing the document if the determination produces an instruction indicative that the document at the document address specification should be accessed during the current retrieval request, wherein determining whether to access the document during the current retrieval request comprises computing a probability that the document is likely to have changed since a previous retrieval request, and further wherein computing the probability that the document is likely to have changed since a previous retrieval request comprises;
selecting an active probability indicative of a proportion of documents in a plurality of documents that are changing at various change rates, the plurality of documents including the document, training the active probability to reflect an experience with the document during a plurality of previous document retrieval requests, and using the trained active probability to compute the probability that the document has changed since a previous retrieval request.
3 Assignments
0 Petitions
Accused Products
Abstract
A computer based system and method of determining whether to re-fetch a previously retrieved document across a computer network is disclosed. The method utilizes a statistical model to determine whether the previously retrieved document likely changed since last accessed. The statistical model is continuously improving its accuracy by training internal probability distributions to reflect the actual experience with change rate patterns of the documents accessed. The decision of whether to access the document is based on the probability of change compared against a desired synchronization level, random selections, maximum limits on the amount of time since the document was last accessed, and other criterion. Once the decision to access is made, the document is checked for changes and this information is used to train the statistical model.
73 Citations
15 Claims
-
1. A computer-implemented method for selectively accessing a document in response to a current retrieval request, the document being identified by a document address specification, the document having been retrieved during a previous retrieval request, the method comprising:
-
determining whether to access the document during the current retrieval request by identifying with the aid of a statistical model whether the document is likely to have changed since a previous retrieval request; and
accessing the document if the determination produces an instruction indicative that the document at the document address specification should be accessed during the current retrieval request, wherein determining whether to access the document during the current retrieval request comprises computing a probability that the document is likely to have changed since a previous retrieval request, and further wherein computing the probability that the document is likely to have changed since a previous retrieval request comprises;
selecting an active probability indicative of a proportion of documents in a plurality of documents that are changing at various change rates, the plurality of documents including the document, training the active probability to reflect an experience with the document during a plurality of previous document retrieval requests, and using the trained active probability to compute the probability that the document has changed since a previous retrieval request. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer-readable medium having computer-executable instructions for retrieving one document in a plurality of documents from a remote server, which when executed comprise:
-
maintaining historical information representing prior changes to the one document at the remote server;
initiating a document retrieval request procedure for retrieving particular documents in the plurality of documents;
determining whether to access the one document from the remote server based on an analysis of the historical information representing prior changes to the one document at the remote server; and
if the determination to access the one document is positive, identifying the one document for retrieval during the document retrieval procedure, wherein determining whether to retrieve the document further comprises;
computing a probability that the one document has changed since the one document was last retrieved from the remote server, and further wherein computing the probability that the one document has changed comprises;
beginning with a probability that a pre-defined proportion of documents in the plurality of documents has changed, and training the probability that the pre-defined proportion of documents has changed using the historical information associated with the one document to achieve the probability that the one document has changed since the one document was last retrieved. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15)
-
Specification