System, method and computer program product for creating a description for a document of a remote network data source for later identification of the document and identifying the document utilizing a description
First Claim
1. A method for creating a description of a document of a remote network data source for later identification of the document, comprising:
- (a) receiving information from a user about a document on a remote network data site;
(b) creating a document identifier based on the user-input information, wherein the document identifier identifies the particular document;
(c) retrieving a markup language description defining properties of elements of a document in a markup language;
(d) analyzing the document and the content of the document utilizing the document identifier and the markup language description;
(e) generating a description of the document based on the analysis; and
(f) storing the document description.
3 Assignments
0 Petitions
Accused Products
Abstract
A system, method and computer program product are provided for creating a description of a document of a remote network data source for later identification of the document. Information about a document on a remote network data site is received from a user. A document identifier is created based on the user-input information. The document identifier identifies the particular document. A markup language description is retrieved. The markup language description defines properties of elements of a document in a markup language. The document and the content of the document are analyzed utilizing the document identifier and the markup language description. A description of the document is generated based on the analysis. The document description is stored. A system, method and computer program product are also provided for identifying a document. A document is received. Document descriptions of several documents are also received. The document descriptions are compared with the document. A document recognition score is calculated for each of the document descriptions based on a likelihood that the document description matches the document. A document description is selected based at least in part on the document recognition scores. The document is identified based on the selected document description. A system, method and computer program product are provided for identifying documents. A document is analyzed. A description of the document is created based on the analysis. The document is recognized utilizing the document description. A determination is made as to whether the document is in a list of pre-identified documents.
-
Citations
45 Claims
-
1. A method for creating a description of a document of a remote network data source for later identification of the document, comprising:
-
(a) receiving information from a user about a document on a remote network data site;
(b) creating a document identifier based on the user-input information, wherein the document identifier identifies the particular document;
(c) retrieving a markup language description defining properties of elements of a document in a markup language;
(d) analyzing the document and the content of the document utilizing the document identifier and the markup language description;
(e) generating a description of the document based on the analysis; and
(f) storing the document description. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A computer program product for creating a description of a document of a remote network data source for later identification of the document, comprising:
-
(a) computer code for receiving information from a user about a document on a remote network data site;
(b) computer code for creating a document identifier based on the user-input information, wherein the document identifier identifies the particular document;
(c) computer code for retrieving a markup language description defining properties of elements of a document in a markup language;
(d) computer code for analyzing the document and the content of the document utilizing the document identifier and the markup language description;
(e) computer code for generating a description of the document based on the analysis; and
(f) computer code for storing the document description. - View Dependent Claims (13, 14, 15, 16, 17, 18)
-
-
19. A system for creating a description of a document of a remote network data source for later identification of the document, comprising:
-
(a) logic for receiving information from a user about a document on a remote network data site;
(b) logic for creating a document identifier based on the user-input information, wherein the document identifier identifies the particular document;
(c) logic for retrieving a markup language description defining properties of elements of a document in a markup language;
(d) logic for analyzing the document and the content of the document utilizing the document identifier and the markup language description;
(e) logic for generating a description of the document based on the analysis; and
(f) logic for storing the document description.
-
-
20. A method for creating a description of content of a remote network data source for later identification of the content, comprising:
-
(a) receiving information from a user about content on a remote network data site;
(b) creating a content identifier based on the user-input information, wherein the content identifier identifies the particular content;
(c) retrieving a markup language description defining properties of elements of the content in a markup language;
(d) analyzing the content utilizing the content identifier and the markup language description;
(e) generating a description of the content based on the analysis; and
(f) storing the content description. - View Dependent Claims (21, 22, 23, 24)
-
-
25. A method for identifying a document, comprising:
-
(a) receiving a document;
(b) receiving document descriptions of several documents;
(c) comparing the document descriptions with the document;
(d) calculating a document recognition score for each of the document descriptions based on a likelihood that the document description matches the document;
(e) selecting a document description based at least in part on the document recognition scores; and
(f) identifying the document based on the selected document description. - View Dependent Claims (26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42)
-
-
43. A computer program product for identifying a document, comprising:
-
(a) computer code for receiving a document;
(b) computer code for receiving document descriptions of several documents;
(c) computer code for comparing the document descriptions with the document;
(d) computer code for calculating a document recognition score for each of the document descriptions based on a likelihood that the document description matches the document;
(e) computer code for selecting a document description based at least in part on the document recognition scores; and
(f) computer code for identifying the document based on the selected document description.
-
-
44. A method for identifying content, comprising:
-
(a) receiving several content elements;
(b) receiving a content description of a desired content element;
(c) comparing the content description with the received content elements;
(d) calculating a content recognition score for each of the content elements based on a likelihood that the content description matches the content element; and
(e) selecting a matching content based at least in part on the content recognition scores.
-
-
45. A method for creating a description of a document of a remote network data source for later identification of the document, comprising:
-
(a) receiving information from a user about a document on a remote network data site, wherein the information received from the user includes at least one of;
an identification of content of interest in the document, guidelines for recognizing a document, and guidelines for recognizing content elements of interest;
(b) creating a document identifier based on the user-input information, wherein the document identifier identifies the particular document;
(c) retrieving a markup language description defining properties of elements of a document in a markup language;
(d) comparing the document to at least one other document utilizing the document identifier and the markup language description;
(e) analyzing the content of the document utilizing the document identifier and the markup language description for identifying elements of interest of the content of the document;
(f) generating a description of the document based on the comparison and analysis, wherein the document description contains a list of the elements of interest and element properties for the elements of interest, wherein the document description reflects at least one difference between the document and the at least one other document; and
(g) storing the document description.
-
Specification