METHOD AND SYSTEM TO IDENTIFY PROVIDERS IN WEB DOCUMENTS
First Claim
Patent Images
1. A method of identifying providers, comprising:
- obtaining a results document from a search, wherein the results document comprises references to documents that contain a keyword;
analyzing the results document to identify a plurality of the references;
accessing the documents that correspond to the identified references; and
analyzing each of the accessed documents to determine a probabilistic value that the accessed document is associated with a provider.
2 Assignments
0 Petitions
Accused Products
Abstract
An exemplary embodiment of the present invention provides a method of identifying providers. The method includes obtaining a results document from a search, wherein the results document comprises references to documents that contain a keyword. analyzing the results document to identify a plurality of the references. The method includes accessing each of the documents using the identified references and analyzing each of the accessed documents to determine a probabilistic value that the accessed document is associated with a provide.
42 Citations
20 Claims
-
1. A method of identifying providers, comprising:
-
obtaining a results document from a search, wherein the results document comprises references to documents that contain a keyword; analyzing the results document to identify a plurality of the references; accessing the documents that correspond to the identified references; and analyzing each of the accessed documents to determine a probabilistic value that the accessed document is associated with a provider. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A computer system for identifying providers, comprising:
-
a processor that is adapted to execute stored instructions; a memory device that stores instructions that are executable by the processor, the instructions comprising; a Web browser configured to access Web pages over the network interface; a link dereferencer configured to obtain a source code for each of a plurality of the Web pages in a source document; an indicator extractor configured to analyze the source code for each of the Web pages; and an indicator evaluator configured to calculate a probability that each Web page is associated with a provider. - View Dependent Claims (15, 16, 17, 18)
-
-
19. A tangible, computer-readable medium, comprising:
-
code configured to accept keywords from an input device, access a search site over a network interface, and display a results document on a display; code configured to analyze the results document to identify a plurality of links to Web pages, access the Web pages using the identified links, and store a source code for each of the accessed Web pages in a memory; code configured to analyze the source code for each accessed Web page for indicators that the accessed Web page is associated with a provider; and code configured to compare the indicators to probabilistic values for each indicator that are stored in the storage device, and calculate a probability that the accessed Web page is associated with a provider. - View Dependent Claims (20)
-
Specification