SYSTEM AND METHOD FOR EXTRACTING CONTENT FOR SUBMISSION TO A SEARCH ENGINE
Abstract
A system and a method for automatically submitting Web pages to a search engine, which is preferably used for submitting dynamic Web pages, but may optionally be used for any type of Web page. The present invention features a gateway server for providing these Web pages to the search engine, either directly or optionally through an autonomous software search program. Optionally and more preferably, the gateway server modifies the Web page before serving it to the autonomous software search program and/or search engine.
64 Claims
1-38. (canceled)
39. A computer-implemented method, comprising:
analyzing a structure of a first web page;
detecting, using a processor, non-essential information of a second web page based on at least the structure of the first web page; and
generating instructions for extracting content from the second web page, based on at least the detected non-essential information.
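The three steps of claim 39 can be illustrated with a minimal sketch. It is not the patented implementation. It assumes, purely for illustration, that "structure" means the tag path of each text node, and that a text block of the second page is "non-essential" when the same tag path and text also occur in the first page (as with shared navigation or footer text). All names here (BlockCollector, analyze_structure, detect_non_essential, generate_instructions, apply_instructions) are hypothetical and do not appear in the source.

```python
from html.parser import HTMLParser

# Tags that never have a closing tag; they must not be pushed on the path stack.
VOID_TAGS = {"br", "img", "hr", "meta", "link", "input"}


class BlockCollector(HTMLParser):
    """Collect (tag path, text) pairs for every non-empty text node."""

    def __init__(self):
        super().__init__()
        self.stack = []   # currently open tags, outermost first
        self.blocks = []  # list of (tag path, normalized text)

    def handle_starttag(self, tag, attrs):
        if tag not in VOID_TAGS:
            self.stack.append(tag)

    def handle_endtag(self, tag):
        # Pop back to the matching open tag, tolerating unclosed children.
        if tag in self.stack:
            while self.stack.pop() != tag:
                pass

    def handle_data(self, data):
        text = " ".join(data.split())
        if text:
            self.blocks.append(("/".join(self.stack), text))


def analyze_structure(html):
    """Step 1: describe a page as a list of (tag path, text) blocks."""
    parser = BlockCollector()
    parser.feed(html)
    parser.close()
    return parser.blocks


def detect_non_essential(first_blocks, second_blocks):
    """Step 2 (assumed heuristic): blocks shared with the first page are boilerplate."""
    template = set(first_blocks)
    return [block for block in second_blocks if block in template]


def generate_instructions(second_blocks, non_essential):
    """Step 3: one ("skip" | "extract", tag path) instruction per block."""
    skip = set(non_essential)
    return [("skip" if block in skip else "extract", block[0])
            for block in second_blocks]


def apply_instructions(second_blocks, instructions):
    """Run the instructions, keeping only the text marked for extraction."""
    return [text for (_, text), (action, _) in zip(second_blocks, instructions)
            if action == "extract"]


# Two made-up pages that share a navigation link and a footer.
FIRST = ("<html><body><div><a>Home</a></div><p>First article</p>"
         "<span>Copyright</span></body></html>")
SECOND = ("<html><body><div><a>Home</a></div><p>Second article</p>"
          "<span>Copyright</span></body></html>")

first_blocks = analyze_structure(FIRST)
second_blocks = analyze_structure(SECOND)
instructions = generate_instructions(
    second_blocks, detect_non_essential(first_blocks, second_blocks))
print(apply_instructions(second_blocks, instructions))  # prints ['Second article']
```

Matching on exact path and text is the simplest possible heuristic; the claim itself only requires that detection rely on at least the structure of the first page, so other structural comparisons would fit equally well.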
50. An apparatus, comprising:
a storage device; and
a processor coupled to the storage device, wherein the storage device stores a program for controlling the processor, and wherein the processor, being operative with the program, is configured to:
analyze a structure of a first web page;
detect non-essential information of a second web page based on at least the structure of the first web page; and
generate instructions for extracting content from the second web page, based on at least the detected non-essential information.
61. A computer-readable medium storing instructions that, when executed by a processor, perform a method comprising the steps of:
analyzing a structure of a first web page;
detecting, using a processor, non-essential information of a second web page based on at least the structure of the first web page; and
generating instructions for extracting content from the second web page, based on at least the detected non-essential information, whereby the extracted content does not include the detected non-essential information.
Specification