METHOD AND SYSTEM FOR AUTOMATICALLY GENERATING WEB PAGE TRANSCODING INSTRUCTIONS
First Claim
1. A method of automatically generating transcoding instructions to locate and extract a subset of data from a selected web page of a web site, the method comprising:
- receiving an input describing the subset of data, said input comprising one or more data fields and, for each data field, respective field values from at least two sample web pages of a web page family for the web site; and
for each data field;
comparing respective web page code defining the respective field values for commonality to find a matching pattern with which to locate the respective field values, said matching pattern comprising a signature for the data field; and
defining the transcoding instructions in accordance with the matching pattern to locate and extract field values for the data field within web pages of the web page family.
4 Assignments
0 Petitions
Accused Products
Abstract
A system and method are provided for generating transcoding instructions to identify and extract a subset of data from a web page. Input describing the subset of data is received where the input describes one or more data fields and, for each data field, respective field values from at least two sample web pages of a web page family for the web site. For each field, respective web page code defining the respective field values may be compared for commonality to find a matching pattern with which to locate the respective field values. The matching pattern comprises a signature for the data field. Transcoding instructions are defined using the matching pattern to locate and extract field values within web pages of the web page family. The subset of data may be expressed in a target format to transcode the web page for particular client machines (e.g. a wireless mobile device).
-
Citations
21 Claims
-
1. A method of automatically generating transcoding instructions to locate and extract a subset of data from a selected web page of a web site, the method comprising:
-
receiving an input describing the subset of data, said input comprising one or more data fields and, for each data field, respective field values from at least two sample web pages of a web page family for the web site; and for each data field; comparing respective web page code defining the respective field values for commonality to find a matching pattern with which to locate the respective field values, said matching pattern comprising a signature for the data field; and defining the transcoding instructions in accordance with the matching pattern to locate and extract field values for the data field within web pages of the web page family. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system for automatically generating transcoding instructions to locate and extract a subset of data from a selected web page of a web site, the system comprising a processor and memory coupled thereto, said memory storing instructions and data to configure the processor for:
-
receiving an input describing the subset of data, said input comprising one or more data fields and, for each data field, respective field values from at least two sample web pages of a web page family for the web site; and for each data field; comparing respective web page code defining the respective field values for commonality to find a matching pattern with which to locate the respective field values, said matching pattern comprising a signature for the data field; and defining the transcoding instructions in accordance with the matching pattern to locate and extract field values for the data field within web pages of the web page family. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A computer program product for automatically generating transcoding instructions to locate and extract a subset of data from a selected web page of a web site, the computer program product storing computer readable instructions which when executed by a computer processor configure the processor to:
-
receive an input describing the subset of data, said input comprising one or more data fields and, for each data field, respective field values from at least two sample web pages of a web page family for the web site; and for each data field; compare respective web page code defining the respective field values for commonality to find a matching pattern with which to locate the respective field values, said matching pattern comprising a signature for the data field; and define the transcoding instructions in accordance with the matching pattern to locate and extract field values for the data field within web pages of the web page family.
-
Specification