Method and system for automatically generating web page transcoding instructions
First Claim
1. A method of automatically generating transcoding instructions to locate and extract a subset of data from a selected web page of a web site, the method comprising:
- receiving an input describing the subset of data, said input comprising one or more data fields and, for each data field, respective field values from at least two sample web pages of a web page family for the web site;
- identifying a common pattern among the respective field values for the at least two sample web pages to define a pattern match between the respective data field values, said matching pattern defining a signature for the data field; and
- generating the transcoding instructions in accordance with the identified common pattern, the transcoding instructions comprising location information of field values for extracting the subset of data within a web page of the web page family, the transcoding instructions for expressing the extracted subset of data in a target format thereby to transcode the web page, wherein identifying the commonality among the respective field values comprises locating the respective field values in the respective web page code.
Abstract
A system and method are provided for generating transcoding instructions to identify and extract a subset of data from a web page. Input describing the subset of data is received where the input describes one or more data fields and, for each data field, respective field values from at least two sample web pages of a web page family for the web site. For each field, respective web page code defining the respective field values may be compared for commonality to find a matching pattern with which to locate the respective field values. The matching pattern comprises a signature for the data field. Transcoding instructions are defined using the matching pattern to locate and extract field values within web pages of the web page family. The subset of data may be expressed in a target format to transcode the web page for particular client machines (e.g. a wireless mobile device).
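The comparison described in the abstract can be illustrated with a minimal sketch. It rests on several assumptions that are not taken from the patent: the signature is modeled as the literal page code shared immediately before and after each sample value, the transcoding instructions are a plain mapping from field name to that delimiter pair, and JSON stands in for the target format. All function names and sample pages are hypothetical.

```python
import json
from os.path import commonprefix

# Hypothetical sample pages from one web page family (illustrative only).
PAGE_A = '<html><body><h1>Red Kettle</h1><span class="price">$10.00</span><p>In stock</p></body></html>'
PAGE_B = '<html><body><h1>Blue Teapot</h1><span class="price">$25.50</span><p>Sold out</p></body></html>'
PAGE_C = '<html><body><h1>Green Mug</h1><span class="price">$7.25</span><p>In stock</p></body></html>'


def locate(page: str, value: str) -> tuple[str, str]:
    """Locate a known field value and return the code before and after it."""
    start = page.index(value)  # raises ValueError if the value is absent
    return page[:start], page[start + len(value):]


def common_suffix(a: str, b: str) -> str:
    """Longest string that both a and b end with."""
    return commonprefix([a[::-1], b[::-1]])[::-1]


def derive_signature(samples: list[tuple[str, str]]) -> tuple[str, str]:
    """Keep only the surrounding code shared by every (page, value) sample.

    Assumption: the shared left/right delimiters act as the field signature.
    """
    contexts = [locate(page, value) for page, value in samples]
    left, right = contexts[0]
    for before, after in contexts[1:]:
        left = common_suffix(left, before)
        right = commonprefix([right, after])
    if not left or not right:
        raise ValueError("no common pattern found around the sample values")
    return left, right


def generate_instructions(
    field_samples: dict[str, list[tuple[str, str]]]
) -> dict[str, tuple[str, str]]:
    """Build one signature per data field from its sample values."""
    return {name: derive_signature(s) for name, s in field_samples.items()}


def extract(page: str, signature: tuple[str, str]) -> str:
    """Use a signature to find a field value in another page of the family."""
    left, right = signature
    start = page.index(left) + len(left)
    end = page.index(right, start)
    return page[start:end]


def transcode(page: str, instructions: dict[str, tuple[str, str]]) -> str:
    """Extract every field and express the result in the assumed JSON format."""
    return json.dumps({name: extract(page, sig) for name, sig in instructions.items()})


instructions = generate_instructions({
    "name": [(PAGE_A, "Red Kettle"), (PAGE_B, "Blue Teapot")],
    "price": [(PAGE_A, "$10.00"), (PAGE_B, "$25.50")],
})
result = transcode(PAGE_C, instructions)
```

Literal delimiter matching is the simplest possible stand-in for a matching pattern; it breaks down when sample values happen to share leading or trailing characters with their neighbors, so a practical implementation would likely compare structure (for example, element paths) rather than raw strings.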
36 Citations
17 Claims
1. A method of automatically generating transcoding instructions to locate and extract a subset of data from a selected web page of a web site, the method comprising:
- receiving an input describing the subset of data, said input comprising one or more data fields and, for each data field, respective field values from at least two sample web pages of a web page family for the web site;
- identifying a common pattern among the respective field values for the at least two sample web pages to define a pattern match between the respective data field values, said matching pattern defining a signature for the data field; and
- generating the transcoding instructions in accordance with the identified common pattern, the transcoding instructions comprising location information of field values for extracting the subset of data within a web page of the web page family, the transcoding instructions for expressing the extracted subset of data in a target format thereby to transcode the web page, wherein identifying the commonality among the respective field values comprises locating the respective field values in the respective web page code.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
9. A system for automatically generating transcoding instructions to locate and extract a subset of data from a selected web page of a web site, the system comprising a processor and memory coupled thereto, said memory storing instructions and data to configure the processor for:
- receiving an input describing the subset of data, said input comprising one or more data fields and, for each data field, respective field values from at least two sample web pages of a web page family for the web site;
- identifying a common pattern among the respective field values for the at least two sample web pages to define a pattern match between the respective data field values, said matching pattern defining a signature for the data field; and
- generating the transcoding instructions in accordance with the identified common pattern, the transcoding instructions comprising location information of field values for extracting the subset of data within a web page of the web page family, the transcoding instructions for expressing the extracted subset of data in a target format thereby to transcode the web page, wherein identifying the commonality among the respective field values comprises locating the respective field values in the respective web page code.
Dependent claims: 10, 11, 12, 13, 14, 15, 16
17. A computer program product for automatically generating transcoding instructions to locate and extract a subset of data from a selected web page of a web site, the computer program product storing computer readable instructions which when executed by a computer processor configure the processor to:
- receive an input describing the subset of data, said input comprising one or more data fields and, for each data field, respective field values from at least two sample web pages of a web page family for the web site;
- identify a common pattern among the respective field values for the at least two sample web pages to define a pattern match between the respective data field values, said matching pattern defining a signature for the data field; and
- generate the transcoding instructions in accordance with the identified common pattern, the transcoding instructions comprising location information of field values for extracting the subset of data within a web page of the web page family, the transcoding instructions for expressing the extracted subset of data in a target format thereby to transcode the web page, wherein identifying the commonality among the respective field values comprises locating the respective field values in the respective web page code.
Specification