System and method for web knowledge extraction
Abstract
Embodiments of the disclosed invention include an apparatus, method, and computer program product for creating and executing a client workflow for web data extraction. For example, the disclosed embodiments provide a system for web data extraction. The system includes a data storage component configured for storing a plurality of preconfigured reusable software components that provide services for creating a client workflow for web data extraction. The system also includes a communication interface operable to receive workflow definitions from a client for creating the client workflow for web data extraction utilizing at least one of the plurality of preconfigured reusable software components. The system has a processor for executing instructions to run the client workflow for web data extraction.
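The arrangement described in the abstract (stored reusable components, a workflow definition received from a client, and a processor that runs the resulting workflow) can be illustrated with a minimal sketch. Everything named here is hypothetical and not taken from the patent: the `REGISTRY` dictionary stands in for the data storage component, the list of component names stands in for the client's workflow definition, and the `fetch` and `strip_tags` components are stubs that make no network calls.

```python
from typing import Callable, Dict, List

# A component takes the workflow state and returns an updated state.
Component = Callable[[dict], dict]

# Hypothetical stand-in for the data storage component holding
# preconfigured reusable software components.
REGISTRY: Dict[str, Component] = {}


def register(name: str) -> Callable[[Component], Component]:
    """Store a component in the registry under the given name."""
    def wrap(fn: Component) -> Component:
        REGISTRY[name] = fn
        return fn
    return wrap


@register("fetch")
def fetch(state: dict) -> dict:
    # Stub: a real component would call a web data extraction service.
    return {**state, "raw": f"<html>{state['query']}</html>"}


@register("strip_tags")
def strip_tags(state: dict) -> dict:
    # Stub post-processing step that removes the wrapper tags added above.
    text = state["raw"].replace("<html>", "").replace("</html>", "")
    return {**state, "text": text}


def build_workflow(definition: List[str]) -> List[Component]:
    """Create a client workflow from a list of component names."""
    unknown = [name for name in definition if name not in REGISTRY]
    if unknown:
        raise KeyError(f"unknown components: {unknown}")
    return [REGISTRY[name] for name in definition]


def run_workflow(workflow: List[Component], state: dict) -> dict:
    """Run each component in order, passing the state along."""
    for step in workflow:
        state = step(state)
    return state


# Hypothetical client workflow definition: an ordered list of component names.
result = run_workflow(build_workflow(["fetch", "strip_tags"]), {"query": "patents"})
```

In this sketch the client supplies only component names; the system resolves them against the stored components and rejects a definition that names a component it does not hold.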
17 Claims
1. A system for web data extraction, comprising:
- a non-transitory data storage component configured for storing a plurality of preconfigured reusable software components that provide services for creating a client workflow for web data extraction;
- a communication interface operable to receive information related to workflow from a client for creating the client workflow for web data extraction utilizing at least one of the plurality of preconfigured reusable software components; and
- a processor for executing instructions to run the client workflow for web data extraction, wherein the plurality of preconfigured reusable software components includes a web data surfacing component configured for extracting web data from a plurality of web data extraction services, the web data surfacing component comprising instructions for formatting the request in at least one format to generate the at least one formatted request, communicating the at least one formatted request to the at least one of the plurality of web data extraction services, and receiving a response from the at least one of the plurality of web data extraction services.
Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
11. A method, implemented on a machine having at least one processor, storage, and a communication platform connected to a network for web data extraction, the method comprising:
- receiving information related to workflow from a client device for at least one of a plurality of preconfigured reusable software components that provide services for creating a client workflow for web data extraction;
- generating the client workflow based on the information related to workflow; and
- extracting web data in accordance with the client workflow which comprises;
  - formatting a web data extraction request in at least one format to generate the at least one formatted request,
  - communicating the at least one formatted request to the at least one of the plurality of web data extraction services, and
  - receiving a response from the at least one of the plurality of web data extraction services that received the at least one formatted request.
Dependent Claims (12, 13, 14)
15. A machine-readable tangible and non-transitory medium having information for web data extraction, wherein the information, when read by the machine, causes the machine to perform the following:
- receive information related to workflow from a client device for at least one of a plurality of preconfigured reusable software components that provide services for creating a client workflow for web data extraction;
- generate the client workflow based on the information related to workflow;
- extract web data in accordance with the client workflow;
- format a web data extraction request in at least one format to generate the at least one formatted request;
- communicate the at least one formatted request to the at least one of the plurality of web data extraction services; and
- receive a response from the at least one of the plurality of web data extraction services that received the at least one formatted request.
Dependent Claims (16, 17)
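The formatting, communicating, and receiving steps recited in the claims above can be sketched as follows. This is an illustrative reading only, not the patented implementation: the two formats (`json` and `query`), the service names, and the `surface` function are assumptions introduced for the example, and the services are in-process stubs rather than real web data extraction services.

```python
import json
from typing import Callable, Dict, Tuple
from urllib.parse import urlencode

# Hypothetical request formatters, one per supported format.
Formatter = Callable[[dict], str]
FORMATTERS: Dict[str, Formatter] = {
    "json": lambda req: json.dumps(req, sort_keys=True),
    "query": lambda req: urlencode(sorted(req.items())),
}


# Stub services: each accepts a formatted request and returns a response.
def json_service(payload: str) -> dict:
    return {"service": "json_service", "echo": json.loads(payload)}


def query_service(payload: str) -> dict:
    return {"service": "query_service", "echo": payload}


# Each hypothetical service is paired with the request format it expects.
SERVICES: Dict[str, Tuple[str, Callable[[str], dict]]] = {
    "json_service": ("json", json_service),
    "query_service": ("query", query_service),
}


def surface(request: dict) -> Dict[str, dict]:
    """Format the request per service, send it, and collect the responses."""
    responses: Dict[str, dict] = {}
    for name, (fmt, send) in SERVICES.items():
        formatted = FORMATTERS[fmt](request)  # format the request
        responses[name] = send(formatted)     # communicate it, receive a response
    return responses


responses = surface({"q": "web data", "limit": 5})
```

Keeping the formatters separate from the services means one client request can be surfaced to several services, each receiving the format it expects.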
Specification