State extrapolation for automated and semi-automated crawling architecture
First Claim
1. An apparatus comprising:
- a memory; and
at least one processor configured to;
receive an identification of a target state of an application reachable from an intermediate state of the application;
access, by using the identification of the target state, the intermediate state in an executing instance of the application, wherein the intermediate state is a state prior to the target state;
identify a plurality of states which are reachable from the intermediate state by using information indicating linking relation between states in the application;
identify at least one state which has at least a threshold level of commonality with the target state; and
add identified at least one state and the target state to a state list,wherein the at least one processor is configured to extract text and metadata from each of the identified at least one state and the target state in the state list,wherein information based on the extracted text and metadata is stored in the memory, andwherein the at least one processor is further configured to receive a query from a user device and transmit a search result corresponding to the query which is identified at least based on the information based on the extracted text and metadata.
4 Assignments
0 Petitions
Accused Products
Abstract
A system for automated acquisition of content from an application includes a link extraction controller that receives an identification of a target state of the application directly reachable from an intermediate state and a specification of a user interface element of the intermediate state actuated by a user to arrive at the target state. After navigating to the intermediate state in an executing instance of the application and extracting a tree of user interface widgets, the link extraction controller identifies widget sub-trees that have at least a threshold level of commonality with a reference widget sub-tree that includes the specified user interface element. The link extraction controller adds states, including the target state, reachable by user actuation of the identified widget sub-trees to a state list. A scraper module extracts text and metadata from each of the states in the state list for storage in a data store.
33 Citations
26 Claims
-
1. An apparatus comprising:
-
a memory; and at least one processor configured to; receive an identification of a target state of an application reachable from an intermediate state of the application; access, by using the identification of the target state, the intermediate state in an executing instance of the application, wherein the intermediate state is a state prior to the target state; identify a plurality of states which are reachable from the intermediate state by using information indicating linking relation between states in the application; identify at least one state which has at least a threshold level of commonality with the target state; and add identified at least one state and the target state to a state list, wherein the at least one processor is configured to extract text and metadata from each of the identified at least one state and the target state in the state list, wherein information based on the extracted text and metadata is stored in the memory, and wherein the at least one processor is further configured to receive a query from a user device and transmit a search result corresponding to the query which is identified at least based on the information based on the extracted text and metadata. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A method comprising:
-
receiving, using at least one processor of an apparatus, an identification of a target state of an application reachable from an intermediate state of the application; accessing, by using the identification of the target state, the intermediate state in an executing instance of the application, wherein the intermediate state is a state prior to the target state; identifying, using the at least one processor, a plurality of states which are reachable from the intermediate state by using information indicating linking relation between states in the application; identifying, using the at least one processor, at least one state which has at least a threshold level of commonality with the target state; adding, using the at least one processor, identified at least one state to a state list; extracting text and metadata from each of the identified at least one state and the target state in the state list; and storing, using the at least one processor, information based on the extracted text and metadata in a memory, wherein the at least one processor is further configured to receive a query from a user device and transmit a search result corresponding to the query which is identified at least based on the information based on the extracted text and metadata. - View Dependent Claims (16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26)
-
Specification