METHOD AND APPARATUS FOR AN APPLICATION CRAWLER
First Claim
Patent Images
1. A computer-implemented method, comprising:
- loading multiple documents or applications;
assembling the multiple documents or applications;
executing the loaded and assembled multiple documents or applications to instantiate the multiple documents or applications into a completely functioning application;
indexing an object model of the multiple running, instantiated documents or applications including identifying objects that are loaded once the documents or applications have been instantiated;
applying data-query interfaces to media player objects in the instantiated documents or applications; and
using the data-query interfaces to query the media player objects for media player properties and for metadata about any downloaded audio or video streams.
5 Assignments
0 Petitions
Accused Products
Abstract
A computer-implemented method is provided for searching for files on the Internet. In one embodiment, the method may provide an application crawler that assembles and dynamically instantiates all components of a web page. The instantiated web application may then be analyzed to locate desired components on the web page. This may involve finding and analyzing all clickable items in the application, driving the web application by injecting events, and extracting information from the application and writing it to a file or database.
-
Citations
26 Claims
-
1. A computer-implemented method, comprising:
-
loading multiple documents or applications; assembling the multiple documents or applications; executing the loaded and assembled multiple documents or applications to instantiate the multiple documents or applications into a completely functioning application; indexing an object model of the multiple running, instantiated documents or applications including identifying objects that are loaded once the documents or applications have been instantiated; applying data-query interfaces to media player objects in the instantiated documents or applications; and using the data-query interfaces to query the media player objects for media player properties and for metadata about any downloaded audio or video streams. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A computer-implemented method for searching for video files on a computer network, the method comprising:
-
loading multiple documents or applications; assembling the multiple documents or applications; executing the loaded and assembled multiple documents or applications to initially instantiate at least a portion of a functioning application; at an initial point in time, crawling and indexing an object model of the multiple running, initially instantiated documents or applications including identifying objects that are loaded once the documents or applications have been initially instantiated; relating information gathered from crawling and indexing the at least a portion of the functioning application object model of the initially instantiated documents or applications with objects that are displayed at the initial point in time; continuing to instantiate the at least a portion of the functioning application, wherein objects are loaded, assembled and executed to subsequently instantiate the at least a portion of the functioning application; at a subsequent point in time, crawling and indexing an object model of the multiple running, subsequently instantiated documents or applications including identifying objects that are loaded after subsequently instantiating the at least a portion of the functioning application; and relating information gathered from crawling and indexing the at least a portion of the functioning application object model of the subsequently instantiated documents or applications with objects that are displayed at the subsequent point in time. - View Dependent Claims (7, 8, 9, 10, 11)
-
-
12. A computer-implemented method for creating a searchable database, the method comprising:
-
crawling an object model of multiple running, instantiated documents or applications to locate video files; indexing video files found in the object model by saving pointers to the video files in the database; extracting metadata about the video files from the object model; saving the metadata in the database; detecting when a video file has been initiated; extracting metadata in real-time as the video file is being played; and relating the real-time metadata with objects that are displayed at the same point in time.
-
-
13. A computer-implemented method for searching for files on the Internet, the method comprising:
-
providing a protocol crawler for identifying video-rich web sites; and providing an application crawler comprising; an inspector for dynamically instantiating and assembling all components of a web page at one of said video-rich web sites to create at least one instantiated web application; an extractor for identifying specific parts of the instantiated web application that contain useful information and providing the logic required to extract that information into a metadata record including applying data-query interfaces to media player objects in the at least one instantiated web application and using the data-query interfaces to query the media player objects for media player properties and for metadata about any downloaded audio or video streams; and a crawler for analyzing the instantiated web application, finding and analyzing all clickable items in the application, driving the web application by injecting events, and extracting information from the application and writing it to a file or database. - View Dependent Claims (14, 15, 16)
-
-
17. A computer system having a storage device having computer-executable code stored thereon and a processor, the computer system comprising:
an application crawler having programming code for crawling and indexing an object model of running, instantiated documents or applications from one or more web sites including; at an initial point in time, crawling and indexing an object model of the multiple running, initially instantiated documents or applications including identifying objects that are loaded once the documents or applications have been initially instantiated; relating information gathered from crawling and indexing the at least a portion of the functioning application object model of the initially instantiated documents or applications with objects that are displayed at the initial point in time; continuing to instantiate the at least a portion of the functioning application, wherein objects are loaded, assembled and executed to subsequently instantiate the at least a portion of the functioning application; at a subsequent point in time, crawling and indexing an object model of the multiple running, subsequently instantiated documents or applications including identifying objects that are loaded after subsequently instantiating the at least a portion of the functioning application; and relating information gathered from crawling and indexing the at least a portion of the functioning application object model of the subsequently instantiated documents or applications with objects that are displayed at the subsequent point in time. - View Dependent Claims (18, 19, 20, 21, 22)
-
23. A computer system having a storage device having computer-executable code stored thereon and a processor, the computer system comprising:
-
a protocol crawler for identifying video-rich websites; and an application crawler comprising; an inspector for dynamically instantiating and assembling all components of a web page at one of said video-rich websites to create at least one instantiated web application; an extractor for identifying specific parts of the instantiated web application that contain useful information and providing the logic required to extract that information into a metadata record; and a crawler for analyzing the instantiated web application, finding and analyzing all clickable items in the application, driving the web application by injecting events, detecting when a video file has been initiated, and extracting information from the application and writing it to a file or database including extracting metadata in real-time as the video file is being played and relating the real-time metadata with objects that are displayed at the same point in time. - View Dependent Claims (24, 25, 26)
-
Specification