SYSTEM AND METHOD FOR RESUME, YEARBOOK AND REPORT GENERATION BASED ON WEBCRAWLING AND SPECIALIZED DATA COLLECTION
First Claim
1. A website system, the website system comprising:
a web crawler that crawls webpages starting with seed URLs, and crawls links and references contained in those webpages, and gathers a URL frontier;
the web crawler visits the URLs from the URL frontier recursively according to a set of policies and gathers a crawled data, wherein the crawled data also comprises information about one or more entities encountered or referenced in the webpages crawled;
a relationship tracker module that creates a network of relationships from the crawled data and tracks relationships between the one or more entities;
the relationship tracker module determining the dates of relationships, duration of those relationships, the category and type of those relationships and storing them as searchable data in a database in the website system;
an entity tracker module that collects and tracks details of the one or more entities, the details comprising activities associated with the one or more entities, and modifications to those details over time, and stores them in the database; and
the website system, when triggered, providing a report related to one of the one or more entities, a report of relationships over time for a given entity among the one or more entities, or a report of activities associated with the given entity specified.
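The crawling element of the claim (seed URLs, a URL frontier, and a set of policies) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and does not come from the patent: the `crawl` function, the `fetch` hook, the `max_pages` and `allowed_hosts` policies, and the in-memory `FAKE_WEB` pages. The claim speaks of visiting the frontier recursively; this sketch uses an iterative queue instead, which visits the same set of pages.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkExtractor(HTMLParser):
    """Collects href values from anchor tags in one page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(seed_urls, fetch, max_pages=10, allowed_hosts=None):
    """Breadth-first crawl starting from seed_urls.

    fetch(url) is a hypothetical hook returning HTML text or None.
    max_pages and allowed_hosts stand in for the claim's "set of policies".
    """
    frontier = deque(seed_urls)   # URLs waiting to be visited
    seen = set(seed_urls)         # URLs already queued, to avoid repeats
    crawled = {}                  # url -> HTML gathered so far
    while frontier and len(crawled) < max_pages:
        url = frontier.popleft()
        html = fetch(url)
        if html is None:
            continue
        crawled[url] = html
        parser = LinkExtractor()
        parser.feed(html)
        for href in parser.links:
            link = urljoin(url, href)  # resolve relative links
            if allowed_hosts is not None and urlparse(link).netloc not in allowed_hosts:
                continue               # policy: stay on allowed hosts
            if link not in seen:
                seen.add(link)
                frontier.append(link)
    return crawled


# A tiny in-memory "web" so the sketch runs without network access.
FAKE_WEB = {
    "http://example.org/": '<a href="/alice">Alice</a> <a href="http://other.test/x">x</a>',
    "http://example.org/alice": '<a href="/">home</a> <a href="/acme">Acme</a>',
    "http://example.org/acme": "<p>Acme Corp</p>",
}

pages = crawl(["http://example.org/"], FAKE_WEB.get, allowed_hosts={"example.org"})
print(sorted(pages))
```

A real crawler would also need politeness delays, robots.txt handling, and entity extraction from the gathered pages; those are left out to keep the frontier logic visible.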
Abstract
A website system collects specialized data on users and organizations from a web crawler. The website system receives from a user a search string (for example, via a search webpage provided by the website) with a request to create a technology overview and research report, with recent updates and research information in the field, for a user-specified technology or subject area. It creates the technology overview and research report and presents it. Similarly, it creates user profiles, yearbooks, resumes, etc. based on the specialized data collected from web crawling.
24 Claims
1. A website system, the website system comprising:
a web crawler that crawls webpages starting with seed URLs, and crawls links and references contained in those webpages, and gathers a URL frontier;
the web crawler visits the URLs from the URL frontier recursively according to a set of policies and gathers a crawled data, wherein the crawled data also comprises information about one or more entities encountered or referenced in the webpages crawled;
a relationship tracker module that creates a network of relationships from the crawled data and tracks relationships between the one or more entities;
the relationship tracker module determining the dates of relationships, duration of those relationships, the category and type of those relationships and storing them as searchable data in a database in the website system;
an entity tracker module that collects and tracks details of the one or more entities, the details comprising activities associated with the one or more entities, and modifications to those details over time, and stores them in the database; and
the website system, when triggered, providing a report related to one of the one or more entities, a report of relationships over time for a given entity among the one or more entities, or a report of activities associated with the given entity specified.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15
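The relationship tracker of claim 1 stores dates, duration, category, and type as searchable data and supports a report of relationships over time. One way to picture that is the following sketch, which uses an in-memory SQLite table. The table layout, the function names `add_relationship` and `relationship_report`, and the sample entities are all assumptions for illustration; the claim does not specify a schema.

```python
import sqlite3
from datetime import date

# In-memory database standing in for "a database in the website system".
conn = sqlite3.connect(":memory:")
conn.execute(
    """CREATE TABLE relationship (
           source TEXT, target TEXT, category TEXT, type TEXT,
           start_date TEXT, end_date TEXT)"""
)


def add_relationship(source, target, category, rel_type, start, end=None):
    """Store one relationship; end=None means it is still ongoing."""
    conn.execute(
        "INSERT INTO relationship VALUES (?, ?, ?, ?, ?, ?)",
        (source, target, category, rel_type,
         start.isoformat(), end.isoformat() if end else None),
    )


def relationship_report(entity, today):
    """List an entity's relationships in date order, with duration in days."""
    rows = conn.execute(
        "SELECT source, target, category, type, start_date, end_date "
        "FROM relationship WHERE source = ? OR target = ? "
        "ORDER BY start_date",
        (entity, entity),
    ).fetchall()
    report = []
    for source, target, category, rel_type, start, end in rows:
        start_d = date.fromisoformat(start)
        end_d = date.fromisoformat(end) if end else today
        report.append({
            "with": target if source == entity else source,
            "category": category,
            "type": rel_type,
            "start": start,
            "end": end,
            "days": (end_d - start_d).days,  # duration of the relationship
        })
    return report


# Hypothetical relationships extracted from crawled data.
add_relationship("Alice", "Acme", "professional", "employee",
                 date(2010, 1, 1), date(2012, 1, 1))
add_relationship("Alice", "Bob", "academic", "co-author", date(2013, 6, 1))

for row in relationship_report("Alice", today=date(2013, 6, 11)):
    print(row)
```

Storing dates as ISO 8601 text keeps them sortable with a plain `ORDER BY`, which is why the report comes back in chronological order without extra parsing.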
16. A web crawler for a website system, the web crawler comprising:
the web crawler crawling through Internet webpages;
a user data module that collects user data from the Internet webpages and stores it in a database, wherein the user data corresponds to a plurality of users; and
the user data module automatically creating a resume or a user profile for at least one of the plurality of users when requested, based at least on the user data in the database.
Dependent claims: 17, 18, 19, 24
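Claim 16 describes creating a resume on request from stored user data. The sketch below shows one plausible shape for that step. The record fields (`user`, `kind`, `year`, `text`), the section order, and the function name `build_resume` are hypothetical; the claim does not say how user data is structured or how a resume is laid out.

```python
from collections import defaultdict

# Hypothetical records a user data module might have stored after crawling.
USER_DATA = [
    {"user": "Alice Example", "kind": "experience", "year": 2011,
     "text": "Software Engineer, Acme Corp"},
    {"user": "Alice Example", "kind": "experience", "year": 2015,
     "text": "Staff Engineer, Initech"},
    {"user": "Alice Example", "kind": "education", "year": 2008,
     "text": "B.S. Computer Science, Example University"},
    {"user": "Bob Example", "kind": "experience", "year": 2012,
     "text": "Analyst, Globex"},
]


def build_resume(user, records):
    """Assemble a plain-text resume for one user from stored records."""
    sections = defaultdict(list)
    for rec in records:
        if rec["user"] == user:
            sections[rec["kind"]].append(rec)

    lines = [user.upper()]
    for kind in ("experience", "education"):
        # Most recent entries first within each section.
        items = sorted(sections.get(kind, []),
                       key=lambda r: r["year"], reverse=True)
        if not items:
            continue
        lines.append("")
        lines.append(kind.capitalize())
        for rec in items:
            lines.append(f"  {rec['year']}: {rec['text']}")
    return "\n".join(lines)


resume = build_resume("Alice Example", USER_DATA)
print(resume)
```

A user profile, the other output named in the claim, could reuse the same grouped records with a different rendering step.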
20. A method of operating a web system, the method comprising:
collecting, by the web system that comprises a web crawler or is communicatively coupled to the web crawler, using crawling techniques and crawling across a plurality of websites and processing a plurality of webpages, a collection of user information for a plurality of users and a collection of organization information for a plurality of organizations;
storing and updating the collection of user information and the collection of organization information in a database;
creating, upon a user request, an annual publication based upon the data in the database; and
presenting the annual publication employing webpages or employing email.
Dependent claims: 21, 22, 23
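The last two steps of claim 20, creating an annual publication from stored data and presenting it as a webpage, might look like the following sketch. The `USERS` and `ORGS` records, their fields, and the function `build_yearbook` are illustrative assumptions rather than anything the claim defines, and the email option is not shown.

```python
from html import escape

# Hypothetical stored user and organization information, one event per record.
USERS = [
    {"name": "Alice Example", "year": 2023, "event": "Joined Acme Corp"},
    {"name": "Bob Example", "year": 2022,
     "event": "Graduated from Example University"},
]
ORGS = [
    {"name": "Acme & Sons", "year": 2023, "event": "Opened a research lab"},
]


def build_yearbook(year, users, orgs):
    """Render one year's user and organization events as an HTML page."""

    def section(title, rows):
        # Keep only the requested year, sorted by name, with HTML escaping.
        items = "".join(
            f"<li>{escape(r['name'])}: {escape(r['event'])}</li>"
            for r in sorted(rows, key=lambda r: r["name"])
            if r["year"] == year
        )
        return f"<h2>{escape(title)}</h2><ul>{items}</ul>"

    return (
        f"<html><body><h1>Yearbook {year}</h1>"
        + section("People", users)
        + section("Organizations", orgs)
        + "</body></html>"
    )


page = build_yearbook(2023, USERS, ORGS)
print(page)
```

Escaping every crawled string before placing it in the page matters here because the data originates from arbitrary third-party webpages.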
Specification