SYSTEM AND METHOD FOR RESUME, YEARBOOK AND REPORT GENERATION BASED ON WEBCRAWLING AND SPECIALIZED DATA COLLECTION

US 20120246139A1
Filed: 06/08/2012
Published: 09/27/2012
Est. Priority Date: 10/21/2010
Status: Abandoned Application

First Claim

Patent Images

1. A website system, the website system comprising:

a web crawler that crawls webpages starting with seed URLs, and crawls links and references contained in those webpages, and gathers a URL frontier;

the web crawler visits the URLs from the URL frontier recursively according to a set of policies and gathers a crawled data, wherein the crawled data also comprises information about one or more entities encountered or referenced in the webpages crawled;

a relationship tracker module that creates a network of relationships from the crawled data and tracks relationships between the one or more entities;

the relationship tracker module determining the dates of relationships, duration of those relationships, the category and type of those relationships and storing them as searchable data in a database in the website system;

an entity tracker module that collects and tracks details of the one or more entities, the details comprising activities associated with the one or more entities, and modifications to those details over time, and stores them in the database; and

the website system, when triggered, providing a report related to one of the one or more entities, a report of relationships over time for a given entity among the one or more entities, or a report of activities associated with the given entity specified.

View all claims

0 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A website system collecting specialized data on users and organizations from a web crawler. The website system receives from a user a search string (via a search webpage provided by the website for example) with a request to create a technology overview and research report with recent updates and research information in the field for a user specified technology/subject area. It creates a technology overview and research report and presents it. Similarly, it creates user profiles, yearbooks, resumes, etc. based on the specialized data collected from web crawling.

Citations

24 Claims

1. A website system, the website system comprising:
- a web crawler that crawls webpages starting with seed URLs, and crawls links and references contained in those webpages, and gathers a URL frontier;
  
  the web crawler visits the URLs from the URL frontier recursively according to a set of policies and gathers a crawled data, wherein the crawled data also comprises information about one or more entities encountered or referenced in the webpages crawled;
  
  a relationship tracker module that creates a network of relationships from the crawled data and tracks relationships between the one or more entities;
  
  the relationship tracker module determining the dates of relationships, duration of those relationships, the category and type of those relationships and storing them as searchable data in a database in the website system;
  
  an entity tracker module that collects and tracks details of the one or more entities, the details comprising activities associated with the one or more entities, and modifications to those details over time, and stores them in the database; and
  
  the website system, when triggered, providing a report related to one of the one or more entities, a report of relationships over time for a given entity among the one or more entities, or a report of activities associated with the given entity specified.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
- - 2. The website system of claim 1 further comprising:
    - the one or more entities comprising an organization;
      
      the entity tracker module collects and tracks details of the organization, the details comprising activities associated with the organization, and modifications to the details of the organization; and
      
      the website system, when triggered, providing a report of details of the organization and relationships over time for the organization with some of the other entities.
  - 3. The website system of claim 1 further comprising:
    - the one or more entities comprising references to one or more organizations and at least one person;
      
      the entity tracker module collects and tracks details of the one or more organizations and the person, and modifications to the details of the one or more organizations and the at least one person; and
      
      the website system, when triggered, providing a resume report of relationships over time for one of the at least one person with at least one of the one or more organizations.
  - 4. The website system of claim 1 further comprising:
    - the website system providing a search interface that facilitates search and retrieval of data collected and managed by the relationship tracker module and the entity tracker module.
  - 5. The website system of claim 1 further comprising:
    - the one or more entities comprising identification of and references to one or more people and one or more organizations; and
      
      a dynamic publication creator module that dynamically creates an annual publication providing a people information, an events information, an activities information and related data for a given year in a target organization among the one or more organizations, based at least partially on the data in the database.
  - 6. The website system of claim 5 further comprising:
    - the website system, upon receiving a request that specifies the given year and the target organization, creates an annual publication employing the dynamic publication creator module and presents it and communicates it employing webpages and email as necessary.
  - 7. The website system of claim 6 wherein the annual publication is a yearbook and wherein the website system selectively charges fees for presenting or communicating the annual publication.
  - 8. The website system of claim 1 wherein the one or more entities comprises one or more educational institutions and one or more individuals who are students in those educational institutions, and wherein the report of relationships over time is a yearbook for the one or more individuals.
  - 9. The website system of claim 2 wherein the one or more entities comprises one or more commercial or business organizations, and one or more individuals who were, at some point, employees or workers in those one or more commercial or business organizations.
  - 10. The website system of claim 1 wherein the report of relationships over time that also comprises activities over time, based on data in the database, is appropriately presented, as relevant, as a webpage or an email, and is organized as one of a resume, a newsletter published by an educational institution, a bibliography, a newspaper, a publication from a school district, a product review, a sports statistics publication, a research paper on a topic, and a student graduation related document.
  - 11. The website system of claim 1 further comprising:
    - the entity tracker module also collects and tracks documents associated with the one or more entities and stores references to those documents; and
      
      the report created by the website system incorporates at least a portion of the documents or references to the documents.
  - 12. The website system of claim 5 wherein the annual publication is a yearbook for a school, for a corporation, for a business or for a social group.
  - 13. The website system of claim 5 wherein the annual publication is a profile of at least one of the one or more organizations for a given year.
  - 14. The website system of claim 1 wherein the one or more entities comprises one or more individuals and one or more organizations, the website system further comprising:
    - a search engine that uses the web crawler to collect information on the one or more individuals and the one or more organizations; and
      
      the website system comprising a website that provides webpages to permit users on the Internet to enter a search term in order to retrieve details regarding the at least one of the one or more individuals or the one or more organizations.
  - 15. The website system of claim 1, wherein the one or more entities comprises one or more individuals, the website system further comprising:
    - the website system providing an interface to automatically create a resume or a user profile based on the crawled data and the data in the database, for at least one of the one or more individuals.

16. A web crawler for a website system, the web crawler comprising:
- the web crawler crawling through Internet webpages;
  
  a user data module that collects user data from the Internet webpages and stores it in a database, wherein the user data corresponds to a plurality of users; and
  
  the user data module automatically creating a resume or a user profile for at least one of the plurality of users when requested, based at least on the user data in the database.
- View Dependent Claims (17, 18, 19, 24)
- - 17. The web crawler of claim 16 further comprising:
    - an organization tracker module that collects an org data from the Internet webpages and stores it in the database, wherein the org data corresponds to a plurality of organizations; and
      
      the organization tracker module automatically creating a organization profile report for at least one of the plurality of organizations when requested, based at least on the data in the database.
  - 18. The web crawler of claim 16 wherein the user data also comprises textual, audio and video recommendations and user reviews and feedback for products and services provided by the plurality of users.
  - 19. The web crawler of claim 16 further comprising:
    - a selection policy module that specifies policies regarding which webpages to retrieve or download as part of the crawling activity by the web crawler; and
      
      the user data module encountering the plurality of users in webpages retrieved in accordance with the selection policies enforced by the selection policy module, collecting user data from the webpages and storing details of the plurality of users in the database.
  - 24. The system of claim 19 wherein the user feedback to the presented report comprises at least one of a rating feedback, a preference feedback, an audio feedback, a video feedback, an image feedback, and a text feedback.

20. A method of operating a web system, the method comprising:
- collecting, by the web system that comprises a web crawler or is communicatively coupled to the web crawler, using crawling techniques and crawling across a plurality of websites and processing a plurality of webpages, a collection of user information for a plurality of users and a collection of organization information for a plurality of organizations;
  
  storing and updating the collection of user information and the collection of organization information in a database;
  
  creating, upon a user request, an annual publication based upon the data in the database; and
  
  presenting the annual publication employing webpages or employing email.
- View Dependent Claims (21, 22, 23)
- - 21. The method of claim 20 wherein the user request comprises a report type, a given year and a given organization and wherein the annual publication is, based on the report type, one of a yearbook for the given organization for the given year or an annual profile for the given organization that comprises one or more of products information, services information, market share information, research information, competitive intelligence information, patents information, sales information, marketing information, legal status information, financial resources information and personnel information.
  - 22. The method of claim 21 further comprising:
    - customizing the annual publication created by editing, modifying, enhancing and formatting the annual publication to create a customized version of the annual publication; and
      
      sharing the customized version of the annual publication with one or more friends.
  - 23. The method of claim 20 wherein collecting, by the web crawler using crawling techniques, also comprising gathering information on various subject matters and technologies, the method further comprising:
    - generating automatically a research paper and presenting to a user, for a given subject matter identified by a user, based on data in the database and based on information collected on the various subject matters and technologies during crawling.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Bindu Rama Rao
Original Assignee
Bindu Rama Rao
Inventors
RAO, BINDU RAMA

Application Number

US13/492,799
Publication Number

US 20120246139A1
Time in Patent Office

Days
Field of Search
US Class Current

707/709
CPC Class Codes

G06F 16/951 Indexing; Web crawling tech...

SYSTEM AND METHOD FOR RESUME, YEARBOOK AND REPORT GENERATION BASED ON WEBCRAWLING AND SPECIALIZED DATA COLLECTION

First Claim

0 Assignments

0 Petitions

Accused Products

Abstract

Citations

24 Claims

Specification

Solutions

Use Cases

Quick Links

SYSTEM AND METHOD FOR RESUME, YEARBOOK AND REPORT GENERATION BASED ON WEBCRAWLING AND SPECIALIZED DATA COLLECTION

First Claim

0 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

24 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links