Job categorization system and method
First Claim
1. A computer system comprising:
a plurality of processors for managing capture and processing of job listing information data captured through a data network from a plurality of job related sources for compilation into a searchable data structure;
an administrative portal module implemented on one of the plurality of processors for providing system administration and operational control through a network interface;
a scraping management module implemented on one of the plurality of processors for coordinating operation of and communication between one or more job scraping engines to obtain scraped job information data sets from corporate career sites and job boards identified by a site management module in the administrative portal module and store the scraped data sets in a database, each scraped data set comprising data fields;
a quality management module implemented on one of the plurality of processors coupled to the scraping management module for comparing the data fields of each scraped job data set stored in the database with predetermined quality rules, wherein the quality rules include document rules, and wherein if the data set fails one or more of the document rules, the data set will not be indexed in the database and the data set will be flagged for quality review;
a job listing data categorization module implemented on one of the plurality of processors operable to examine and categorize each job data set stored in the database into one or more predetermined job categories based upon a volume of the scraped job data sets and return categorized job data sets to the database; and
a search bank synchronizer implemented on one of the plurality of processors for communicating with the database for compiling and transferring categorized job data sets from the database to a job search bank.
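The quality management element of the claim can be illustrated with a minimal sketch. The claim does not say what the document rules actually check, so the rule names, the field names (`title`, `description`, `url`), and the `JobDataSet` class below are all hypothetical. The sketch shows only the claimed behavior: a data set that fails one or more document rules is not indexed and is flagged for quality review.

```python
from dataclasses import dataclass, field
from typing import Callable

# A document rule receives the data fields of one scraped job data set and
# returns True when the data set passes. The rule contents are assumptions.
DocumentRule = Callable[[dict[str, str]], bool]


@dataclass
class JobDataSet:
    """Hypothetical container for one scraped job data set."""
    fields: dict[str, str]
    indexed: bool = False
    flagged_for_review: bool = False
    failed_rules: list[str] = field(default_factory=list)


# Hypothetical document rules; the claim only says that such rules exist.
DOCUMENT_RULES: dict[str, DocumentRule] = {
    "has_title": lambda f: bool(f.get("title", "").strip()),
    "has_description": lambda f: bool(f.get("description", "").strip()),
    "has_source_url": lambda f: f.get("url", "").startswith(("http://", "https://")),
}


def apply_document_rules(data_set: JobDataSet) -> JobDataSet:
    """Compare the data fields with every document rule and record the outcome."""
    data_set.failed_rules = [
        name for name, rule in DOCUMENT_RULES.items() if not rule(data_set.fields)
    ]
    # Failing one or more rules: not indexed, flagged for quality review.
    data_set.flagged_for_review = bool(data_set.failed_rules)
    data_set.indexed = not data_set.failed_rules
    return data_set
```

Keeping the names of the failed rules on the data set is a design choice of this sketch, made so that a reviewer could see why a listing was held back; the claim itself requires only the flag.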
Abstract
A computer system and method for capturing, managing, and presenting data obtained from various, often unrelated, postings via the Internet for examination by a user. The system includes a scraping module having one or more scraping engines operable to scrape information data sets from listings on corporate sites and web sites, direct feeds, and other sources, wherein the scraping module receives and stores the scraped listing information data sets in a database. The system also has a management platform coordinating all operation of and communication between the sources, system administrators, and processing modules. The processing modules in the platform include a scraping management module that analyzes selected scraped data stored in the database, and a categorization module that examines and categorizes each data set stored in the database into one or more of a predetermined set of categories and returns categorized data sets to the database.
24 Claims
1. A computer system comprising:
a plurality of processors for managing capture and processing of job listing information data captured through a data network from a plurality of job related sources for compilation into a searchable data structure;
an administrative portal module implemented on one of the plurality of processors for providing system administration and operational control through a network interface;
a scraping management module implemented on one of the plurality of processors for coordinating operation of and communication between one or more job scraping engines to obtain scraped job information data sets from corporate career sites and job boards identified by a site management module in the administrative portal module and store the scraped data sets in a database, each scraped data set comprising data fields;
a quality management module implemented on one of the plurality of processors coupled to the scraping management module for comparing the data fields of each scraped job data set stored in the database with predetermined quality rules, wherein the quality rules include document rules, and wherein if the data set fails one or more of the document rules, the data set will not be indexed in the database and the data set will be flagged for quality review;
a job listing data categorization module implemented on one of the plurality of processors operable to examine and categorize each job data set stored in the database into one or more predetermined job categories based upon a volume of the scraped job data sets and return categorized job data sets to the database; and
a search bank synchronizer implemented on one of the plurality of processors for communicating with the database for compiling and transferring categorized job data sets from the database to a job search bank.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A method of obtaining, handling and compiling job information data sets comprising:
scraping, by a processor, job information data sets from one or more job listings on one or more corporate career sites or job boards available through the Internet;
storing, by the processor, a job data set corresponding to each scraped job listing found in a database, each job data set comprising data fields;
comparing, by the processor, each data field of each scraped data set stored in the database with predetermined quality criteria, wherein the quality criteria include document rules, and wherein if the data set fails one or more of the document rules, the data set will not be indexed in the database and the data set will be flagged for quality review;
categorizing, by the processor, each data set stored in the database into one or more predetermined job categories based upon a volume of the scraped data sets and returning the categorized job information data set to the database; and
communicating, by the processor, with the database for compiling and transferring categorized job data sets from the database to a job search bank.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 23, 24
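The order of the steps in claim 10 (scrape, store, compare, categorize, transfer) can be sketched end to end. This is an illustration under stated assumptions, not the patented method: scraping is replaced by a stand-in that copies in-memory listings, the document rule and the `CATEGORY_KEYWORDS` table are hypothetical, and categorization uses simple keyword matching because the claim does not explain how the "volume of the scraped data sets" enters the decision. The sketch also assumes that only indexed data sets are transferred to the search bank, which the claim does not state.

```python
from typing import Iterable

# Hypothetical mapping from predetermined job categories to trigger keywords.
CATEGORY_KEYWORDS = {
    "engineering": ("engineer", "developer"),
    "sales": ("sales", "account executive"),
}


def scrape(listings: Iterable[dict]) -> list[dict]:
    """Stand-in for scraping and storing: wrap each raw listing as a job data set."""
    return [
        {"fields": dict(raw), "indexed": False, "flagged": False, "categories": []}
        for raw in listings
    ]


def passes_document_rules(fields: dict) -> bool:
    """Hypothetical document rule: title and description must both be non-empty."""
    return bool(fields.get("title")) and bool(fields.get("description"))


def categorize(fields: dict) -> list[str]:
    """Keyword matching used as a placeholder for the claimed categorization."""
    text = f"{fields.get('title', '')} {fields.get('description', '')}".lower()
    return [
        category
        for category, keywords in CATEGORY_KEYWORDS.items()
        if any(keyword in text for keyword in keywords)
    ]


def run_pipeline(listings: Iterable[dict]) -> tuple[list[dict], list[dict]]:
    database = scrape(listings)  # scraping and storing steps
    for data_set in database:
        # Comparing step: a failing data set is not indexed and is flagged.
        if passes_document_rules(data_set["fields"]):
            data_set["indexed"] = True
        else:
            data_set["flagged"] = True
        # Categorizing step: applied to each data set stored in the database.
        data_set["categories"] = categorize(data_set["fields"])
    # Transfer step (assumption: only indexed, categorized data sets are sent).
    search_bank = [d for d in database if d["indexed"] and d["categories"]]
    return database, search_bank
```

A call such as `run_pipeline([{"title": "Software Engineer", "description": "Write code"}])` returns the in-memory database together with the subset that would be transferred to the search bank.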
21. A computer readable medium tangibly encoding a computer program of instructions for executing a computer process for scraping job description data from corporate career sites and job boards, the computer process comprising:
scraping listing information data from one or more listings on sites available through the Internet;
storing a scraped data set corresponding to each scraped listing information in a database, each scraped data set comprising data fields;
comparing data fields of each scraped data set stored in the database with predetermined quality criteria, wherein the quality criteria include document rules, and wherein if the data set fails one or more of the document rules, the data set will not be indexed in the database and the data set will be flagged for quality review;
categorizing each data set stored in the database into one or more predetermined categories based upon a volume of the scraped data sets and returning the categorized data set to the database; and
communicating with the database for compiling and transferring categorized job data sets from the database to a job search bank.
Dependent claims: 22
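The final step of the claimed process, compiling and transferring categorized job data sets to a job search bank, could look roughly like the sketch below. Both stores are modeled as plain dictionaries keyed by a hypothetical job identifier. Removing entries that no longer appear among the categorized data sets is an assumption of this sketch; the claim only recites compiling and transferring.

```python
def synchronize(database: dict[str, dict], search_bank: dict[str, dict]) -> dict[str, int]:
    """Copy categorized data sets into the search bank and report what changed."""
    # Compile: keep only data sets that carry at least one category.
    categorized = {
        job_id: data_set
        for job_id, data_set in database.items()
        if data_set.get("categories")
    }

    added = updated = 0
    for job_id, data_set in categorized.items():
        if job_id not in search_bank:
            added += 1
        elif search_bank[job_id] != data_set:
            updated += 1
        else:
            continue  # already in sync, nothing to transfer
        search_bank[job_id] = dict(data_set)  # transfer a shallow copy

    # Assumption: entries missing from the categorized set are dropped.
    stale = [job_id for job_id in search_bank if job_id not in categorized]
    for job_id in stale:
        del search_bank[job_id]

    return {"added": added, "updated": updated, "removed": len(stale)}
```

Returning the change counts makes repeated runs easy to check: a second call against an unchanged database should report no additions, updates, or removals.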
Specification