Search engine
First Claim
1. A method comprising:
defining a vector vocabulary;
defining an occupation taxonomy that includes multiple different occupations;
obtaining multiple labeled training data items, wherein each labeled training data item is associated with at least (i) a job title, and (ii) an occupation, wherein the occupation includes a category that encompasses multiple job titles that describe the same job;
generating, for each of the respective labeled training data items, an occupation vector that includes a feature weight for each respective term in the vector vocabulary that is based on (i) a value indicating an inverse occupation frequency that is determined based on a number of occupations in the occupation taxonomy where each respective term in the job title of the respective training data item is present and (ii) a value representing an occupation derivative that is based on a density of each respective term in the job title of the respective training data item across each of the respective occupations in the occupation taxonomy;
associating each respective occupation vector with an occupation in the occupation taxonomy based on the occupation of the labeled training data item used to generate the occupation vector;
receiving a search query that includes a string related to a characteristic of one or more potential job opportunities;
generating a first vector based on the received query;
determining, for each respective occupation of the multiple occupations in the occupation taxonomy, a confidence score that is indicative of whether the query vector is correctly classified in the respective occupation;
selecting the particular occupation that is associated with the highest confidence score;
obtaining one or more job postings using the selected occupation; and
providing the obtained job postings in a set of search results in response to the search query.
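The weighting step recited above can be illustrated with a minimal sketch. The claim does not give formulas, so everything below is an assumption made for illustration: the inverse occupation frequency is taken to be a smoothed logarithm of the ratio of occupations to occupations containing the term, the "density" of a term in an occupation is taken to be the fraction of that occupation's training titles containing it, and the "occupation derivative" is approximated by how far the peak density rises above the mean density across occupations. The function name `build_occupation_vectors` and the sample data are hypothetical.

```python
import math
from collections import defaultdict


def build_occupation_vectors(training_items, vocabulary):
    """Return one (occupation, weights) pair per (job_title, occupation) item."""
    titles_per_occ = defaultdict(int)
    term_counts = defaultdict(lambda: defaultdict(int))
    for title, occ in training_items:
        titles_per_occ[occ] += 1
        for term in set(title.lower().split()):
            term_counts[occ][term] += 1

    occupations = list(titles_per_occ)
    n_occ = len(occupations)

    def density(term, occ):
        # ASSUMPTION: share of the occupation's titles that contain the term.
        return term_counts[occ][term] / titles_per_occ[occ]

    def inverse_occupation_frequency(term):
        # ASSUMPTION: smoothed log ratio, by analogy with inverse document frequency.
        present_in = sum(1 for occ in occupations if term_counts[occ][term] > 0)
        return math.log((1 + n_occ) / (1 + present_in)) + 1.0

    def occupation_derivative(term):
        # ASSUMPTION: stand-in that is 0 for evenly spread terms and grows
        # as the term concentrates in a single occupation.
        densities = [density(term, occ) for occ in occupations]
        return max(densities) - sum(densities) / n_occ

    vectors = []
    for title, occ in training_items:
        terms = set(title.lower().split())
        weights = [
            inverse_occupation_frequency(t) * occupation_derivative(t)
            if t in terms else 0.0
            for t in vocabulary
        ]
        # Each vector stays associated with the occupation label of its item.
        vectors.append((occ, weights))
    return vectors


# Hypothetical labeled training data: (job title, occupation).
training_items = [
    ("software engineer", "software"),
    ("software developer", "software"),
    ("registered nurse", "nursing"),
    ("nurse practitioner", "nursing"),
]
vocabulary = sorted({t for title, _ in training_items for t in title.lower().split()})
vectors = build_occupation_vectors(training_items, vocabulary)
```

Under these assumptions, a term that appears in every title of one occupation and nowhere else receives a larger weight than a term that appears in only some of that occupation's titles, and a term spread evenly across all occupations receives a weight of zero.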
Abstract
Methods, systems, and apparatus, including computer programs encoded on storage devices, for performing a job opportunity search. In one aspect, a system includes a data processing apparatus, and a computer-readable storage device having stored thereon instructions that, when executed by the data processing apparatus, cause the data processing apparatus to perform operations. The operations include defining a vector vocabulary, defining an occupation taxonomy that includes multiple different occupations, obtaining multiple labeled training data items, wherein each labeled training data item is associated with at least (i) a job title, and (ii) an occupation, generating, for each of the respective labeled training data items, an occupation vector that includes a feature weight for each respective term in the vector vocabulary, and associating each respective occupation vector with an occupation in the occupation taxonomy based on the occupation of the labeled training data item used to generate the occupation vector.
12 Claims
1. A method comprising:
defining a vector vocabulary;
defining an occupation taxonomy that includes multiple different occupations;
obtaining multiple labeled training data items, wherein each labeled training data item is associated with at least (i) a job title, and (ii) an occupation, wherein the occupation includes a category that encompasses multiple job titles that describe the same job;
generating, for each of the respective labeled training data items, an occupation vector that includes a feature weight for each respective term in the vector vocabulary that is based on (i) a value indicating an inverse occupation frequency that is determined based on a number of occupations in the occupation taxonomy where each respective term in the job title of the respective training data item is present and (ii) a value representing an occupation derivative that is based on a density of each respective term in the job title of the respective training data item across each of the respective occupations in the occupation taxonomy;
associating each respective occupation vector with an occupation in the occupation taxonomy based on the occupation of the labeled training data item used to generate the occupation vector;
receiving a search query that includes a string related to a characteristic of one or more potential job opportunities;
generating a first vector based on the received query;
determining, for each respective occupation of the multiple occupations in the occupation taxonomy, a confidence score that is indicative of whether the query vector is correctly classified in the respective occupation;
selecting the particular occupation that is associated with the highest confidence score;
obtaining one or more job postings using the selected occupation; and
providing the obtained job postings in a set of search results in response to the search query.
Dependent claims: 2, 3, 4
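The query-side steps of claim 1 (vectorizing the query, scoring every occupation, selecting the highest score, and retrieving postings) can be sketched as follows. The claim does not say how the confidence score is computed, so cosine similarity against the stored occupation vectors is used here purely as an assumed stand-in for whatever classifier an implementation would use. The names `vectorize`, `classify_query`, `search`, and all sample data are hypothetical.

```python
import math


def vectorize(text, vocabulary):
    """Binary bag-of-words vector over the vocabulary (an assumed encoding)."""
    terms = set(text.lower().split())
    return [1.0 if t in terms else 0.0 for t in vocabulary]


def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def classify_query(query, vocabulary, occupation_vectors):
    """Score each occupation and return (best_occupation, scores)."""
    query_vector = vectorize(query, vocabulary)
    scores = {}
    for occupation, vector in occupation_vectors:
        # ASSUMPTION: an occupation's confidence is its best-matching vector.
        score = cosine(query_vector, vector)
        scores[occupation] = max(scores.get(occupation, 0.0), score)
    best = max(scores, key=scores.get)
    return best, scores


def search(query, vocabulary, occupation_vectors, postings_by_occupation):
    """Return the postings indexed under the highest-scoring occupation."""
    occupation, _ = classify_query(query, vocabulary, occupation_vectors)
    return postings_by_occupation.get(occupation, [])


# Hypothetical data for illustration only.
vocabulary = ["developer", "engineer", "nurse", "software"]
occupation_vectors = [
    ("software", [0.4, 0.4, 0.0, 0.7]),
    ("nursing", [0.0, 0.0, 0.7, 0.0]),
]
postings_by_occupation = {
    "software": ["Backend Engineer (example posting)"],
    "nursing": ["Staff Nurse (example posting)"],
}

best, scores = classify_query("software engineer jobs", vocabulary, occupation_vectors)
results = search("software engineer jobs", vocabulary, occupation_vectors,
                 postings_by_occupation)
```

Query terms outside the vocabulary (here, "jobs") are simply ignored by this encoding. The sketch does not handle an empty set of occupation vectors.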
5. A system comprising:
one or more processors; and
one or more computer storage media, the computer storage media comprising instructions that, when executed by the one or more processors, cause the one or more processors to perform operations, the operations comprising:
defining a vector vocabulary;
defining an occupation taxonomy that includes multiple different occupations;
obtaining multiple labeled training data items, wherein each labeled training data item is associated with at least (i) a job title, and (ii) an occupation, wherein the occupation includes a category that encompasses multiple job titles that describe the same job;
generating, for each of the respective labeled training data items, an occupation vector that includes a feature weight for each respective term in the vector vocabulary that is based on (i) a value indicating an inverse occupation frequency that is determined based on a number of occupations in the occupation taxonomy where each respective term in the job title of the respective training data item is present and (ii) a value representing an occupation derivative that is based on a density of each respective term in the job title of the respective training data item across each of the respective occupations in the occupation taxonomy;
associating each respective occupation vector with an occupation in the occupation taxonomy based on the occupation of the labeled training data item used to generate the occupation vector;
receiving a search query that includes a string related to a characteristic of one or more potential job opportunities;
generating a first vector based on the received query;
determining, for each respective occupation of the multiple occupations in the occupation taxonomy, a confidence score that is indicative of whether the query vector is correctly classified in the respective occupation;
selecting the particular occupation that is associated with the highest confidence score;
obtaining one or more job postings using the selected occupation; and
providing the obtained job postings in a set of search results in response to the search query.
Dependent claims: 6, 7, 8
9. A non-transitory computer-readable medium storing software comprising instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising:
defining a vector vocabulary;
defining an occupation taxonomy that includes multiple different occupations;
obtaining multiple labeled training data items, wherein each labeled training data item is associated with at least (i) a job title, and (ii) an occupation, wherein the occupation includes a category that encompasses multiple job titles that describe the same job;
generating, for each of the respective labeled training data items, an occupation vector that includes a feature weight for each respective term in the vector vocabulary that is based on (i) a value indicating an inverse occupation frequency that is determined based on a number of occupations in the occupation taxonomy where each respective term in the job title of the respective training data item is present and (ii) a value representing an occupation derivative that is based in part on a density of each respective term in the job title of the respective training data item across each of the respective occupations in the occupation taxonomy;
associating each respective occupation vector with an occupation in the occupation taxonomy based on the occupation of the labeled training data item used to generate the occupation vector;
receiving a search query that includes a string related to a characteristic of one or more potential job opportunities;
generating a first vector based on the received query;
determining, for each respective occupation of the multiple occupations in the occupation taxonomy, a confidence score that is indicative of whether the query vector is correctly classified in the respective occupation;
selecting the particular occupation that is associated with the highest confidence score;
obtaining one or more job postings using the selected occupation; and
providing the obtained job postings in a set of search results in response to the search query.
Dependent claims: 10, 11, 12
Specification