Skill extraction system
First Claim
Patent Images
1. A method comprising:
- extracting a plurality of skill seed phrases from a plurality of member profiles of a social networking service;
using one or more computer processors, disambiguating the plurality of skill seed phrases to create a plurality of disambiguated skill seed phrases; and
de-duplicating the plurality of disambiguated skill seed phrases to create a plurality of de-duplicated skill seed phrases, the de-duplicated skill seed phrases identifying a plurality of skills; and
wherein de-duplicating the plurality of disambiguated skill seed phrases comprises;
calculating a plurality of internet search queries, at least one for each of the plurality of disambiguated skill seed phrases, the internet search queries, when executed, returning a list of internet web pages;
for each of the plurality of internet search queries, submitting to a crowd sourcing application, over a network, a job for which the crowd sourcing application returns a specific internet web page selected from the list of internet web pages that is returned from each of the plurality of the internet search queries, the specific internet web page representing an interpretation of a meaning of the disambiguated skill seed phrase corresponding to each internet search query; and
using the plurality of returned specific internet web pages to determine which of the plurality of disambiguated skill seed phrases identify identical skills.
2 Assignments
0 Petitions
Accused Products
Abstract
In an example, disclosed is a machine automated method of identifying a set of skills. In some examples, the method includes extracting a plurality of skill seed phrases from a plurality of member profiles of a social networking site, creating a plurality of disambiguated skill seed phrases by disambiguating the plurality of skill seed phrases using one or more computer processors, and de-duplicating the plurality of disambiguated skill seed phrases to create a plurality of de-duplicated skill seed phrases.
50 Citations
24 Claims
-
1. A method comprising:
-
extracting a plurality of skill seed phrases from a plurality of member profiles of a social networking service; using one or more computer processors, disambiguating the plurality of skill seed phrases to create a plurality of disambiguated skill seed phrases; and de-duplicating the plurality of disambiguated skill seed phrases to create a plurality of de-duplicated skill seed phrases, the de-duplicated skill seed phrases identifying a plurality of skills; and wherein de-duplicating the plurality of disambiguated skill seed phrases comprises; calculating a plurality of internet search queries, at least one for each of the plurality of disambiguated skill seed phrases, the internet search queries, when executed, returning a list of internet web pages; for each of the plurality of internet search queries, submitting to a crowd sourcing application, over a network, a job for which the crowd sourcing application returns a specific internet web page selected from the list of internet web pages that is returned from each of the plurality of the internet search queries, the specific internet web page representing an interpretation of a meaning of the disambiguated skill seed phrase corresponding to each internet search query; and
using the plurality of returned specific internet web pages to determine which of the plurality of disambiguated skill seed phrases identify identical skills. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system, comprising:
one or more computer processors configured to include; an extraction module configured to extract a plurality of skill seed phrases from a plurality of member profiles of a social networking service; a disambiguation module configured to disambiguating the plurality of skill seed phrases to create a plurality of disambiguated skill seed phrases; and a de-duplication module configured to; de-duplicate the plurality of disambiguated skill seed phrases to create a plurality of de-duplicated skill seed phrases, the plurality of de-duplicated skill seed phrases identifying a plurality of skills; wherein the de-duplication module is configured to de-duplicate the plurality of disambiguated skill seed phrases by at least being configured to; calculate a plurality of internet search queries, at least one for each of the disambiguated skill seed phrases, the internet search queries, when executed, returning a list of internet web pages; for each of the plurality of internet search queries, submit to a crowd sourcing application over a network, a job for which the crowd sourcing application returns a specific internet web page selected from the list of internet web pages that is returned from each of the plurality of the internet search queries, the specific internet web page representing an interpretation of a meaning of the disambiguated skill seed phrase corresponding to each internet search query; and
use the plurality of returned specific internet web pages to determine which of the plurality of disambiguated skill seed phrases identify identical skills.- View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
17. A non-transitory machine readable storage medium including instructions, which when executed on the machine, cause the machine to:
-
extract a plurality of skill seed phrases from a plurality of member profiles of a social networking service; disambiguate the plurality of skill seed phrases to create a plurality of disambiguated skill seed phrases; and de-duplicate the plurality of disambiguated skill seed phrases to create a plurality of de-duplicated skill seed phrases, the de-duplicated skill seed phrases identifying a plurality of skills; and wherein the instructions for de-duplication cause the machine to; calculate a plurality of internet search queries, at least one for each of the plurality of disambiguated skill seed phrases, the internet search queries, when executed returning a list of internet web pages; for each of the plurality of internet search queries, submitting to a crowd sourcing application, over a network, a job for which the crowd sourcing application returns a specific internet web page selected from the list of internet web pages that is returned from each of the plurality of the internet search queries, the specific internet web page representing an interpretation of a meaning of the disambiguated skill seed phrase corresponding to each internet search query; and use the plurality of returned specific internet web pages to determine which of the plurality of disambiguated skill seed phrases identify identical skills. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24)
-
Specification