Computerized systems and methods for generating interactive cluster charts of human resources-related documents
First Claim
1. A computer system for generating a cluster chart for HR-related documents, the computer system comprising:
a client computer device comprising a software application for displaying content; and
a host computer data center in communication with the client computer device via an electronic data communication network, wherein the host computer data center comprises:
a database for electronically storing HR-related documents that comprise at least one of resumes and job descriptions;
a web server that serves web pages to the client computer device via the network that are renderable by the software application of the client computer device, and wherein the web server receives requests from the software application of the client computer device for web pages via the network; and
a programmable computer device that is in communication with the web server and that is programmed to:
determine clusters of concepts in a collection of HR-related documents in the database, wherein the collection of HR-related documents is identified based on search criteria submitted from the client computer device, and wherein the clusters of concepts in the collection are determined by:
determining whether terms appearing in the collection of HR-related documents are candidates for cluster labels based on, in part, a frequency of occurrence of the terms in the collection of HR-related documents, wherein the terms comprise at least one of single terms and phrases;
identifying distinct concepts in the collection of HR-related documents through cluster-label induction that includes:
applying Singular Value Decomposition (SVD) to a term-document matrix constructed from high-frequency terms to determine an orthogonal basis of the term-document matrix, wherein the high-frequency terms are terms that exceed a threshold occurrence frequency in the collection of HR-related documents;
selecting a first set of k vectors of the orthogonal basis, which represent k concepts in the collection of HR-related documents, to be the cluster candidates; and
calculating distances between the high-frequency terms in the collection of HR-related documents to the k concepts to determine labels for the cluster candidates;
assign each of the HR-related documents in the collection to one or more of the determined clusters using a Vector Space Model sorting algorithm; and
generate a chart graphically showing the clusters, wherein the determined labels for clusters is shown in the chart and each cluster has a characteristic that is related to the quantity of the HR-related documents assigned to the cluster; and
the web server serves the chart in a cluster chart web page to the client computer device, wherein:
the cluster chart web page comprises a document listing field; and
each cluster in the cluster chart web page served to the client computer device comprises a hyperlink that when activated from the client computer device, to thereby select a cluster, causes the document listing field to list the HR-related documents assigned to the selected cluster.
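The cluster-label induction and document assignment steps recited above can be illustrated with a minimal Python sketch. It is not taken from the patent: the function names `induce_cluster_labels` and `assign_documents`, the sample documents, the frequency threshold of 1, the use of cosine similarity as the distance measure, and the 0.5 assignment cutoff are all assumptions chosen for illustration, and only single terms (not phrases) are considered.

```python
import numpy as np

def induce_cluster_labels(docs, k, threshold=1):
    """Pick one label per concept from the top-k SVD basis vectors (illustrative)."""
    tokenized = [doc.lower().split() for doc in docs]
    # Count occurrences of each term across the whole collection.
    counts = {}
    for tokens in tokenized:
        for term in tokens:
            counts[term] = counts.get(term, 0) + 1
    # High-frequency terms: those that exceed the threshold occurrence frequency.
    terms = sorted(t for t, c in counts.items() if c > threshold)
    index = {t: i for i, t in enumerate(terms)}
    # Term-document matrix: rows are terms, columns are documents.
    A = np.zeros((len(terms), len(docs)))
    for j, tokens in enumerate(tokenized):
        for term in tokens:
            if term in index:
                A[index[term], j] += 1.0
    # Columns of U form an orthogonal basis of the term space, ordered by
    # decreasing singular value; the first k columns are the cluster candidates.
    U, _, _ = np.linalg.svd(A, full_matrices=False)
    concepts = U[:, :k]
    # A single term is a unit vector in term space, so its cosine similarity to
    # a unit-length concept vector is the absolute value of one component.
    labels = [terms[int(np.argmax(np.abs(concepts[:, c])))] for c in range(k)]
    return labels, terms, A

def assign_documents(A, terms, labels, min_similarity=0.5):
    """Vector Space Model assignment: cosine of each document to each label."""
    clusters = {label: [] for label in labels}
    norms = np.linalg.norm(A, axis=0)
    for label in labels:
        row = A[terms.index(label)]
        for j in range(A.shape[1]):
            if norms[j] > 0 and row[j] / norms[j] >= min_similarity:
                clusters[label].append(j)
    return clusters

# Hypothetical collection returned by a search (resume snippets).
docs = [
    "java developer", "java spring", "java developer spring", "java developer",
    "nurse icu", "nurse clinic", "nurse icu",
]
labels, terms, A = induce_cluster_labels(docs, k=2)
clusters = assign_documents(A, terms, labels)
print(labels)
print(clusters)
```

A document may land in more than one cluster here, matching the "one or more of the determined clusters" language; a production system would likely add term weighting (for example TF-IDF) and phrase extraction, neither of which is shown.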
Abstract
Computer systems and methods generate a cluster chart for HR-related documents. A host computer data center comprises a database for electronically storing HR-related documents, a web server, and a programmable computer device. The programmable computer device is programmed to determine clusters of prevalent terms in a collection of HR-related documents in the database. The collection of HR-related documents from which the clusters are generated is identified based on search criteria submitted from a client computer device. The clusters of prevalent terms in the collection can be determined using a clustering algorithm employing algebraic transformations of a term-document matrix generated from the collection of HR-related documents. The programmable computer device is also programmed to assign each of the HR-related documents in the collection to one or more of the determined clusters, and to generate a chart graphically showing the clusters. Each cluster in the chart has a characteristic (e.g., size) that is related to the quantity of the HR-related documents assigned to the cluster. A web server serves the chart in a cluster chart web page to the client computer device. The cluster chart web page comprises a document listing field. Each cluster in the cluster chart web page comprises a hyperlink that when activated from the client computer device, to thereby select a cluster, causes the document listing field to list the HR-related documents assigned to the selected cluster.
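As a rough illustration of the chart described in the abstract, the sketch below renders each cluster as a hyperlinked element whose size depends on the number of assigned documents. The function name `cluster_chart_html`, the `/documents?cluster=` URL, the CSS class, and the square-root scaling (area proportional to document count) are assumptions made for this example; the abstract only requires that some characteristic, such as size, relate to the document quantity and that activating the link lists the cluster's documents.

```python
import math
from html import escape
from urllib.parse import quote

def cluster_chart_html(clusters, max_radius=60):
    """Build one hyperlinked bubble per cluster (assumes at least one non-empty cluster)."""
    largest = max(len(doc_ids) for doc_ids in clusters.values())
    parts = []
    for label, doc_ids in clusters.items():
        # Scale the radius with the square root of the count so that the
        # bubble's area, not its diameter, tracks the number of documents.
        radius = round(max_radius * math.sqrt(len(doc_ids) / largest))
        # Hypothetical endpoint that would populate the document listing field.
        href = "/documents?cluster=" + quote(label)
        parts.append(
            f'<a class="cluster" href="{href}" '
            f'style="width:{2 * radius}px;height:{2 * radius}px">'
            f"{escape(label)} ({len(doc_ids)})</a>"
        )
    return "\n".join(parts)

# Hypothetical cluster assignment: label -> indices of assigned documents.
chart = cluster_chart_html({"java": [0, 1, 2, 3], "nurse": [4, 5, 6]})
print(chart)
```

Selecting a bubble would then request the listing for that cluster, which the server could render into the page's document listing field.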
16 Citations
15 Claims
1. A computer system for generating a cluster chart for HR-related documents (independent claim; full text appears under "First Claim" above).
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 14
9. A computer-implemented method for generating a cluster chart for HR-related documents, the method comprising:
electronically storing HR-related documents in a computer database of a host data center, wherein the HR-related documents comprise at least one of resumes and job descriptions;
receiving, by a web server of the host data center, search criteria from a client computer device that is in communication with the host data center via an electronic data communication network;
determining, by a programmable computer device of the host data center, clusters of concepts in a collection of HR-related documents in the database, wherein the collection of HR-related documents is identified based on the search criteria received from the client computer device, and wherein the clusters of concepts in the collection are determined by:
determining whether terms appearing in the collection of HR-related documents are candidates for cluster labels based on, in part, a frequency of occurrence of the terms in the collection of HR-related documents, wherein the terms comprise at least one of single terms and phrases;
identifying distinct concepts in the collection of HR-related documents through cluster-label induction that includes:
applying Singular Value Decomposition (SVD) to a term-document matrix constructed from high-frequency terms to determine an orthogonal basis of the term-document matrix, wherein the high-frequency terms are terms that exceed a threshold occurrence frequency in the collection of HR-related documents;
selecting a first set of k vectors of the orthogonal basis, which represent k concepts in the collection of HR-related documents, to be the cluster candidates; and
calculating distances between the high-frequency terms in the collection of HR-related documents to the k concepts to determine labels for the cluster candidates;
assigning, by the programmable computer device, each of the HR-related documents in the collection to one or more of the determined clusters using a Vector Space Model sorting algorithm;
generating, by the programmable computer device, a chart graphically showing the clusters, wherein the determined labels for clusters is shown in the chart and each cluster has a characteristic that is related to the quantity of the HR-related documents assigned to the cluster; and
serving, by the web server, the chart in a cluster chart web page to the client computer device via the network, wherein:
the cluster chart web page comprises a document listing field; and
each cluster in the cluster chart web page served to the client computer device comprises a hyperlink that when activated from the client computer device, to thereby select a cluster, causes the document listing field to list the HR-related documents assigned to the selected cluster.
Dependent claims: 10, 11, 12, 13, 15
Specification