×

Computerized systems and methods for generating interactive cluster charts of human resources-related documents

  • US 9,946,787 B2
  • Filed: 06/03/2016
  • Issued: 04/17/2018
  • Est. Priority Date: 06/12/2015
  • Status: Active Grant
First Claim
Patent Images

1. A computer system for generating a cluster chart for HR-related documents, the computer system comprising:

  • a client computer device comprising a software application for displaying content; and

    a host computer data center in communication with the client computer device via an electronic data communication network, wherein the host computer data center comprises;

    a database for electronically storing HR-related documents that comprise at least one of resumes and job descriptions;

    a web server that serves web pages to the client computer device via the network that are renderable by the software application of the client computer device, and wherein the web server receives requests from the software application of the client computer device for web pages via the network; and

    a programmable computer device that is in communication with the web server and that is programmed to;

    determine clusters of concepts in a collection of HR-related documents in the database, wherein the collection of HR-related documents is identified based on search criteria submitted from the client computer device, and wherein the clusters of concepts in the collection are determined by;

    determining whether terms appearing in the collection of HR-related documents are candidates for cluster labels based on, in part, a frequency of occurrence of the terms in the collection of HR-related documents, wherein the terms comprise at least one of single terms and phrases;

    identifying distinct concepts in the collection of HR-related documents through cluster-label induction that includes;

    applying Singular Value Decomposition (SVD) to a term document matrix constructed from high-frequency terms to determine an orthogonal basis of the term-document matrix, wherein the high-frequency terms are terms that exceed a threshold occurrence frequency in the collection of HR-related documents;

    selecting a first set of k vectors of the orthogonal basis, which represent k concepts in the collection of HR-related documents, to be the cluster candidates; and

    calculating distances between the high-frequency terms in the collection of HR-related documents to the k concepts to determine labels for the cluster candidates;

    assign each of the HR-related documents in the collection to one or more of the determined clusters using a Vector Space Model sorting algorithm; and

    generate a chart graphically showing the clusters, wherein the determined labels for clusters is shown in the chart and each cluster has a characteristic that is related to the quantity of the HR-related documents assigned to the cluster; and

    the web server serves the chart in a cluster chart web page to the client computer device, wherein;

    the cluster chart web page comprises a document listing field; and

    each cluster in the cluster chart web page served to the client computer device comprises a hyperlink that when activated from the client computer device, to thereby select a cluster, causes the document listing field to list the HR-related documents assigned to the selected cluster.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×