Please download the dossier by clicking on the dossier button x
×

System and Method for Creating Labels for Clusters

  • US 20150006531A1
  • Filed: 02/25/2014
  • Published: 01/01/2015
  • Est. Priority Date: 07/01/2013
  • Status: Active Grant
First Claim
Patent Images

1. A system for creating at least one label for at least one cluster in a computing environment, the system comprising:

  • a processor; and

    a memory coupled to the processor, wherein the processor is capable of executing a plurality of modules stored in the memory, and wherein the plurality of modules comprise;

    a receiving module configured to receive an input data;

    a candidate items selector configured to select a plurality of candidate items occurring repetitively in the input data using a n-gram selection technique for a predefined value of n to generate a sorted list of the plurality of candidate items with a frequency of occurrence of the plurality of candidate items based on the input data;

    a combination array generator configured to select a predefined number of the plurality of candidate items from the sorted list of the plurality of candidate items to populate a two-dimensional array having a plurality of elements, wherein each element of the plurality of elements of the two-dimensional array represents a pair of the plurality of candidate items;

    a coverage value analyzer configured to determine a coverage value for each pair of the plurality of candidate items present in the two-dimensional array to further populate a sorted two-dimensional array;

    a candidate pair selector configured to select a predefined number of pairs of the plurality of candidate items from the sorted two-dimensional array to further process and generate a list of the pairs of the plurality of candidate items;

    a unique word filter configured to accept the list of the pairs of the plurality of candidate items to determine a number of unique words in each of the pairs of the plurality of candidate items; and

    a cluster label selector configured to sort the list of the pairs of the plurality of candidate items using the coverage value and the number of unique words to create a sorted list of the pairs of the plurality of candidate items for selecting a cluster label from the sorted list of the pairs of the plurality of candidate items.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×