×

Computer method for identifying a misclassified software object in a cluster of internally similar software objects

  • US 5,317,741 A
  • Filed: 05/10/1991
  • Issued: 05/31/1994
  • Est. Priority Date: 05/10/1991
  • Status: Expired due to Term
First Claim
Patent Images

1. A method for use in a programmable computer system for identifying software objects that have been assigned to a wrong group, said group being intended to represent a respective cluster of internally similar software objects, wherein the similarity between objects is determined, such as by evaluating a similarity function, and wherein the input comprises a set of software objects, assigned to various groups, peer parameter K, and confidence parameter N, said method comprising the computer-implemented steps of:

  • (a) ascertaining the similarity between each pair of objects, such as by computing a similarity function such as Feature Ratio With Linking;

    (b) for each object O,(b.1) sorting O'"'"'s neighbors, nearest first,(b.2) examining O'"'"'s neighbors in order, counting how many of them are assigned to one or another group, until K are found that are assigned to the same group, recording the group name, say G, and the number of neighbors examined, say E,(b.3) if G is the group to which O is currently assigned, marking O as being correctly classified with confidence E-K and skipping to step (c), and(b.4) otherwise, continuing examining the neighbors in order until K have been found that are assigned to the same module as O, or until all neighbors have been examined, recording the number of neighbors examined, say F, marking O as being misclassified, with confidence F-K, and as likely belonging to group G with confidence E-K;

    (c) sorting the misclassified objects according to their mis-classification confidence, greatest first (here "greater" corresponds to "worse"), and outputing the list, reporting for each object the current group assignment, the mis-classification confidence, the group that the object likely belong to, and the confidence with which it likely belongs; and

    (d) sorting the objects that are correctly classified but with confidence greater than N (here "greater" corresponds to "worse"), sorting by confidence, greatest first, and outputing the sorted list, reporting for each object the confidence with which it belongs to the module to which it is currently assigned, whereby the likelihood of misclassification of objects is ascertainable by the respective confidence level.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×