×

Normalizing non-numeric features of files

  • US 10,078,667 B2
  • Filed: 12/13/2015
  • Issued: 09/18/2018
  • Est. Priority Date: 11/28/2014
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method for normalizing non-numeric features of files, comprising:

  • segmenting at least one pair of positive instances of a non-numeric feature of a file into a number of tokens, wherein the non-numeric feature of the file comprises a file storage path of a configuration file stored in a networked computer environment;

    comparing the tokens in the at least one pair of positive instances to obtain matching tokens by;

    calculating the maximum matching score between each token in a positive instance with the tokens in another positive instance;

    selecting the tokens of which the maximum matching scores are greater than a given threshold, to get the matching tokens; and

    for each of the matching tokens, calculating weights of their matching the file, and storing the tokens and their weights in a token base, wherein the matching tokens identify similar configuration files stored in different locations in the networked computer environment.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×