×

Normalizing non-numeric features of files

  • US 10,078,666 B2
  • Filed: 11/05/2015
  • Issued: 09/18/2018
  • Est. Priority Date: 11/28/2014
  • Status: Active Grant
First Claim
Patent Images

1. An apparatus for normalizing non-numeric features of files, comprising:

  • a token segmenting module configured to segment at least one pair of positive instances of a non-numeric feature of a file into a number of tokens, wherein the non-numeric feature of the file comprises a file storage path of a configuration file stored in a networked computer environment;

    a token matching module configured to compare the tokens in the at least one pair of positive instances to obtain matching tokens, wherein the token matching module comprises;

    a token matching score calculating sub-module configured to calculate a maximum matching score between each token in a positive instance with the tokens in another positive instance;

    a token selecting sub-module configured to select the tokens of which the maximum matching scores are greater than a given threshold, to get the matching tokens; and

    a token base constructing module configured to, for the matching tokens, calculate weights of their matching the file, and store the tokens and their weights in a token base, wherein the matching tokens identify similar configuration files stored in different locations in the networked computer environment.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×