×

N-gram analysis of inputs to a software application

  • US 9,880,915 B2
  • Filed: 03/05/2014
  • Issued: 01/30/2018
  • Est. Priority Date: 03/05/2014
  • Status: Active Grant
First Claim
Patent Images

1. A method, implemented at a distributed computer system that includes at least one computer processor, said method for analyzing an application based on n-gram sequences associated with inputs of said application, said method comprising:

  • executing an application in a production environment that comprises a first computer system of said distributed computer system;

    receiving first tracer data observed from execution of said application in said production environment, said first tracer data observed from execution of said application in said production environment comprising a first plurality of inputs provided to said application during execution in said production environment;

    identifying, within said first tracer data, a first plurality of n-gram sequences of said first plurality of inputs, each of said first plurality of n-gram sequences comprising at least one of a first plurality of input parameter sequences;

    identifying, from a usage frequency database comprising usage data for each of said first plurality of n-gram sequences from said first tracer data, one or more ways in which said application was used during execution in said production environment;

    based on said one or more ways in which said application was used during execution in said production environment, identifying one or more characteristics of a test environment for said application;

    based on said one or more characteristics, configuring a test environment that comprises a second computer system of said distributed computer system;

    executing said application in said test environment that includes said one or more identified characteristics;

    receiving second tracer data observed from execution of said application during execution in said test environment, said second tracer data observed from execution of said application in said test environment comprising a second plurality of inputs provided to said application during execution in said test environment;

    identifying, within said second tracer data, a second plurality of n-gram sequences of said second plurality of inputs, each of said second plurality of n-gram sequences comprising at least one of a second plurality of input parameter sequences;

    identifying a subset of said first plurality of n-gram sequences contained in said second plurality of n-gram sequences of said second tracer data; and

    comparing said subset of said first plurality of n-gram sequences contained in said second plurality of n-gram sequences to said usage frequency database, wherein comparing comprises mapping said subset of said first plurality of n-gram sequences contained in said second plurality of n-gram sequences to said usage frequency database to thereby determine a test coverage factor of said application.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×