×

Discriminative feature selection for data sequences

  • US 20040153307A1
  • Filed: 03/22/2004
  • Published: 08/05/2004
  • Est. Priority Date: 03/30/2001
  • Status: Abandoned Application
First Claim
Patent Images

1. A discriminative feature selection method for selecting a set of features from training data comprising a plurality of data sequences, said data sequences being generated from at least two data sources, and wherein each data sequence comprises a sequence of data symbols from an alphabet, said method comprising:

  • building a suffix tree from said training data, said suffix tree comprising suffixes of said data sequences having an empirical probability of occurrence from at least one of said sources greater than a first predetermined threshold; and

    pruning from said suffix tree all suffixes for which there exists in said suffix tree a shorter suffix having equivalent predictive capability for all of said data sources.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×