×

Method and apparatus for automatic comparison of data sequences using local and global relationships

  • US 8,271,403 B2
  • Filed: 12/08/2006
  • Issued: 09/18/2012
  • Est. Priority Date: 12/09/2005
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for automatic byte stream comparison of at least two data sequences, a first and second data sequence comprising one or more of symbols, images, text, ASCII characters, genetic data, protein data, bytes, binary data, or tokens as objects are, the method comprising the steps of:

  • performing an evaluation of;

    a) a local relationship between any pair of subsequences in two or more sequences received from a byte stream, wherein subsequences for evaluation of the local relationship are specified on a com uterized detector by a subsequence selection mode comprising one of;

    words, wherein the words are subsequences separated by a given set of delimiters;

    n-grams, wherein the n-grams are overlapping subsequences of a given length n; and

    all possible subsequences of two or more sequences;

    b) performing an evaluation of a global relationship by aggregation of a plurality of evaluations of said local relationships, wherein evaluation of the global relationships is performed by one of the following data structures or a representation thereof;

    a hash table or indexed table;

    a trie or compacted trie;

    a suffix tree or suffix array; and

    a generalized suffix tree or generalized suffix array; and

    c) wherein the totality of local and global relationships comprises a measure s for similarity or dissimilarity of two or more sequences.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×