Systems and methods for efficient data searching, storage and reduction

  • US 20060059173A1
  • Filed: 09/15/2004
  • Published: 03/16/2006
  • Est. Priority Date: 09/15/2004
  • Status: Active Grant
First Claim

1. A method for identifying input data in repository data comprising:

  • providing an index of repository data, including at least N distinguishing characteristics for each of a plurality of chunks of the repository data;

    partitioning the input data into a plurality of input chunks;

    for each input chunk, determining at least K distinguishing characteristics and searching the index for each of the K distinguishing characteristics until at least J matches with the repository distinguishing characteristics are found, and if J matches are found for an input chunk and a respective repository chunk, the respective repository chunk being determined to be a similar repository chunk where J ≦ N ≦ K; and

    computing at least one of common and noncommon sections of the input chunk and similar repository chunk using the matching distinguishing characteristics as anchors to define corresponding intervals in the input chunk and similar repository chunk.
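
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation, and several details are assumptions made only for illustration: chunks are fixed-size, a "distinguishing characteristic" is taken to be one of the largest BLAKE2b hashes over 8-byte windows of a chunk, and common sections are found by extending each anchor byte by byte in both directions. The names `characteristics`, `split`, `build_index`, `find_similar`, and `common_sections` are hypothetical.

```python
import hashlib
import random

SEED = 8  # bytes per hashed window (assumed value, not from the claim)


def characteristics(chunk, count):
    """Map each of the `count` largest window hashes to its offset in `chunk`."""
    offsets = {}
    for off in range(len(chunk) - SEED + 1):
        digest = hashlib.blake2b(chunk[off:off + SEED], digest_size=8).digest()
        offsets.setdefault(int.from_bytes(digest, "big"), off)
    return {h: offsets[h] for h in sorted(offsets, reverse=True)[:count]}


def split(data, size):
    """Partition data into fixed-size chunks (assumed chunking scheme)."""
    return [data[i:i + size] for i in range(0, len(data), size)]


def build_index(repo_chunks, n):
    """Index N characteristics per repository chunk: hash -> (chunk id, offset)."""
    index = {}
    for cid, chunk in enumerate(repo_chunks):
        for h, off in characteristics(chunk, n).items():
            index.setdefault(h, (cid, off))
    return index


def find_similar(chunk, index, k, j):
    """Look up K characteristics; stop once one repository chunk has J matches."""
    anchors = {}  # repository chunk id -> [(input offset, repository offset)]
    for h, in_off in characteristics(chunk, k).items():
        hit = index.get(h)
        if hit is None:
            continue
        cid, repo_off = hit
        anchors.setdefault(cid, []).append((in_off, repo_off))
        if len(anchors[cid]) >= j:
            return cid, sorted(anchors[cid])
    return None


def common_sections(inp, repo, anchors):
    """Grow each anchor left and right while the bytes agree.

    Returns sorted (input start, repository start, length) intervals.
    """
    sections = set()
    for i, r in anchors:
        lo = 0
        while i - lo > 0 and r - lo > 0 and inp[i - lo - 1] == repo[r - lo - 1]:
            lo += 1
        hi = 0
        while i + hi < len(inp) and r + hi < len(repo) and inp[i + hi] == repo[r + hi]:
            hi += 1
        sections.add((i - lo, r - lo, lo + hi))
    return sorted(sections)


# Demo: a 1024-byte pseudo-random repository in four 256-byte chunks,
# and an input chunk equal to repository chunk 2 with one byte flipped.
rng = random.Random(0)
repo_chunks = split(bytes(rng.getrandbits(8) for _ in range(1024)), 256)
index = build_index(repo_chunks, n=8)       # N = 8

inp = bytearray(repo_chunks[2])
inp[100] ^= 0xFF
inp = bytes(inp)

match = find_similar(inp, index, k=16, j=2)  # K = 16, J = 2, so J <= N <= K
if match is not None:
    cid, anchors = match
    sections = common_sections(inp, repo_chunks[cid], anchors)
```

In this sketch the noncommon sections are simply the gaps between the returned intervals. Because the search stops at J matches, only J anchors are extended, so a common region that contains no anchor is not reported; a fuller treatment would keep collecting anchors or compare the remaining intervals directly.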
