×

System and method of obfuscating data

  • US 6,981,217 B1
  • Filed: 12/08/1999
  • Issued: 12/27/2005
  • Est. Priority Date: 12/08/1998
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method of obfuscating the text of an electronic document in a computing environment for indexing by search engine systems, the method comprising:

  • receiving a request from a search engine spider for a source electronic document, wherein said source electronic document is protected by a digital rights management system such that said digital rights management system does not enable said search engine spider to have full-access to said source electronic document;

    retrieving source text from said source electronic document;

    parsing said source text into tokens using a delimiter based upon indexing characteristics of a search engine system for which said search engine spider requested said source electronic document, wherein for search engine systems that recognize phrases, said tokens include groups of words;

    providing a stop list comprising a predefined list of tokensremoving tokens, from parsed source text, that are listed within said stop list;

    inserting randomly selected tokens into said parsed source text, wherein said random tokens are selected from said stop list;

    randomizing an order of adjacent tokens of said parsed source text when said stop list has a small number of tokens;

    generating a second electronic document after said inserting random tokens and said removing tokens thereby said second electronic document of obfuscated index information, such that it is difficult to reconstruct said source electronic document from said second electronic document, and such that said second electronic document adequately represents said source electronic document for use by said search engine system andtransmitting the second electronic document to said search engine spider.

View all claims
  • 19 Assignments
Timeline View
Assignment View
    ×
    ×