Method and apparatus for distributing secure digital content that can be indexed by third party search engines
Abstract
In a secure content distribution system, text is extracted from content documents that include text and is then scrambled. The scrambled content is made available for indexing by conventional search engines but is not available as plain text and thus is kept secure. The scrambling process breaks a text stream derived from the content document into two-to-five-word phrases, randomizes the phrases, and creates a text file from the randomized stream. Third party search engines are allowed to index the scrambled file so that search algorithms that search on particular words or phrases produce nearly the same number of hits as with the plain text file. A web server that provides the content examines a user agent parameter that accompanies a content request and returns either the scrambled content, when the request comes from a search engine, or a link to the publisher. Alternatively, the scrambled content includes a script routine that links to the publisher.
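The user-agent check described in the abstract could look roughly like the following. This is only an illustrative sketch, not the patented implementation: the crawler markers, the function names `is_search_engine` and `respond`, and the dictionary response shape are all assumptions made for the example.

```python
# Hypothetical substrings used to recognise search engine crawlers.
# A real deployment would maintain its own list; these are assumptions.
CRAWLER_MARKERS = ("googlebot", "bingbot", "slurp")


def is_search_engine(user_agent: str) -> bool:
    """Return True if the user agent string looks like a known crawler."""
    ua = (user_agent or "").lower()
    return any(marker in ua for marker in CRAWLER_MARKERS)


def respond(user_agent: str, scrambled_text: str, publisher_url: str) -> dict:
    """Choose what to return for a content request.

    Crawlers receive the scrambled text for indexing; every other
    requester receives only a link to the publisher.
    """
    if is_search_engine(user_agent):
        return {"kind": "scrambled", "body": scrambled_text}
    return {"kind": "publisher_link", "url": publisher_url}
```

For example, `respond("Mozilla/5.0 (compatible; Googlebot/2.1)", text, url)` would take the crawler branch, while an ordinary browser user agent would receive the publisher link.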
47 Citations
30 Claims
1. A method for distributing secure digital content that can be indexed by third party search engines, the method comprising:
generating a text stream from the digital content by stripping all graphic information and punctuation from the digital content;
fragmenting the text stream into multi-word phrases that are each contained in the digital content;
randomly assembling the phrases into a scrambled document such that the scrambled document contains at least nearly all of the words and at least most of the phrases as are contained in the digital content; and
making the scrambled document available to third party search engines to permit indexing of the scrambled document that will result in an index that is comparable to an index that would result if the third party search engine indexed the digital content.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. Apparatus for distributing secure digital content that can be indexed by third party search engines, the apparatus comprising:
a stripper that generates a text stream from the digital content by stripping all graphic information and punctuation from the digital content;
means for fragmenting the text stream into multi-word phrases that are each contained in the digital content;
a stream assembler that randomly assembles the phrases into a scrambled document such that the scrambled document contains at least nearly all of the words and at least most of the phrases as are contained in the digital content; and
means for making the scrambled document available to third party search engines to permit indexing of the scrambled document that will result in an index that is comparable to an index that would result if the third party search engine indexed the digital content.
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19, 20
21. A non-transitory computer-readable medium having instructions stored thereon, the instructions comprising:
instructions for generating a text stream from the digital content by stripping all graphic information and punctuation from the digital content;
instructions for fragmenting the text stream into multi-word phrases that are each contained in the digital content;
instructions for randomly assembling the phrases into a scrambled document such that the scrambled document contains at least nearly all of the words and at least most of the phrases as are contained in the digital content; and
instructions for making the scrambled document available to third party search engines to permit indexing of the scrambled document that will result in an index that is comparable to an index that would result if the third party search engine indexed the digital content.
Dependent claims: 22, 23, 24, 25, 26, 27, 28, 29, 30
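The strip, fragment, and random-assembly steps recited in the independent claims can be illustrated with a short sketch. It is a hedged example under stated assumptions rather than the claimed method itself: it assumes the input is already plain text (graphic information removed upstream), takes the two-to-five-word phrase length from the abstract, and uses the hypothetical names `text_stream`, `fragment`, and `scramble`.

```python
import random
import re


def text_stream(content: str) -> list[str]:
    """Strip punctuation and return the remaining words in order.

    Assumption: `content` is plain text, so no graphic information is left.
    Apostrophes count as punctuation here, so "don't" becomes "don" and "t".
    """
    cleaned = re.sub(r"[^\w\s]", " ", content)
    return cleaned.split()


def fragment(words: list[str], rng: random.Random,
             min_len: int = 2, max_len: int = 5) -> list[list[str]]:
    """Cut the word stream into consecutive phrases of random length.

    Each phrase is a contiguous run of words from the original text.
    The final phrase may be shorter than `min_len` if words run out.
    """
    phrases = []
    i = 0
    while i < len(words):
        n = rng.randint(min_len, max_len)  # inclusive on both ends
        phrases.append(words[i:i + n])
        i += n
    return phrases


def scramble(content: str, seed=None) -> str:
    """Return a scrambled document: the phrases joined in random order."""
    rng = random.Random(seed)
    phrases = fragment(text_stream(content), rng)
    rng.shuffle(phrases)
    return " ".join(" ".join(p) for p in phrases)
```

Because every word lands in exactly one phrase, this sketch keeps all of the words of the stripped text, and phrase order is the only thing that changes; a word-level index of the output should therefore match a word-level index of the stripped original.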
Specification