Hierarchical and index based watermarks represented as trees
First Claim
1. A method, performed on a computer system, for watermarking content stored in a plurality of corpora each having a plurality of corpuses, the method comprising:
- using the computer system to perform the following;
receiving a data set of content from a corpus within one of the plurality of corpora;
applying the data set to a natural language processing (NLP) stack including a plurality of NLP analytics each analyzing the data set;
generating a feature watermark from each of the plurality of NLP analytics; and
forming a watermark tree from each of the feature watermarks, the watermark tree representing a hierarchical relationship between each of the feature watermarks generated from each of the plurality of NLP analytics, the watermark tree defining hierarchical pointers that point out inherited watermarks that exist between the feature watermarks according to the hierarchical relationship, the watermark tree including a time stamp specifying a time that the data set was accessed from the corpus.
1 Assignment
0 Petitions
Accused Products
Abstract
An approach that provides hierarchical and index based watermarks represented as trees is described. In one embodiment, a watermark tree is formed from feature watermarks generated from a natural language processing (NLP) stack having NLP analytics. The watermark tree represents a hierarchical relationship between each of the feature watermarks. In particular, the watermark tree defines hierarchical pointers that point out inherited watermarks that exist between the feature watermarks according to the hierarchical relationship. Further, the watermark tree includes a time stamp specifying a time that a data set content residing in a corpus was accessed.
10 Citations
25 Claims
-
1. A method, performed on a computer system, for watermarking content stored in a plurality of corpora each having a plurality of corpuses, the method comprising:
using the computer system to perform the following; receiving a data set of content from a corpus within one of the plurality of corpora; applying the data set to a natural language processing (NLP) stack including a plurality of NLP analytics each analyzing the data set; generating a feature watermark from each of the plurality of NLP analytics; and forming a watermark tree from each of the feature watermarks, the watermark tree representing a hierarchical relationship between each of the feature watermarks generated from each of the plurality of NLP analytics, the watermark tree defining hierarchical pointers that point out inherited watermarks that exist between the feature watermarks according to the hierarchical relationship, the watermark tree including a time stamp specifying a time that the data set was accessed from the corpus. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
14. A computer system for watermarking content stored in a plurality of corpora each having a plurality of corpuses, the computer system comprising:
-
at least one processing unit; memory operably associated with the at least one processing unit; and a feature watermark tool storable in memory and executable by the at least one processing unit, the tool comprising; an input component configured to receive a data set of content from a corpus within one of the plurality of corpora; a natural language processing (NLP) stack including a plurality of NLP analytics each configured to extract features from the data set; and a feature watermark generator configured to generate a feature watermark for each of the plurality of NLP analytics for features extracted therefrom, the feature watermark generator further configured to form a watermark tree from each of the feature watermarks, the watermark tree representing a hierarchical relationship between each of the feature watermarks generated from each of the plurality of NLP analytics, the watermark tree defining hierarchical pointers that point out inherited watermarks that exist between the feature watermarks according to the hierarchical relationship, the watermark tree including a time stamp specifying a time that the data set was accessed from the corpus. - View Dependent Claims (15, 16, 17)
-
-
18. A non-transitory computer readable medium storing computer instructions, which when executed, enables a computer system to watermark content stored in a plurality of corpora each having a plurality of corpuses, the executed computer instructions causing the computer system to perform a method comprising:
- receiving a data set of content from a corpus within one of the plurality of corpora;
applying the data set to a natural language processing (NLP) stack including a plurality of NLP analytics each analyzing the data set;
generating a feature watermark from each of the plurality of NLP analytics; and
forming a watermark tree from each of the feature watermarks, the watermark tree representing a hierarchical relationship between each of the feature watermarks generated from each of the plurality of NLP analytics, the watermark tree defining hierarchical pointers that point out inherited watermarks that exist between the feature watermarks according to the hierarchical relationship, the watermark tree including a time stamp specifying a time that the data set was accessed. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25)
- receiving a data set of content from a corpus within one of the plurality of corpora;
Specification