Noise reduction for content matching analysis results for protectable content
First Claim
1. A method for refining an indication of a composition of protectable content, comprising:
- storing, in a memory structure, a number of items of comparison content comprising at least one item of open source software content;
providing, on a computer-based content exchange, instructions regarding a plurality of noise reduction techniques;
receiving, at said computer-based content exchange, an item of user content;
comparing said item of user content to said items of comparison content;
determining, from among said items of comparison content, an identity of one or more potential matches, wherein each said potential match includes a matched portion that is similar to a portion of said item of user content;
selecting, using said computer-based content exchange, at least one of said plurality of noise reduction techniques for application to said potential matches; and
applying, using said computer-based content exchange, said at least one selected noise reduction technique to eliminate noise from said potential matches.
13 Assignments
0 Petitions
Accused Products
Abstract
A system and method (a “utility”) is provided for improving the accuracy of a content matching analysis that identifies a composition of an item of protectable content of a user. The item of protectable content may include a portion of source code or object code, individual or bundled source code or object code files, binary code files, directory structures and/or trees, open source software projects or packages, and/or proprietary software applications or packages. The utility involves storing a number of items of comparison content on a storage structure, receiving an item of user content at a computer-based content exchange, and comparing the item of user content to the items of comparison content to determine, from among the items of comparison content, one or more potential matches that each include a matched portion that is similar to a portion of the item of user content. The utility further includes selecting a noise reduction technique from a number of noise reduction techniques available to the content exchange and applying the noise reduction technique to eliminate noise and/or false positives (e.g., one or more redundant matches and/or erroneously identified matches) from the potential matches.
79 Citations
59 Claims
-
1. A method for refining an indication of a composition of protectable content, comprising:
-
storing, in a memory structure, a number of items of comparison content comprising at least one item of open source software content; providing, on a computer-based content exchange, instructions regarding a plurality of noise reduction techniques; receiving, at said computer-based content exchange, an item of user content; comparing said item of user content to said items of comparison content; determining, from among said items of comparison content, an identity of one or more potential matches, wherein each said potential match includes a matched portion that is similar to a portion of said item of user content; selecting, using said computer-based content exchange, at least one of said plurality of noise reduction techniques for application to said potential matches; and applying, using said computer-based content exchange, said at least one selected noise reduction technique to eliminate noise from said potential matches. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50)
-
-
51. A method for reducing noise in a set of deep discovery analysis results, comprising:
-
establishing a computer-based content exchange, wherein said computer-based content exchange receives an item of user content and compares said item of user content to a library of items of comparison content to identify a number of potential matches from among said items of comparison content, wherein each said potential match includes a matched portion that is similar to a portion of said item of user content, and wherein the items of comparison content comprise at least one item of open source software content; selecting, from among a plurality of noise reduction protocols stored on said computer-based content exchange, one or more of said noise reduction protocols; applying, using said computer-based content exchange, said one or more noise reduction protocols to said potential matches to eliminate noise from said potential matches; and depending on an outcome of said selecting and said applying steps, iteratively repeating said selecting and said applying steps until an acceptable amount of said noise is eliminated from said potential matches. - View Dependent Claims (52, 53, 54, 55, 56, 57)
-
-
58. A system for analyzing protectable content, comprising:
-
one or more memory structures, said memory structures storing a library of items of comparison content including at least one item of open source software content; a scanner, said scanner receiving an item of user content received from a user; and a processor communicatively coupled with said memory structures and said scanner, wherein said processor directs a comparison between said item of user content and said items of comparison content to identify a number of potential matches from among said items of comparison content, wherein each said potential match includes a matched portion that is similar to a portion of said item of user content, and wherein said processor iteratively applies a plurality of noise reduction protocols to said potential matches to eliminate individual ones of said potential matches. - View Dependent Claims (59)
-
Specification