Analyzing content using abstractable interchangeable elements
First Claim
1. A method for use in analyzing protectable content, comprising:
storing, in a memory structure, a plurality of items of comparison content in a first format and a second format;
receiving, from a user at a computer-based content exchange, an item of user content in said first format, at least one of said plurality of items of comparison content and said item of user content including open source content;
converting said item of user content from said first format to said second format, wherein said converting comprises abstracting one or more interchangeable elements of said item of user content, and wherein said interchangeable elements combine to comprise less than an entirety of said item of user content, said interchangeable elements indicative of a non-functional element associated with said item of user content;
comparing, using said computer-based content exchange, said item of user content in said second format to said items of comparison content in said second format; and
in response to said comparing, determining that said item of user content and at least one of said items of comparison content each include a substantially similar portion.
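The converting and comparing steps of claim 1 can be illustrated with a minimal sketch. It assumes, purely for illustration, that the interchangeable, non-functional elements are line comments, whitespace, and identifier names; the claim does not enumerate them. The names `KEYWORDS`, `abstract`, `is_substantially_similar`, and the 0.8 threshold are hypothetical and do not come from the patent.

```python
import re
from difflib import SequenceMatcher

# Assumption: these words are functional and kept verbatim; every other
# identifier is treated as an interchangeable, non-functional element.
KEYWORDS = {"def", "return", "if", "else", "for", "while", "in"}

# A token is an identifier, a run of digits, or any single non-space character.
TOKEN = re.compile(r"[A-Za-z_]\w*|\d+|\S")


def abstract(source: str) -> list:
    """Convert raw text (the 'first format') to abstracted tokens (the 'second format')."""
    # Drop '#' and '//' line comments (an assumed choice of comment styles).
    no_comments = re.sub(r"(#|//).*", "", source)
    tokens = []
    for tok in TOKEN.findall(no_comments):
        if re.match(r"[A-Za-z_]", tok) and tok not in KEYWORDS:
            tokens.append("ID")  # abstract the identifier name away
        else:
            tokens.append(tok)
    return tokens


def is_substantially_similar(user: str, comparison: str, threshold: float = 0.8) -> bool:
    """Compare two items in the abstracted format; the threshold is an assumption."""
    ratio = SequenceMatcher(None, abstract(user), abstract(comparison)).ratio()
    return ratio >= threshold
```

Under these assumptions, two functions that differ only in naming, spacing, and comments abstract to the same token sequence, while the abstraction still covers less than the entirety of the content: keywords, operators, and literals are left untouched.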
Abstract
A system and method (a “utility”) is provided for improving the accuracy of a content matching analysis that identifies a composition of an item of protectable content of a user. The item of protectable content may include a portion of source code or object code, individual or bundled source code or object code files, binary code files, directory structures and/or trees, open source software projects or packages, and/or proprietary software applications or packages. The utility involves storing a number of items of comparison content on a storage structure, receiving an item of user content at a computer-based content exchange, and comparing the item of user content to the items of comparison content to determine, from among the items of comparison content, one or more potential matches that each include a matched portion that is similar to a portion of the item of user content. The utility further includes selecting a noise reduction technique from a number of noise reduction techniques available to the content exchange and applying the noise reduction technique to eliminate noise and/or false positives (e.g., one or more redundant matches and/or erroneously identified matches) from the potential matches.
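The selection and application of a noise reduction technique might look like the following sketch. The abstract does not name the available techniques, so the two shown here (keeping only the best-scoring match per matched span, and discarding very short matches) are assumptions chosen for illustration, as are the `Match` fields and the `TECHNIQUES` registry.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Match:
    source: str   # name of the comparison item (hypothetical field)
    start: int    # first matched line in the user content
    end: int      # last matched line, inclusive
    score: float  # similarity score in [0, 1]


def drop_redundant(matches):
    """Keep only the best-scoring match for each identical user-content span."""
    best = {}
    for m in matches:
        key = (m.start, m.end)
        if key not in best or m.score > best[key].score:
            best[key] = m
    return sorted(best.values(), key=lambda m: m.start)


def drop_short(matches, min_lines=3):
    """Discard matches shorter than min_lines as likely false positives."""
    return [m for m in matches if m.end - m.start + 1 >= min_lines]


# Registry of techniques available to the (hypothetical) content exchange.
TECHNIQUES = {"redundant": drop_redundant, "short": drop_short}


def reduce_noise(matches, technique):
    """Select one technique by name and apply it to the potential matches."""
    return TECHNIQUES[technique](matches)
```

Keeping the techniques in a registry lets the exchange choose one per scan without changing the matching step itself.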
32 Claims
1. (Independent claim, set out in full under First Claim.) Dependent claims: 2-16
17. A system for analyzing user content, comprising:
a memory structure for storing a plurality of items of comparison content and one or more comparison signatures corresponding to each said item of comparison content, said one or more comparison signatures having previously been abstracted of one or more interchangeable elements of at least one of said plurality of items of comparison content;
a scanner for receiving an item of user content from a user and creating one or more user signatures associated with said item of user content, wherein each said user signature abstracts one or more interchangeable elements of said item of user content, said one or more interchangeable elements indicative of a non-functional element associated with said item of user content; and
a processor for comparing said user signatures to said comparison signatures and determining whether at least one of said items of comparison content includes a matched portion that is similar to a portion of said item of user content.
Dependent claims: 18-29, 31
30. A system as set forth in claim 29, wherein said item of user content remains behind said firewall, and wherein said portion of said item of user content is loaded for said direct comparison behind said firewall.
32. A method of analyzing user content, comprising:
storing, in a memory structure, a plurality of comparison signatures, wherein each said comparison signature is associated with an item of comparison content, at least one of said plurality of comparison signatures having previously been abstracted of one or more interchangeable elements of said item of comparison content;
receiving, at a scanner located at a user node, an item of user content;
dividing said item of user content into one or more portions;
creating, using said scanner, a user signature associated with each said portion, wherein each said user signature abstracts one or more interchangeable elements of said portion;
comparing, using a computer-based content exchange, said user signatures to said comparison signatures; and
from said comparing, determining which of said items of comparison content include a matched portion that is similar to one of said portions of said item of user content.
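The dividing, signature creation, and comparing steps of claim 32 can be sketched as follows. This is an illustration under stated assumptions, not the claimed implementation: portions are taken to be blocks separated by blank lines, the interchangeable elements are taken to be `#` comments and whitespace, and a SHA-256 digest stands in for the signature. The helper names `divide`, `signature`, `build_index`, and `find_matches` are hypothetical.

```python
import hashlib
import re


def divide(text):
    """Divide content into portions; blank-line-separated blocks are an assumption."""
    return [p for p in re.split(r"\n\s*\n", text) if p.strip()]


def signature(portion):
    """Create a signature that abstracts comments and whitespace out of a portion."""
    no_comments = re.sub(r"#.*", "", portion)       # assumed comment style
    collapsed = re.sub(r"\s+", "", no_comments)     # remove all whitespace
    return hashlib.sha256(collapsed.encode("utf-8")).hexdigest()


def build_index(items):
    """Store comparison signatures: digest -> names of comparison items containing it."""
    index = {}
    for name, text in items.items():
        for portion in divide(text):
            index.setdefault(signature(portion), set()).add(name)
    return index


def find_matches(user_text, index):
    """Return the comparison items sharing at least one portion signature with the user content."""
    found = set()
    for portion in divide(user_text):
        found |= index.get(signature(portion), set())
    return found
```

Because only digests are compared, the user content itself never has to leave the user node in this sketch. An exact hash finds only portions that are identical after abstraction, so a real system would presumably need a more tolerant signature to detect merely similar portions.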
Specification