System for software source code comparison
First Claim
1. A system for comparing at least a first corpus to a second corpus, comprising:
- an analyzer identifying concepts in the corpuses, said analyzer determining a frequency rating of each of said concepts in each corpus;
- for each corpus, replacing each instance of each of said concepts with its respective determined frequency rating to create a frequency file; and
- a comparator comparing the frequency file for the first corpus to the frequency file for the second corpus, wherein said comparing the frequency file for the first corpus to the frequency file for the second corpus further comprises comparing portions of one corpus against the other corpus.
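The claim elements above can be illustrated with a minimal sketch. It is not the patented implementation. It assumes that each whitespace-separated token stands in for a "concept", that the "frequency rating" is a raw occurrence count, and that "comparing portions" means looking for identical fixed-size windows of ratings. The names `frequency_file`, `compare_portions`, and `window` are hypothetical.

```python
from collections import Counter


def frequency_file(tokens):
    """Replace each token (assumed stand-in for a concept) with its count in the corpus."""
    counts = Counter(tokens)
    return [counts[t] for t in tokens]


def compare_portions(freq_a, freq_b, window=3):
    """Return (offset_in_a, offset_in_b) pairs where windows of ratings are identical.

    Assumption: a "portion" is a fixed-size sliding window; the claim does not
    say how portions are chosen.
    """
    # Index every window of the second frequency file by its contents.
    index = {}
    for j in range(len(freq_b) - window + 1):
        index.setdefault(tuple(freq_b[j:j + window]), []).append(j)

    # Look up each window of the first frequency file in that index.
    matches = []
    for i in range(len(freq_a) - window + 1):
        for j in index.get(tuple(freq_a[i:i + window]), []):
            matches.append((i, j))
    return matches


corpus_a = ["open", "read", "read", "close"]
corpus_b = ["init", "fetch", "fetch", "done", "log"]

file_a = frequency_file(corpus_a)  # [1, 2, 2, 1]
file_b = frequency_file(corpus_b)  # [1, 2, 2, 1, 1]
print(compare_portions(file_a, file_b))  # [(0, 0), (1, 1)]
```

In this sketch the two corpuses share no token names, yet their frequency files overlap, because only the ratings are compared.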
2 Assignments
0 Petitions
Abstract
A system for analyzing similarities between a first and second corpus, or between a set of concepts and a corpus, uses natural language processing and machine intelligence methods to replace terms or phrases in the corpus with concepts, determine the frequency of each concept in the corpus, and convert the corpus into a concept frequency file. This enables easy comparison of the two corpuses or easy retrieval of items from the corpus that contain a concept. Difference analysis and a combination of content and spectral analysis may be employed.
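The term-to-concept replacement and the retrieval use case in the abstract can be sketched as follows. This is an illustration under stated assumptions, not the method of the patent: a hand-written lookup table (`CONCEPTS`, hypothetical) stands in for the natural language processing and machine intelligence step, and the functions `to_concepts`, `concept_frequencies`, and `retrieve` are invented for the example.

```python
from collections import Counter

# Hypothetical term-to-concept table. The abstract describes NLP and machine
# intelligence methods for this step; a fixed dictionary is only a stand-in.
CONCEPTS = {
    "car": "VEHICLE",
    "automobile": "VEHICLE",
    "truck": "VEHICLE",
    "invoice": "BILLING",
    "bill": "BILLING",
}


def to_concepts(text):
    """Replace known terms with their concept label; unknown terms are dropped."""
    return [CONCEPTS[w] for w in text.lower().split() if w in CONCEPTS]


def concept_frequencies(items):
    """Count how often each concept occurs in each item of the corpus."""
    return [Counter(to_concepts(item)) for item in items]


def retrieve(items, concept):
    """Return the items in which the given concept occurs at least once."""
    freqs = concept_frequencies(items)
    return [item for item, f in zip(items, freqs) if f[concept] > 0]


items = [
    "The car and the truck",
    "Send the invoice",
    "Nothing relevant here",
]
print(to_concepts(items[0]))       # ['VEHICLE', 'VEHICLE']
print(retrieve(items, "VEHICLE"))  # ['The car and the truck']
```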
64 Citations
3 Claims
Specification