Method and System for Creating a Data Profile Engine, Tool Creation Engines and Product Interfaces for Identifying and Analyzing Files and Sections of Files
First Claim
Patent Images
1. A method for identifying a relationship between a plurality of data files comprising word text, the method comprising:
- receiving at a processor the plurality of data files from one or more computer databases;
deconstructing the data files into one or more text blocks;
creating a data profile for each data file and for the one or more text blocks associated with each data file, each data profile comprising a statistical signature for a set of data forming the corresponding data file or text block compared to the plurality of data files; and
storing the data profiles on the one or more computer databases.
3 Assignments
0 Petitions
Accused Products
Abstract
A data profile engine identifies, classifies, analyzes, searches, compares and cross-references entire files and sections of files, records and other forms of electronic media, and a tool creation engine in combination with the data profile engine builds custom solutions and product interfaces.
91 Citations
25 Claims
-
1. A method for identifying a relationship between a plurality of data files comprising word text, the method comprising:
-
receiving at a processor the plurality of data files from one or more computer databases; deconstructing the data files into one or more text blocks; creating a data profile for each data file and for the one or more text blocks associated with each data file, each data profile comprising a statistical signature for a set of data forming the corresponding data file or text block compared to the plurality of data files; and storing the data profiles on the one or more computer databases. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A system for analyzing a document comprising text, the system comprising:
-
a processor configured to receive the document and perform the steps of; deconstructing the document into one or more text blocks; creating a data profile for each of the one or more text blocks, each data profile comprising a statistical signature for a set of data forming the text block; and comparing the data profile for each of the one or more text blocks with a template stored on a computer database, the template comprising data profiles for matching text blocks from a source set of documents; and a user interface coupled to the processor for displaying an indication of similarity of the document compared to the template, the indication of similarity comprising statistical measure of frequency for matching text blocks. - View Dependent Claims (13, 14, 15, 16)
-
-
17. A system for preparing a document comprising text, the system comprising:
-
a processor configured to receive the document and perform the steps of; deconstructing the document into one or more text blocks; creating a data profile for each of the one or more text blocks, each data profile comprising a statistical signature of a set of data forming the text block; and comparing the data profile for each of the one or more text blocks with data profiles associated with a model document stored on a computer database, the model document comprising a plurality of statistically similar text blocks from a source set of model documents; and a user interface coupled to the processor for displaying default standard clauses, alternative clauses and infrequently used clauses based on the source set of model documents. - View Dependent Claims (18, 19, 20, 21)
-
-
22. A system for searching entire files and sections of files, the system comprising:
-
a processor configured to receive a plurality of documents and perform the steps of; deconstructing the plurality of documents into one or more text blocks; creating a data profile for each of the one or more text blocks, each data profile comprising a statistical signature of a set of data forming the text block; and comparing the data profile for each of the one or more text blocks with data profiles associated with a model document stored on a computer database, the model document comprising a plurality of statistically similar text blocks from a source set of model documents; a user interface coupled to the processor for entering a search, the search comprising search terms, sections captions and/or text of a similar section of a user document compared to the model document; and a display for displaying the search results. - View Dependent Claims (23, 24, 25)
-
Specification