Automated understanding, extraction and structured reformatting of information in electronic files
First Claim
1. A method for automatically understanding a document, the method comprising:
- utilizing algorithms to automate the understanding of a document, wherein no prior identification of a document type is required, no prior identification of an expected format for the document type is required, and no pre-created scripts are required to map contents of the document.
1 Assignment
0 Petitions
Accused Products
Abstract
Systems and methods for automatically understanding, decomposing, extracting, validating and reformatting unstructured tabular information into intermediate structured representations of the information contained therein are described. No constraints are placed on the origin or format of these documents when originally submitted. Furthermore, no pre-created scripts are required to map the information contained in the submitted documents. The systems and methods of this invention generally comprise obtaining an electronic document, automatically analyzing and understanding the contents of the document, extracting information from the document, categorizing the information, and then creating an intermediate structured representation of the information contained therein. The intermediate structured representations may then be easily converted for use in a myriad of back-end systems. Embodiments of this invention automatically process a multitude of financial documents, thereby eliminating the need for human interaction with such documents in many cases and lowering the costs associated with processing such documents.
-
Citations
50 Claims
-
1. A method for automatically understanding a document, the method comprising:
-
utilizing algorithms to automate the understanding of a document, wherein no prior identification of a document type is required, no prior identification of an expected format for the document type is required, and no pre-created scripts are required to map contents of the document. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24)
-
-
25. A method for understanding a document and converting it into an intermediate structured representation of the information contained therein, the method comprising:
-
obtaining a document;
utilizing algorithms to automatically understand the document; and
creating an intermediate structured representation of the information contained therein from the extracted information, wherein no prior identification of a document type is required, no prior identification of an expected format for the document type is required, no pre-created scripts are required to map contents of the document, and the intermediate structured representation of the information is capable of being exchanged across diverse hardware, operating systems and applications. - View Dependent Claims (26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37)
-
-
38. A system for understanding a document and converting it into an intermediate structured representation of the information contained therein, the system comprising:
-
a means for obtaining a document;
a means for utilizing algorithms to automatically understand the document; and
a means for creating an intermediate structured representation of the information contained therein from the extracted information, wherein no prior identification of a document type is required, no prior identification of an expected format for the document type is required, no pre-created scripts are required to map contents of the document, and the intermediate structured representation of the information is capable of being exchanged across diverse hardware, operating systems and applications. - View Dependent Claims (39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50)
-
Specification