Corporate disclosure and repository system utilizing inference synthesis as applied to a database
Abstract
A corporate disclosure and repository system includes one or more software programs which execute on one or more general purpose data processing systems. The software programs include components for gathering information in the form of free-form text documents, reducing the information to a formatted database, analyzing the contents of the database and reorganizing the database in a format suitable for drawing inferences with respect to the contents thereof, and synthesizing inferences based upon the contents of the reorganized database. The software programs may be used both intracompany, in preparing documents for deposit in the repository system, and intercompany, in reviewing documents already deposited in the repository system. The intercompany part may be further divided into parts useful to regulators and parts useful to the public. The principal difference between the various parts is in certain knowledge and rules applied in the analysis and reorganization stages, since the various users of the system have different goals at those stages.
10 Claims
1. A software system for producing the inferences in a specialized field, the software system fixed in a machine-readable medium, and the software system comprising:
synthesis software tools which when executed by a processor receive documents including freely formatted text documents and which produce a formatted database of information from the freely formatted text documents, the synthesis software tools relating instances to concepts, an instance which can be associated with plural concepts being related to a single concept with reference to a context in which the instance appears; and
analysis software tools which when executed by a processor receive the formatted database, fill portions of a template of predefined relationships between different concepts, the predefined relationships being relevant to inferences in the specialized field, cluster the filled portions of the template of predefined relationships according to a similarity index computed between the filled portions of the template of predefined relationships when the similarity index exceeds a threshold value to form a grouped database, draw inferences from the grouped database and produce an analysis output containing the inferences, wherein the analysis software tools comprise a diagonalization tool which forms the grouped database in which each cluster of topics having the similarity index exceeding the threshold value is a group.
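As an illustration of the clustering step recited in claim 1, the following is a minimal sketch and not the claimed diagonalization tool itself. It assumes the similarity index is a Jaccard overlap between the concept sets filling two template portions, and that portions are merged into one group whenever a pair exceeds the threshold; the claim fixes neither choice. The names `jaccard` and `group_by_similarity` and the sample data are hypothetical.

```python
from itertools import combinations


def jaccard(a, b):
    """Stand-in similarity index: shared concepts over all distinct concepts."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def group_by_similarity(filled, threshold):
    """Group template portions whose pairwise index exceeds the threshold.

    `filled` maps a portion name to the set of concepts filling it.
    Returns a sorted list of groups; unmatched portions stay as singletons.
    """
    parent = {name: name for name in filled}

    def find(x):
        # Follow parent links to the representative of x's group.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in combinations(filled, 2):
        if jaccard(filled[a], filled[b]) > threshold:
            parent[find(a)] = find(b)  # merge the two groups

    groups = {}
    for name in filled:
        groups.setdefault(find(name), []).append(name)
    return sorted(sorted(members) for members in groups.values())


# Hypothetical filled template portions.
filled = {
    "revenue_q1": frozenset({"revenue", "quarter", "increase"}),
    "revenue_q2": frozenset({"revenue", "quarter", "decrease"}),
    "litigation": frozenset({"lawsuit", "liability"}),
}
groups = group_by_similarity(filled, threshold=0.4)
print(groups)
```

In this sketch the two revenue portions share two of four distinct concepts (index 0.5), so they form one group at a threshold of 0.4, while the litigation portion remains a group of its own.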
2. A software system as defined in claim 1, the synthesis tools further comprising:
a concept dictionary relating a concept word root, a context word root and an instance word root;
a parser which receives the documents and which produces a plurality of words contained in the documents;
a rooter which receives the words parsed by the parser and which produces corresponding word roots for the words received; and
a contexter which receives the concept dictionary and the word roots and which identifies a concept corresponding to each word root on the basis of the word roots received.
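The parser, rooter and contexter elements above can be pictured with a short sketch under stated assumptions. The rooter here is a naive suffix stripper standing in for whatever rooting method the system uses, the dictionary entries are invented, and the contexter is simplified to listing the candidate concepts for each recognized instance root. All function and variable names are hypothetical.

```python
import re
from typing import NamedTuple


class DictEntry(NamedTuple):
    concept_root: str
    context_root: str
    instance_root: str


# Hypothetical concept dictionary, for illustration only.
CONCEPT_DICTIONARY = [
    DictEntry("debt", "loan", "interest"),
    DictEntry("ownership", "share", "interest"),
]


def parse(document):
    """Parser: split free-form text into lowercase words."""
    return re.findall(r"[a-z]+", document.lower())


def root(word):
    """Rooter: naive suffix stripping (an assumption, not a real stemmer)."""
    for suffix in ("ing", "ed", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def contexter(dictionary, roots):
    """Simplified contexter: candidate concepts for each known instance root."""
    candidates = {}
    for r in roots:
        concepts = sorted({e.concept_root for e in dictionary if e.instance_root == r})
        if concepts:
            candidates[r] = concepts
    return candidates


words = parse("Interest accrued on the loans.")
roots = [root(w) for w in words]
candidates = contexter(CONCEPT_DICTIONARY, roots)
print(roots)
print(candidates)
```

The root "interest" maps to two candidate concepts in this toy dictionary, which is the ambiguity that claim 1 resolves by reference to the context in which the instance appears.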
3. A software system as defined in claim 2, wherein the word roots include a word root defining context, and one or more word roots associated with the context, the contexter further comprising:
a context recognizer which identifies in the concept dictionary all concepts having word roots associated with the context; and
an instance recognizer which identifies in the concept dictionary all concepts previously identified by the context recognizer which also include an instance word root matching the word root.
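The two-stage lookup in claim 3 can be sketched as two filters applied in order: the context recognizer narrows the dictionary first, and the instance recognizer then searches only within that narrowed set. This is an illustrative reading, assuming the dictionary is a flat list of (concept, context, instance) word-root triples; the entries and function names are hypothetical.

```python
from typing import NamedTuple


class DictEntry(NamedTuple):
    concept_root: str
    context_root: str
    instance_root: str


# Hypothetical dictionary entries, for illustration only.
DICTIONARY = [
    DictEntry("debt", "loan", "interest"),
    DictEntry("debt", "loan", "principal"),
    DictEntry("ownership", "share", "interest"),
]


def recognize_context(dictionary, context_root):
    """Context recognizer: keep every entry associated with the context."""
    return [e for e in dictionary if e.context_root == context_root]


def recognize_instance(candidates, word_root):
    """Instance recognizer: keep context matches whose instance root matches."""
    return [e for e in candidates if e.instance_root == word_root]


def resolve(dictionary, context_root, word_root):
    """Apply the two recognizers in order and return the matching concepts."""
    by_context = recognize_context(dictionary, context_root)
    by_instance = recognize_instance(by_context, word_root)
    return sorted({e.concept_root for e in by_instance})


print(resolve(DICTIONARY, "loan", "interest"))
print(resolve(DICTIONARY, "share", "interest"))
```

With these sample entries, the same instance root "interest" resolves to "debt" in a loan context and to "ownership" in a share context.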
4. A software system as defined in claim 1, the analysis tools further comprising:
an inferencer which receives the groups from the grouped database and which produces an inferenced database in which inferences are drawn on the basis of information present in and absent from the groups.
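One way to picture an inferencer that reasons from both present and absent information is a small rule table, sketched below. The rule format, the sample rules and the group contents are assumptions made for illustration; the claim does not specify how inferences are encoded.

```python
from typing import NamedTuple


class Rule(NamedTuple):
    present: frozenset  # concepts that must appear in the group
    absent: frozenset   # concepts that must not appear in the group
    conclusion: str


# Hypothetical rules, for illustration only.
RULES = [
    Rule(frozenset({"loan"}), frozenset({"interest_rate"}),
         "loan disclosed without an interest rate"),
    Rule(frozenset({"loan", "interest_rate"}), frozenset(),
         "loan terms disclosed"),
]


def infer(groups, rules):
    """Return, per group, the conclusions of every rule the group satisfies."""
    inferenced = {}
    for name, concepts in groups.items():
        inferenced[name] = [
            rule.conclusion
            for rule in rules
            if rule.present <= concepts and not (rule.absent & concepts)
        ]
    return inferenced


# Hypothetical groups taken from a grouped database.
groups = {
    "financing_a": {"loan", "interest_rate"},
    "financing_b": {"loan", "maturity"},
}
result = infer(groups, RULES)
print(result)
```

The second group triggers an inference precisely because an expected concept is missing, which is the "absent from the groups" case the claim describes.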
5. A software system as defined in claim 1, the analysis tools further comprising:
a catalog defining each database entry as one of either required or optional.
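A catalog of this kind could be represented as a mapping from entry name to a required or optional flag, as in the following sketch. The entry names, the `Requirement` enum and the `missing_required` helper are hypothetical and are shown only to illustrate how such a catalog might be consulted.

```python
from enum import Enum


class Requirement(Enum):
    REQUIRED = "required"
    OPTIONAL = "optional"


# Hypothetical catalog entries, for illustration only.
CATALOG = {
    "company_name": Requirement.REQUIRED,
    "fiscal_year": Requirement.REQUIRED,
    "segment_notes": Requirement.OPTIONAL,
}


def missing_required(catalog, record):
    """List required entries that are absent or empty in a database record."""
    return sorted(
        entry
        for entry, requirement in catalog.items()
        if requirement is Requirement.REQUIRED and record.get(entry) in (None, "")
    )


record = {"company_name": "Example Corp", "segment_notes": ""}
missing = missing_required(CATALOG, record)
print(missing)
```

An empty optional entry is ignored here, while a missing required entry is reported.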
6. The software system of claim 1, wherein the specialized field is corporate financial information disclosure.
7. A software system as defined in claim 6, the synthesis tools further comprising:
a concept dictionary relating a concept word root, a context word root and an instance word root;
a parser which receives the documents and which produces a plurality of words contained in the documents;
a rooter which receives the words parsed by the parser and which produces corresponding word roots for the words received; and
a contexter which receives the concept dictionary and the word roots and which identifies a concept corresponding to each word root on the basis of the word roots received.
8. A software system as defined in claim 6, wherein the word roots include a word root defining context, and one or more word roots associated with the context, the contexter further comprising:
a context recognizer which identifies in the concept dictionary all concepts having word roots associated with the context; and
an instance recognizer which identifies in the concept dictionary all concepts previously identified by the context recognizer which also include an instance word root matching the word root.
9. A software system as defined in claim 6, the analysis tools further comprising:
a catalog defining each database entry as one of either required or optional.
10. A software system as defined in claim 6, the analysis tools further comprising:
an inferencer which receives the groups from the grouped database and which produces an inferenced database in which inferences are drawn on the basis of information present in and absent from the groups.
Specification