Information management system
First Claim
1. A method of operating an information management system, comprising:
- for each source file in a first collection of source files;
parsing, by a processor of a computing device, the respective source file to extract one or more chemical structures,comparing, by the processor, the one or more chemical structures to chemical structures stored in at least one dictionary, whereineach dictionary of the at least one dictionary comprises a hierarchical listing of chemical structures, andcomparing comprises identifying one or more matching chemical structures, andassociating, by the processor, with the respective source file, the one or more matching chemical structures;
generating, by the processor, a first virtual relational network comprising the source files in the first collection, wherein the first virtual relational network comprises;
one or more nodes, wherein each node represents a particular matching chemical structure of the one or more matching chemical structures associated with a particular source file of the source files in the first collection, andone or more links, wherein each link represents a connection between a pair of nodes, wherein each node of the pair of nodes is associated with a common chemical structure; and
comparing, by the processor, the first virtual relational network to a second virtual relational network, wherein comparing comprises identifying at least one of a) one or more nodes and b) one or more links common to both the first virtual relational network and the second virtual relational network, whereinthe second virtual relational network is created from a second collection of source files different than the first collection of source files; and
the first virtual relational network and the second virtual relational network share at least one common dictionary, whereinthe at least one dictionary comprises the at least one common dictionary.
4 Assignments
0 Petitions
Accused Products
Abstract
An information management system creates data structures based entirely on the content of source files, then compares these data structures to discover synergies and commonalities. In one embodiment, the system accepts a first collection of source files, and extracts text from each source file. The text is compared to tags in one or more dictionaries, which comprise hierarchical listing of tags. Tags matching the text are associated with each source file. The system then generates a virtual relational network in which each source file having matching tags is a node. Tags associated with two or more source files are links between the nodes. This virtual relational network may be compared with another virtual relational network to discover common nodes or links. Source files later added to a collection are massively linked by associating all tags from all source files with the newly added source file, and vice versa.
72 Citations
22 Claims
-
1. A method of operating an information management system, comprising:
for each source file in a first collection of source files; parsing, by a processor of a computing device, the respective source file to extract one or more chemical structures, comparing, by the processor, the one or more chemical structures to chemical structures stored in at least one dictionary, wherein each dictionary of the at least one dictionary comprises a hierarchical listing of chemical structures, and comparing comprises identifying one or more matching chemical structures, and associating, by the processor, with the respective source file, the one or more matching chemical structures; generating, by the processor, a first virtual relational network comprising the source files in the first collection, wherein the first virtual relational network comprises; one or more nodes, wherein each node represents a particular matching chemical structure of the one or more matching chemical structures associated with a particular source file of the source files in the first collection, and one or more links, wherein each link represents a connection between a pair of nodes, wherein each node of the pair of nodes is associated with a common chemical structure; and comparing, by the processor, the first virtual relational network to a second virtual relational network, wherein comparing comprises identifying at least one of a) one or more nodes and b) one or more links common to both the first virtual relational network and the second virtual relational network, wherein the second virtual relational network is created from a second collection of source files different than the first collection of source files; and the first virtual relational network and the second virtual relational network share at least one common dictionary, wherein the at least one dictionary comprises the at least one common dictionary. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
16. A non-transitory computer readable medium having instructions stored thereon, wherein the instructions, when executed by a processor, cause the processor to for each source file in a first collection of source files:
-
parse the respective source file to extract one or more chemical structures, compare the one or more chemical structures to chemical structures stored in at least one dictionary, wherein comparing the one or more chemical structures comprises identifying one or more matching chemical structures, and each dictionary of the at least one dictionary comprises a hierarchical listing of chemical structures; associate, with the respective source file, the one or more matching chemical structures; generate a first virtual relational network comprising the source files in the first collection, wherein the first virtual network comprises; one or more nodes, wherein each node of the one or more nodes represents a particular matching chemical structure of the one or more matching chemical structures associated with a particular source file of the source files in the first collection, and one or more links, wherein each link represents a connection between a pair of nodes, wherein each node of the pair of nodes is associated with a common chemical structure; and compare the first virtual relational network to a second virtual relational network, wherein comparing comprises identifying at least one of a) one or more nodes and b) one or more links common to both the first virtual relational network and the second virtual relational network, wherein the second virtual relational network is created from a second collection of source files different than the first collection of source files; and the first virtual relational network and the second virtual relational network share at least one common dictionary, wherein the at least one dictionary comprises the at least one common dictionary. - View Dependent Claims (17, 18, 19, 20, 21, 22)
-
Specification