Information management system
Abstract
An information management system creates data structures based entirely on the content of source files, then compares these data structures to discover synergies and commonalities. In one embodiment, the system accepts a first collection of source files and extracts text from each source file. The text is compared to tags in one or more dictionaries, each of which comprises a hierarchical listing of tags. Tags matching the text are associated with each source file. The system then generates a virtual relational network in which each source file having matching tags is a node. Tags associated with two or more source files are links between the nodes. This virtual relational network may be compared with another virtual relational network to discover common nodes or links. Source files later added to a collection are massively linked by associating all tags from all source files with the newly added source file, and vice versa.
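The embodiment summarized in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the dictionary contents, the file names, the whole-word matching rule, and every function name (`flatten`, `match_tags`, `build_network`, `compare_networks`) are assumptions made for illustration only. The sketch treats each source file with at least one matching tag as a node and each tag shared by two files as a link, as the abstract describes.

```python
import re
from itertools import combinations

# Hypothetical hierarchical dictionary: category -> sub-category -> tags.
DICTIONARY = {
    "materials": {"polymers": ["nylon", "kevlar"], "metals": ["titanium"]},
    "processes": {"joining": ["welding"]},
}

def flatten(node):
    """Collect every leaf tag from the nested (hierarchical) dictionary."""
    if isinstance(node, list):
        return set(node)
    tags = set()
    for child in node.values():
        tags |= flatten(child)
    return tags

def match_tags(text, dictionary):
    """Return the dictionary tags that occur as whole words in the text."""
    words = set(re.findall(r"[a-z0-9]+", text.lower()))
    return words & flatten(dictionary)

def build_network(collection, dictionary):
    """Nodes are files with matching tags; links are tags shared by two files."""
    tagged = {name: match_tags(text, dictionary) for name, text in collection.items()}
    nodes = {name for name, tags in tagged.items() if tags}
    links = set()
    for a, b in combinations(sorted(nodes), 2):
        for tag in tagged[a] & tagged[b]:
            links.add((a, b, tag))
    return nodes, links

def compare_networks(net_a, net_b):
    """Assumption: nodes are compared by file name, links by their tag."""
    nodes_a, links_a = net_a
    nodes_b, links_b = net_b
    common_nodes = nodes_a & nodes_b
    common_link_tags = {t for _, _, t in links_a} & {t for _, _, t in links_b}
    return common_nodes, common_link_tags

# Two hypothetical collections that share one file.
first = {
    "a.txt": "Kevlar welding notes",
    "b.txt": "Welding titanium parts",
    "c.txt": "lunch menu",
}
second = {
    "x.txt": "titanium welding",
    "y.txt": "welding nylon",
    "b.txt": "Welding titanium parts",
}
net1 = build_network(first, DICTIONARY)
net2 = build_network(second, DICTIONARY)
common = compare_networks(net1, net2)
print(common)
```

In this toy data, `c.txt` matches no tag and so is not a node; the two networks share the node `b.txt` and the link tag `welding`.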
26 Claims
1. A method of operating an information management system, comprising:
obtaining a first collection of source files;
for each source file in the first collection:
    parsing, by a processor of a computing device, the respective source file to extract one or more tags,
    comparing, by the processor, the one or more tags to tags in at least one dictionary to determine one or more matching tags, wherein the at least one dictionary comprises a hierarchical listing of tags, and
    associating, by the processor, with the respective source file, the one or more matching tags;
generating, by the processor, a first virtual relational network comprising the source files in the first collection, wherein the first virtual relational network comprises:
    one or more nodes, wherein each node of the one or more nodes represents a particular matching tag of the one or more matching tags associated with a particular source file of the source files in the first collection, and
    one or more links, wherein each link of the one or more links represents a connection between a pair of nodes, wherein each node of the pair of nodes is associated with a same tag;
comparing, by the processor, the first virtual relational network to a second virtual relational network to identify at least one of (a) nodes common to the first and second virtual relational networks, and (b) links common to the first and second virtual relational networks, wherein
    the second virtual relational network is created from a second collection of source files different from the first collection of source files, and
    the second virtual relational network is created using one or more dictionaries of the at least one dictionary; and
displaying a graphical representation of at least part of the first and second virtual relational networks.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14
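Claim 1 words the network differently from the abstract: each node represents a matching tag associated with a particular source file, and each link joins a pair of nodes associated with the same tag. The Python sketch below is one possible reading of that structure, offered only as an illustration. The `Node` class, the function names, the sample associations, and in particular the rule that nodes and links count as "common" when they carry the same tag are all assumptions; the claim does not specify how commonality is determined.

```python
from dataclasses import dataclass
from itertools import combinations

@dataclass(frozen=True)
class Node:
    """One matching tag as associated with one source file (hypothetical type)."""
    source_file: str
    tag: str

def generate_network(associations):
    """associations maps a file name to the set of matching tags for that file."""
    nodes = {Node(f, t) for f, tags in associations.items() for t in tags}
    # A link connects two nodes that are associated with the same tag.
    links = {
        frozenset((a, b))
        for a, b in combinations(nodes, 2)
        if a.tag == b.tag
    }
    return nodes, links

def compare_networks(first, second):
    """Assumption: a node or link is 'common' when both networks have its tag."""
    node_tags = {n.tag for n in first[0]} & {n.tag for n in second[0]}
    link_tags = ({next(iter(link)).tag for link in first[1]}
                 & {next(iter(link)).tag for link in second[1]})
    return node_tags, link_tags

# Hypothetical tag associations for two different collections.
first_net = generate_network({"a.txt": {"welding", "kevlar"}, "b.txt": {"welding"}})
second_net = generate_network({"x.txt": {"welding", "titanium"},
                               "y.txt": {"welding", "kevlar"}})
common_nodes, common_links = compare_networks(first_net, second_net)
print(sorted(common_nodes), sorted(common_links))
```

With these sample associations, the tags `kevlar` and `welding` appear as nodes in both networks, while only `welding` forms a link in both, because `kevlar` is associated with a single file in each collection. The displaying step of the claim is left out of the sketch.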
15. A non-transitory computer readable medium having instructions stored thereon that, when executed, cause a processor to:
obtain a first collection of source files;
for each source file in the first collection:
    parse the respective source file to extract one or more tags,
    compare the one or more tags to tags in at least one dictionary to determine one or more matching tags, wherein the at least one dictionary comprises a hierarchical listing of tags, and
    associate, with the respective source file, the one or more matching tags;
generate a first virtual relational network comprising the source files in the first collection, wherein the first virtual relational network comprises:
    one or more nodes, wherein each node of the one or more nodes represents a particular matching tag of the one or more matching tags associated with a particular source file of the source files in the first collection, and
    one or more links, wherein each link of the one or more links represents a connection between a pair of nodes, wherein each node of the pair of nodes is associated with a same tag;
compare the first virtual relational network to a second virtual relational network to identify at least one of (a) nodes common to the first and second virtual relational networks, and (b) links common to the first and second virtual relational networks, wherein
    the second virtual relational network is created from a second collection of source files different from the first collection of source files, and
    the second virtual relational network is created using one or more dictionaries of the at least one dictionary; and
display a graphical representation of at least part of the first and second virtual relational networks.
Dependent claims: 16, 17, 18, 19, 20, 21, 22
23. A system comprising:
a processor; and
a memory having instructions stored thereon, wherein the instructions, when executed by the processor, cause the processor to:
obtain a first collection of source files;
for each source file in the first collection:
    parse the respective source file to extract one or more tags,
    compare the one or more tags to tags in at least one dictionary to determine one or more matching tags, wherein the at least one dictionary comprises a hierarchical listing of tags, and
    associate, with the respective source file, the one or more matching tags;
generate a first virtual relational network comprising the source files in the first collection, wherein the first virtual relational network comprises:
    one or more nodes, wherein each node of the one or more nodes represents a particular matching tag of the one or more matching tags associated with a particular source file of the source files in the first collection, and
    one or more links, wherein each link of the one or more links represents a connection between a pair of nodes, wherein each node of the pair of nodes is associated with a same tag; and
obtain a second collection of source files different than the first collection of source files;
for each source file in the second collection:
    parse the respective source file to extract one or more second tags,
    compare the one or more second tags to tags in one or more dictionaries to determine one or more matching second tags, wherein the one or more dictionaries comprise at least a first dictionary of the at least one dictionary, and each dictionary of the one or more dictionaries comprises a hierarchical listing of tags, and
    associate, with the respective source file, the one or more second matching tags;
generate a second virtual relational network comprising the source files in the second collection, wherein the second virtual relational network comprises:
    one or more nodes, wherein each node of the one or more nodes represents a particular second matching tag of the one or more second matching tags associated with a particular source file of the source files in the second collection, and
    one or more links, wherein each link of the one or more links represents a connection between a pair of nodes, wherein each node of the pair of nodes is associated with a same tag;
compare the first virtual relational network to the second virtual relational network to identify at least one of (a) nodes common to the first and second virtual relational networks, and (b) links common to the first and second virtual relational networks; and
display a graphical representation of at least part of the first and second virtual relational networks.
Dependent claims: 24, 25, 26
Specification