Knowledge discovery agent system
First Claim
Patent Images
1. A system, comprising:
- at least one processor in a distributed computer network;
at least one data storage device in the distributed computer network, in communication with the at least one processor, and configured to store computer-executable instructions and program data; and
at least one data agent in the distributed computer network and in communication with the at least one processor and configured to perform specific functions in response to instructions from the at least one processor;
wherein the at least one data agent is configured to perform functions that include creating a directed or undirected graph representation of features from unstructured data of at least one unstructured data source, wherein the unstructured data source comprises text data from a text corpus;
wherein the graph representation of the features is configured for use with at least one machine learning function, the at least one machine learning function including learning semantic associations between words using calculations of similarity of the words based on usage of the words in context over time, wherein the contexts correspond to semantic units and constituents of the semantic units correspond to elements;
wherein the similarity of words determined from extracted semantic units that re-occur across the text corpus are used for forming a compressed knowledge representation of a discovered pattern of relationships between semantic units and respective elements; and
an organic software agent, of the at least one data agent, having a belief state formed from the compressed knowledge representation and that modifies its own source code for at least one of decisions and execution of plans by the organic software agent, wherein modifying the source code comprises at least one of autonomous dynamic creation and manipulation of executable code by the organic software agent that, when executed, causes the at least one data agent to perform personalized agent services for a user in response to a user interaction with the system.
6 Assignments
0 Petitions
Accused Products
Abstract
A system and method for processing information in unstructured or structured form, comprising a computer running in a distributed network with one or more data agents. Associations of natural language artifacts may be learned from natural language artifacts in unstructured data sources, and semantic and syntactic relationships may be learned in structured data sources, using grouping based on a criteria of shared features that are dynamically determined without the use of a priori classifications, by employing conditional probability constraints.
30 Citations
11 Claims
-
1. A system, comprising:
-
at least one processor in a distributed computer network; at least one data storage device in the distributed computer network, in communication with the at least one processor, and configured to store computer-executable instructions and program data; and at least one data agent in the distributed computer network and in communication with the at least one processor and configured to perform specific functions in response to instructions from the at least one processor; wherein the at least one data agent is configured to perform functions that include creating a directed or undirected graph representation of features from unstructured data of at least one unstructured data source, wherein the unstructured data source comprises text data from a text corpus; wherein the graph representation of the features is configured for use with at least one machine learning function, the at least one machine learning function including learning semantic associations between words using calculations of similarity of the words based on usage of the words in context over time, wherein the contexts correspond to semantic units and constituents of the semantic units correspond to elements; wherein the similarity of words determined from extracted semantic units that re-occur across the text corpus are used for forming a compressed knowledge representation of a discovered pattern of relationships between semantic units and respective elements; and an organic software agent, of the at least one data agent, having a belief state formed from the compressed knowledge representation and that modifies its own source code for at least one of decisions and execution of plans by the organic software agent, wherein modifying the source code comprises at least one of autonomous dynamic creation and manipulation of executable code by the organic software agent that, when executed, causes the at least one data agent to perform personalized agent services for a user in response to a user interaction with the system. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer-implemented method, comprising:
-
performing, by at least one data agent in a distributed computer network, functions that include creating a directed or undirected graph representation of features from unstructured data of at least one unstructured data source, wherein the unstructured data source comprises text data from a text corpus; wherein the graph representation of the features is configured for use with at least one machine learning function, the at least one machine learning function including learning semantic associations between words using calculations of similarity of the words based on usage of the words in context over time, wherein the contexts correspond to semantic units and constituents of the semantic units correspond to elements; wherein the similarity of words determined from extracted semantic units that re-occur across the text corpus are used for forming a compressed knowledge representation of a discovered pattern of relationships between semantic units and respective elements; and wherein the compressed knowledge representation forms a belief state of an organic software agent of the at least one data agent; modifying, by the organic software agent, its own source code for at least one of decisions and execution of plans by the organic software agent, wherein modifying the source code comprises at least one of autonomous dynamic creation and manipulation of executable code by the organic software agent that, when executed, causes the at least one data agent to perform personalized agent services for a user in response to a user interaction with the system. - View Dependent Claims (9)
-
-
10. A non-transitory computer-readable medium storing instructions which, when executed by at least one processor in a distributed computer network, cause at least one computer to perform functions that include:
-
performing, by at least one data agent in the distributed computer network, functions that include creating a directed or undirected graph representation of features from unstructured data of at least one unstructured data source, wherein the unstructured data source comprises text data from a text corpus; wherein the graph representation of the features is configured for use with at least one machine learning function, the at least one machine learning function including learning semantic associations between words using calculations of similarity of the words based on usage of the words in context over time, wherein the contexts correspond to semantic units and constituents of the semantic units correspond to elements; wherein the similarity of words determined from extracted semantic units that re-occur across the text corpus are used for forming a compressed knowledge representation of a discovered pattern of relationships between semantic units and respective elements; and wherein the compressed knowledge representation forms a belief state of an organic software agent, of the at least one data agent, that modifies its own source code for at least one of decisions and execution of plans by the organic software agent, wherein modifying the source code comprises at least one of dynamic creation and manipulation of executable code by the organic software agent that, when executed, causes the at least one data agent to perform personalized agent services for a user in response to a user interaction with the system. - View Dependent Claims (11)
-
Specification