Knowledge discovery appliance
First Claim
1. A computer-implemented method of analyzing data in a local network to determine relevance to a topic, the method comprising:
- deploying one or more data retrieval modules to interface with one or more data sources within the local network, wherein the one or more data retrieval modules comprise executable program code;
executing the one or more data retrieval modules to perform operations comprising;
accessing application data representing data stored or communicated through the one or more data sources;
converting the application data into a normalized format; and
forwarding the normalized application data for analysis; and
analyzing the forwarded application data to determine whether it is relevant to the topic, wherein the analyzing is performed by one or more devices within the local network, the analyzing comprising;
calculating one or more relevancy scores for the forwarded application data;
classifying as relevant forwarded application data having scores above a first threshold;
classifying as non-relevant forwarded application data having scores below a second threshold, wherein the second threshold is lower than the first threshold;
classifying as review-pending forwarded application data having scores between the first threshold and the second threshold;
forwarding the review-pending forwarded application data to one or more analysts;
receiving input from the one or more analysts as to whether at least a portion of the forwarded review-pending application data is relevant; and
modifying one or more automated algorithms used for determining relevance based on the input from the one or more analysts.
14 Assignments
0 Petitions
Accused Products
Abstract
Methods and systems for collecting and processing large volumes of data to determine the relevancy and value thereof comprise: deploying one or more data retrieval modules to interface with one or more data sources within the local network, wherein the one or more data retrieval modules comprise executable program code; executing the one or more data retrieval modules to perform operations comprising: accessing application data representing data stored or communicated through the one or more data sources; and forwarding the application data for analysis; and analyzing the forwarded application data to determine whether it is relevant to the topic, wherein the analyzing is performed by one or more devices within the local network.
19 Citations
21 Claims
-
1. A computer-implemented method of analyzing data in a local network to determine relevance to a topic, the method comprising:
-
deploying one or more data retrieval modules to interface with one or more data sources within the local network, wherein the one or more data retrieval modules comprise executable program code; executing the one or more data retrieval modules to perform operations comprising; accessing application data representing data stored or communicated through the one or more data sources; converting the application data into a normalized format; and forwarding the normalized application data for analysis; and analyzing the forwarded application data to determine whether it is relevant to the topic, wherein the analyzing is performed by one or more devices within the local network, the analyzing comprising; calculating one or more relevancy scores for the forwarded application data; classifying as relevant forwarded application data having scores above a first threshold; classifying as non-relevant forwarded application data having scores below a second threshold, wherein the second threshold is lower than the first threshold; classifying as review-pending forwarded application data having scores between the first threshold and the second threshold; forwarding the review-pending forwarded application data to one or more analysts; receiving input from the one or more analysts as to whether at least a portion of the forwarded review-pending application data is relevant; and modifying one or more automated algorithms used for determining relevance based on the input from the one or more analysts. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system for analyzing data in a network to determine relevance to a topic, the system comprising:
-
an application system comprising one or more devices within the network configured to operate one or more data sources, wherein the one or more devices are additionally configured to execute one or more data retrieval modules, the data retrieval modules configured to; access application data representing data stored or communicated through the one or more data sources; converting the application data into a normalized format; and forwarding the normalized application data for analysis; and an analysis system comprising one or more devices within the network configured to analyze the forwarded application data to determine whether it is relevant to the topic, the analysis system is configured to; calculate one or more relevancy scores for the forwarded application data; classify as relevant forwarded application data having scores above a first threshold; classifying as non-relevant forwarded application data having scores below a second threshold, wherein the second threshold is lower than the first threshold; classify as review-pending forwarded application data having scores between the first threshold and the second threshold; forward the review-pending forwarded application data to one or more analysts; receive input from the one or more analysts as to whether at least a portion of the forwarded review-pending application data is relevant; and modify one or more automated algorithms used for determining relevance based on the input from the one or more analysts. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A computer-implemented method of analyzing data in a network to determine relevance to a topic, the method comprising:
-
deploying one or more data retrieval modules to interface with one or more data sources within the network, wherein the one or more data retrieval modules comprise executable program code; executing the one or more data retrieval modules to perform operations comprising; accessing application data representing data stored or communicated through the one or more data sources; converting the application data into a normalized format; and forwarding the application data for analysis; and analyzing the forwarded application data to determine whether it is relevant to the topic, wherein the analyzing is performed by one or more devices within the network, the analyzing comprising; excluding forwarded application data satisfying one or more preliminary exclusion criteria; analyzing the forwarded application data to determine one or more answers to one or more questions about the forwarded application data; generating a set of metadata about the forwarded application data based on the one or more answers; calculating one or more relevancy scores for the forwarded application data based on the first set of metadata; classifying as relevant forwarded application data having scores above a first threshold; classifying as non-relevant forwarded application data having scores below a second threshold, wherein the second threshold is lower than the first threshold; classifying as review-pending forwarded application data having scores between the first threshold and the second threshold; forwarding the review-pending forwarded application data to one or more analysts; receiving input from the one or more analysts as to whether the review-pending forwarded application data is relevant; and modifying one or more automated algorithms used for determining relevance based on the input from the one or more analysts.
-
Specification