A SYSTEM AND METHOD FOR MINING DATA FROM HIGH-VOLUME TEXT STREAMS AND AN ASSOCIATED SYSTEM AND METHOD FOR ANALYZING MINED DATA
First Claim
1. A method of mining data from a text stream, said method comprising:
receiving variables that are relevant to at least one predetermined scenario that characterizes a change;
applying a data mining algorithm to said text stream in order to retrieve data, wherein said variables comprise parameters for said data mining algorithm; and
performing a statistical analysis of said data.
Abstract
Disclosed are embodiments of a method of mining data and a method of evaluating that data in order to discover significant changes in conditions (e.g., changes in activities, events, associations, affiliations, market preferences, etc.). The data mining technique uses predetermined scenarios that characterize specific changes as well as key variables that are relevant to those scenarios. These variables are input as mining parameters into a data mining tool. Retrieved data is analyzed and the results are evaluated. One technique of evaluating the results includes displaying them in a visual format (e.g., graphs, tables) along with additional information (e.g., lists of documents or portions of documents containing data relevant to the displayed results). A user evaluates the displayed results and additional information in order to identify data that should be filtered, to identify trends and/or patterns in the data, and to assess the trends and/or patterns.
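As an illustration only, the flow described in the abstract can be sketched in Python. The sketch assumes that a scenario is represented by a set of keyword variables, that the data mining tool is a simple keyword matcher, and that the statistical analysis is a per-period count. None of these choices comes from the patent, and the names `Scenario`, `mine`, and `count_by_period` are hypothetical.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Scenario:
    """Hypothetical container: a named change plus the variables relevant to it."""
    name: str
    variables: frozenset  # keywords passed to the miner as parameters


def mine(stream, scenario):
    """Yield (period, text) records that mention at least one scenario variable."""
    for period, text in stream:
        words = set(text.lower().split())
        if words & scenario.variables:
            yield period, text


def count_by_period(records):
    """A very simple statistical analysis: retrieved records per period."""
    return Counter(period for period, _ in records)


# Toy text stream of (period, document text) pairs; the data is invented.
stream = [
    ("2024-01", "Acme announces merger talks with Globex"),
    ("2024-01", "Weather remains mild across the region"),
    ("2024-02", "Globex merger approved by regulators"),
    ("2024-02", "Acme acquisition rumours resurface"),
]

scenario = Scenario("ownership change", frozenset({"merger", "acquisition"}))
retrieved = list(mine(stream, scenario))
counts = count_by_period(retrieved)
print(dict(counts))
```

A real text stream would need tokenization, entity resolution, and a more robust matcher. The point here is only that the scenario variables act as the mining parameters and that the analysis runs on whatever the miner retrieves.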
135 Citations
20 Claims
1. A method of mining data from a text stream, said method comprising:
receiving variables that are relevant to at least one predetermined scenario that characterizes a change;
applying a data mining algorithm to said text stream in order to retrieve data, wherein said variables comprise parameters for said data mining algorithm; and
performing a statistical analysis of said data.
Dependent claims: 2, 3, 4, 5
6. A method of evaluating data mined from a text stream, said method comprising:
performing a statistical analysis of said data;
displaying information related to said data, wherein said displaying of said information comprises:
  displaying results of said statistical analysis in at least one visual format; and
  displaying one of portions of documents containing said data, a list of documents containing said data, and at least one document containing said data; and
evaluating said information in order to filter said data and to identify at least one of a trend and a pattern in said data that is suggestive of a change.
Dependent claims: 7, 8, 9, 10, 11, 12
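The evaluation steps of claim 6 can be pictured with a small Python sketch. It assumes mined records are (period, document id) pairs, that the visual format is a plain-text table, that filtering means excluding document ids chosen by the user, and that a trend is flagged when the latest count exceeds the baseline mean by two standard deviations. These are illustrative assumptions, not the patented method, and the function names are hypothetical.

```python
from collections import defaultdict
from statistics import mean, pstdev


def summarize(records, excluded_ids=frozenset()):
    """Group document ids by period, skipping ids the user filtered out."""
    by_period = defaultdict(list)
    for period, doc_id in records:
        if doc_id not in excluded_ids:
            by_period[period].append(doc_id)
    return dict(sorted(by_period.items()))


def render_table(by_period):
    """Return a text table: period, count, and the documents behind the count."""
    lines = [f"{'period':<8} {'count':>5}  documents"]
    for period, ids in by_period.items():
        lines.append(f"{period:<8} {len(ids):>5}  {', '.join(ids)}")
    return "\n".join(lines)


def latest_is_unusual(by_period, threshold=2.0):
    """Flag the latest period if its count exceeds baseline mean + threshold * std."""
    counts = [len(ids) for ids in by_period.values()]
    if len(counts) < 3:
        return False  # too little history to form a baseline
    baseline, latest = counts[:-1], counts[-1]
    return latest > mean(baseline) + threshold * pstdev(baseline)


# Invented records: (period, document id).
records = [
    ("2024-01", "d1"),
    ("2024-02", "d2"), ("2024-02", "d3"),
    ("2024-03", "d4"),
    ("2024-04", "d5"), ("2024-04", "d6"), ("2024-04", "d7"),
    ("2024-04", "d8"), ("2024-04", "d9"), ("2024-04", "d10"),
]

raw = summarize(records)
print(render_table(raw))
print("unusual before filtering:", latest_is_unusual(raw))

# Suppose the user inspects the listed documents and judges d6..d10 irrelevant.
filtered = summarize(records, excluded_ids={"d6", "d7", "d8", "d9", "d10"})
print("unusual after filtering:", latest_is_unusual(filtered))
```

Listing the document ids next to each count is what lets a reviewer decide which records to filter before trusting an apparent spike.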
13. A system for mining and analyzing data from a text stream, said system comprising:
a user-interface adapted to allow a user to input variables that are relevant to at least one user-identified scenario that characterizes a change;
a data mining tool configured to apply a data mining algorithm to said text stream in order to retrieve data, wherein parameters for said mining algorithm comprise said variables; and
an analyzer adapted to perform a statistical analysis of said data and produce results.
Dependent claims: 14, 15, 16, 17, 18, 19, 20
Specification