System and method for forecasting fluctuations in future data and particularly for forecasting security prices by news analysis
First Claim
1. A method of predicting the future performance of one or more predefined securities, the method including:
- receiving raw data representing language including sentences relating to one or more predefined securities whose future performance is to be predicted;
scanning the raw data for references to one or more of the predefined securities and providing the reference as a standard representation thereof;
preprocessing the sentences containing references to at least one of said one or more predefined securities to provide a relationship structure of one or more words in the preprocessed sentences; and
providing a training model for one or more of the relationship structures to predict future performance of one or more of the predefined securities.
0 Assignments
0 Petitions
Accused Products
Abstract
A system and method for predicting price fluctuations in financial markets. Our approach utilizes both market history and public news articles, published before the beginning of trading each day, to produce a set of recommended investment actions. We empirically show that these markets are surprisingly predictable, even by purely market-historical techniques. Furthermore, analyzing relevant news articles captures information features independent of the markets history, and combining the two methods significantly improves results. Capturing usable features from news articles requires some linguistic sophistication the standard naïve bag-f-words approach does not yield predictive features. Instead, we use part-of-speech tagging, dependency parsing and semantic role labeling to generate features that improve system accuracy. We evaluate our system on eight political prediction markets from 2004 and show that we can make effective investment decisions based on our systems predictions, whose profits greatly exceed those generated by a baseline system.
64 Citations
4 Claims
-
1. A method of predicting the future performance of one or more predefined securities, the method including:
-
receiving raw data representing language including sentences relating to one or more predefined securities whose future performance is to be predicted; scanning the raw data for references to one or more of the predefined securities and providing the reference as a standard representation thereof; preprocessing the sentences containing references to at least one of said one or more predefined securities to provide a relationship structure of one or more words in the preprocessed sentences; and providing a training model for one or more of the relationship structures to predict future performance of one or more of the predefined securities. - View Dependent Claims (2, 3, 4)
-
Specification