Financial time series forecasting using support vector machines
Kyoung-jae Kim∗
Department of Information Systems, College of Business Administration, Dongguk University, 3-26, Pil-dong, Chung-gu, Seoul 100715, South Korea
Received 28 February 2002; accepted 13 March 2003
Abstract
Support vector machines (SVMs) are promising methods for the prediction of financial time series because they use a risk function consisting of the empirical error and a regularized term which is derived from the structural risk minimization principle. This study applies SVM to predicting the stock price index. In addition, this study examines the feasibility of applying SVM in financial forecasting by comparing it with back-propagation neural networks and case-based reasoning. The experimental results show that SVM provides a promising alternative approach to stock market prediction.
© 2003 Elsevier B.V. All rights reserved.
Keywords: Support vector machines; Back-propagation neural networks; Case-based reasoning; Financial time series
1. Introduction
Stock market prediction is regarded as a challenging task in financial time-series prediction. Many studies in this area have used artificial neural networks (ANNs), and a large number of successful applications have shown that ANNs can be a very useful tool for time-series modeling and forecasting. Early studies focused on applying ANNs to stock market prediction (for instance [2,6,11,13,19,23]). More recent research tends to hybridize several artificial intelligence (AI) techniques (for instance [10,22]). Some researchers also include novel factors in the learning process. Kohara et al. incorporated prior knowledge to improve the
Tel: +82-2-2260-3324; fax: +82-2-2260-8824. E-mail address: email@example.com (K.-j. Kim).
0925-2312/03/$ - see front matter © 2003 Elsevier B.V. All rights reserved. doi:10.1016/S0925-2312(03)00372-2
K.-j. Kim / Neurocomputing 55 (2003) 307 – 319
performance of stock market prediction. Tsaih et al. integrated a rule-based technique and ANN to predict the direction of the S&P 500 stock index futures on a daily basis. Quah and Srinivasan proposed an ANN stock selection system to select stocks that are top performers in the market and to avoid selecting underperformers. They concluded that the portfolio of the proposed model outperformed the portfolios of the benchmark model in terms of compounded actual returns over time. Kim and Han proposed a genetic algorithm approach to feature discretization and the determination of connection weights for ANN to predict the stock price index. They suggested that their approach reduced the dimensionality of the feature space and enhanced the prediction performance.
Some of these studies, however, showed that ANN had some limitations in learning the patterns because stock market data has tremendous noise and complex dimensionality. ANN often exhibits inconsistent and unpredictable performance on noisy data. Moreover, the back-propagation (BP) neural network, the most popular neural network model, suffers from the difficulty of selecting a large number of controlling parameters, which include the relevant input variables, hidden layer size, learning rate, and momentum term.
Recently, the support vector machine (SVM), a novel neural network algorithm, was developed by Vapnik and his colleagues. Whereas many traditional neural network models implement the empirical risk minimization principle, SVM implements the structural risk minimization principle. The former seeks to minimize the misclassification error, or the deviation from the correct solution, on the training data, while the latter seeks to minimize an upper bound on the generalization error. In addition, the solution of SVM may be a global optimum, while other neural network models may tend to fall into a locally optimal solution. Thus, overfitting is unlikely to occur with...