Due to the disordered characteristic and strong randomness of China's stock market, the typical data mining algorithms currently used to analyze and forecast the stock have imprecise prediction outcomes. In order to solve this problem, based on the industry rotation cycle theory, this paper constructs a new stock selection model combining Decision Tree C5.0 Algorithm and factor analysis. Industry rotation cycle theory aims to analyze the development trend of various industries to find promising industries as initial stock pool. According to this principle, this paper selects four industries and the A-share stocks of these industries are used as initial stock pool. This paper builds a stock index system consisting of six effective factors based on the factor analysis of stocks financial indicators and technical indicators. Then Decision Tree C5.0 Algorithm is presented to realize the prediction of stock returns and the classification of stocks. The empirical test of the proposed stock selection model, using the data from the second and the third quarter of 2017 in China A-share stock market, demonstrates that this model has significant difference in the classification accuracy between low-yielding stocks and high-yielding stocks in that case classification accuracy shows a trend opposite against stock return rate. In a conclusion, this model can effectively help investors to avoid risks and make rational investment but has little effect on obtaining excess return.
Published in | Journal of Investment and Management (Volume 7, Issue 4) |
DOI | 10.11648/j.jim.20180704.12 |
Page(s) | 117-124 |
Creative Commons |
This is an Open Access article, distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution and reproduction in any medium or format, provided the original work is properly cited. |
Copyright |
Copyright © The Author(s), 2018. Published by Science Publishing Group |
Decision Tree C5.0, Factor Analysis, Stock Selection Model Introduction
[1] | Wang Dong, Wu Wen-feng, and Aetna School of Management. "Application of Support Vector Machines Regression in Prediction Shanghai Stock Composite Index." Wuhan University Journal of Natural Sciences8.4(2003):1126-1130. |
[2] | Panigrahi, S. S., and J. K. Mantri. "Epsilon-SVR and decision tree for stock market forecasting." International Conference on Green Computing and Internet of Things IEEE, 2016:761-766. |
[3] | Panigrahi, S. S., and J. K. Mantri. "Epsilon-SVR and decision tree for stock market forecasting." International Conference on Green Computing and Internet of Things IEEE, 2016:761-766. |
[4] | Barak, Sasan, A. Arjmand, and S. Ortobelli. "Fusion of multiple diverse predictors in stock market." Information Fusion 36(2017):90-102. |
[5] | Wei Xiong, “Application of Decision Tree Algorithm in Stock Analysis and Prediction[J].” Computer Knowledge and Technology (academic exchange), 2.9(2007):764-765. |
[6] | Huang Lingqin, The Application of Data Mining in Stock Analysis and Prediction[D]. Dalian University of Technology, 2008,12. |
[7] | Zhang Jingyi, Empirical Research on Stock Investment Based on Data Mining Technology[D]. Chongqing University, 2013,04. |
[8] | Tao Yuyu, Application of Decision Tree and Neural Network in Stock Classification and Forecasting[D]. Hangzhou Dianzi University, 2013,10. |
[9] | Haung Ling, Hu Yang, “Stock Data Mining Based on C4.5 Decision Tree[J].” Computer and Modernization (Periodical)10(2015):21-24. |
[10] | Huang Yue, Analysis of Stock Selection Based on Data Mining Technology[D]. Beijing Foreign Studies University, 2017.06. |
[11] | Shen Jinrong, Stepwise Regression Algorithm Based on Decision Tree and its Application in Stock Prediction[D], Guangdong University of Technology, 2017,06. |
[12] | Zhang Huiheng, “On the theory of industrial life cycle[J].” Finance and Trade Research15.6(2004):7-11. |
[13] | Hu Xiaomei, “The Application of Industry Analysis in Securities Investment Management[J].” Heilongjiang's Foreign Trade and Economic Trade 5(2007):94-96. |
[14] | Wang Huilin, Stock Selection Strategy and Product Design of Medical Theme Private Equity Fund[D], Nanjing University, 2017. |
[15] | Quinlan, R. "Introduction of Decision Trees." Machine Learning1.1(1986):81-106. |
[16] | Quinlan, J. Ross.” C4.5: programs for machine learning.” Morgan Kaufmann Publishers Inc. 1992. |
[17] | Quinlan J R. “Bagging, boosting, and C4.5[C].” Proc of 14th National Conference on Artificial Intelligence, Portland, Oregon, 1996:725-730. |
[18] | Quinlan, J. R. "Simplifying decision trees." International Journal of Man-Machine Studies 27.3(1987):221-234. |
APA Style
Qiansheng Zhang, Jingru Zhang, Zisheng Chen, Miao Zhang, Songying Li. (2018). A New Stock Selection Model Based on Decision Tree C5.0 Algorithm. Journal of Investment and Management, 7(4), 117-124. https://doi.org/10.11648/j.jim.20180704.12
ACS Style
Qiansheng Zhang; Jingru Zhang; Zisheng Chen; Miao Zhang; Songying Li. A New Stock Selection Model Based on Decision Tree C5.0 Algorithm. J. Invest. Manag. 2018, 7(4), 117-124. doi: 10.11648/j.jim.20180704.12
AMA Style
Qiansheng Zhang, Jingru Zhang, Zisheng Chen, Miao Zhang, Songying Li. A New Stock Selection Model Based on Decision Tree C5.0 Algorithm. J Invest Manag. 2018;7(4):117-124. doi: 10.11648/j.jim.20180704.12
@article{10.11648/j.jim.20180704.12, author = {Qiansheng Zhang and Jingru Zhang and Zisheng Chen and Miao Zhang and Songying Li}, title = {A New Stock Selection Model Based on Decision Tree C5.0 Algorithm}, journal = {Journal of Investment and Management}, volume = {7}, number = {4}, pages = {117-124}, doi = {10.11648/j.jim.20180704.12}, url = {https://doi.org/10.11648/j.jim.20180704.12}, eprint = {https://article.sciencepublishinggroup.com/pdf/10.11648.j.jim.20180704.12}, abstract = {Due to the disordered characteristic and strong randomness of China's stock market, the typical data mining algorithms currently used to analyze and forecast the stock have imprecise prediction outcomes. In order to solve this problem, based on the industry rotation cycle theory, this paper constructs a new stock selection model combining Decision Tree C5.0 Algorithm and factor analysis. Industry rotation cycle theory aims to analyze the development trend of various industries to find promising industries as initial stock pool. According to this principle, this paper selects four industries and the A-share stocks of these industries are used as initial stock pool. This paper builds a stock index system consisting of six effective factors based on the factor analysis of stocks financial indicators and technical indicators. Then Decision Tree C5.0 Algorithm is presented to realize the prediction of stock returns and the classification of stocks. The empirical test of the proposed stock selection model, using the data from the second and the third quarter of 2017 in China A-share stock market, demonstrates that this model has significant difference in the classification accuracy between low-yielding stocks and high-yielding stocks in that case classification accuracy shows a trend opposite against stock return rate. In a conclusion, this model can effectively help investors to avoid risks and make rational investment but has little effect on obtaining excess return.}, year = {2018} }
TY - JOUR T1 - A New Stock Selection Model Based on Decision Tree C5.0 Algorithm AU - Qiansheng Zhang AU - Jingru Zhang AU - Zisheng Chen AU - Miao Zhang AU - Songying Li Y1 - 2018/09/21 PY - 2018 N1 - https://doi.org/10.11648/j.jim.20180704.12 DO - 10.11648/j.jim.20180704.12 T2 - Journal of Investment and Management JF - Journal of Investment and Management JO - Journal of Investment and Management SP - 117 EP - 124 PB - Science Publishing Group SN - 2328-7721 UR - https://doi.org/10.11648/j.jim.20180704.12 AB - Due to the disordered characteristic and strong randomness of China's stock market, the typical data mining algorithms currently used to analyze and forecast the stock have imprecise prediction outcomes. In order to solve this problem, based on the industry rotation cycle theory, this paper constructs a new stock selection model combining Decision Tree C5.0 Algorithm and factor analysis. Industry rotation cycle theory aims to analyze the development trend of various industries to find promising industries as initial stock pool. According to this principle, this paper selects four industries and the A-share stocks of these industries are used as initial stock pool. This paper builds a stock index system consisting of six effective factors based on the factor analysis of stocks financial indicators and technical indicators. Then Decision Tree C5.0 Algorithm is presented to realize the prediction of stock returns and the classification of stocks. The empirical test of the proposed stock selection model, using the data from the second and the third quarter of 2017 in China A-share stock market, demonstrates that this model has significant difference in the classification accuracy between low-yielding stocks and high-yielding stocks in that case classification accuracy shows a trend opposite against stock return rate. In a conclusion, this model can effectively help investors to avoid risks and make rational investment but has little effect on obtaining excess return. VL - 7 IS - 4 ER -