[1]任雪妮,罗幼喜. 基于bagging算法的经济金融数据分析[J].湖北工业大学学报,2021,(2):110-114.
 REN Xueni,LUO Youxi. Research and Analysis of Economic and Financial Data Based on Bagging Algorithm[J].,2021,(2):110-114.
点击复制

 基于bagging算法的经济金融数据分析()
分享到:

《湖北工业大学学报》[ISSN:1003-4684/CN:42-1752/Z]

卷:
期数:
2021年第2期
页码:
110-114
栏目:
湖北工业大学学报
出版日期:
2021-04-22

文章信息/Info

Title:
 Research and Analysis of Economic and Financial Data Based on Bagging Algorithm
文章编号:
1003-4684(2021)02-0110-05
作者:
 任雪妮 罗幼喜
 湖北工业大学理学院, 湖北 武汉 430068
Author(s):
 REN Xueni LUO Youxi
 School of Sciences, Hubei Univ. of Tech., Wuhan 430068, China
关键词:
 bagging算法C5.0决策树算法KNN算法朴素贝叶斯算法经济金融数据
Keywords:
 bagging algorithm C5.0 decision tree algorithm KNN algorithm Naive bayes algorithmEconomic and financial data
分类号:
F22
文献标志码:
A
摘要:
 为了研究弱分类器的个数及种类的构成对强分类器预测准确性的影响,以及在不同经济金融数据的预测情况,选择credit,bank,stock,audit数据集模拟bagging算法分类器的个数和分类器构成方法不同时的预测情况。其中由KNN算法、C5.0算法和朴素贝叶斯算法构造了7种不同的组合方法。总的来看,基于bagging的C5.0决策树方法并设置分类器个数为50时预测效果最佳。对bank数据集设置最佳的方法和分类器个数进行实例分析,准确性达到94.17%,预测情况良好。
Abstract:
 In order to study the influence of the number and type composition of weak classifiers on the accuracy of strong classifier prediction and the prediction of different economic and financial data, the data sets of Credit, Bank, Stock and Audit are selected to simulate the situation where the number of classifiers and the composition of classifiers of Bagging algorithm. Among them, 7 different combination methods are constructed by KNN algorithm, C5.0 algorithm and Naive Bayes algorithm. In general, the C5.0 decision tree method based on Bagging has the best prediction effect when the number of classifiers is set to 50. Finally, an example analysis of the Bank data set shows that the accuracy reaches 94.17% and the prediction is good.

参考文献/References:

[1] Breiman L. Bagging predictors [J].Machine Learning, 1996, 24 :123-140.
[2] Opitz D, Richard Maclin. Popular ensemble methods: an empirical study[J]. Journal of Artificial Intelligence Research, 1999,11:169-198.
[3] Wezel M V, Rob Potharst. Improved customer choice predictions using ensemble methods[J]. European Journal of Operational Research,2007,181:436-452.
[4] Chrzanowska M, Esteban Alfaro, Dorota Witkowska. The individual borrowers recognition: Single and ensemble trees[J]. Expert Systems with Applications,2009,36 : 6409-6414.
[5] Inoue A, Lutz Kilian. How useful is bagging in forecasting economic time series? a case study of U.S. CPI Inflation[J].Journal of the American Statistical Association, 2008, 103:511-522.
[6] Choprab A, Bhilare P. Application of ensemble models in credit scoring models [J]. Business Perspectives and Research, 2018, 6(2): 129 -141.
[7] Cover T, Hart P. Nearest neighbor pattern classification[J]. Institute of Electrical and Electronics Engineers, 1967,13(1):21-27.
[8] 韩佳伟.数据挖掘概念与技术[M].第3版.北京:机械工业出版社,2012: 213-230.

相似文献/References:

[1]熊韧,曹海印,王焱清,等.非牛顿润滑静压轴承的节流器流量方程修正[J].湖北工业大学学报,2019,34(5):6.
 XIONG Ren,CAO Haiyin,WANG Yanqing,et al.Modified restrictor flow equations of hydrostatic bearings ubricated by non-Newtonian fluids[J].,2019,34(2):6.
[2]王照远,曹 民,王 毅,等. 场景与数据双驱动的隧道图像拼接方法[J].湖北工业大学学报,2020,(4):11.
 WANG Zhaoyuan,CAO Min,WANG Yi,et al. Tunnel Image Stitching Method based on Scene and Data[J].,2020,(2):11.
[3]潘 健,梁佳成,陈凤娇,等. 单电流闭环多重PR控制的LCL型逆变器[J].湖北工业大学学报,2020,(4):16.
 PAN Jian,LIANG Jiacheng,CHEN Fengjiao,et al. Design of LCL Grid Connected Inverter based on Single Closed Loop Control and Multiple PR Controllers[J].,2020,(2):16.
[4]王晓光,赵 萌,文益雪,等. 定子闭口槽结构对永磁电机齿槽转矩影响分析[J].湖北工业大学学报,2020,(4):25.
 WANG Xiaoguang,ZHAO Meng,WEN Yixue,et al. Study on Cogging Torque and Vibration Noise of Permanent Magnet Motor with Segmental Stator and Closed-Slot[J].,2020,(2):25.
[5]宇 卫,凃玲英,陈 健. 风电场集中接入对集电线电流保护的影响[J].湖北工业大学学报,2020,(4):29.
 YU Wei,TU Lingying,CHEN Jian. Effect of the Collective Line Current Protection when Wind Farms are Centralized Accessed to the Power System[J].,2020,(2):29.
[6]廖政斌,王泽飞,祝 珊. 二惯量系统谐振在线抑制及相位补偿[J].湖北工业大学学报,2020,(4):34.
 LIAO Zhengbin,WANG Zefei,ZHU Shan. Online Resonance Suppression and Phase Compensation for Double Inertia System[J].,2020,(2):34.
[7]王 欣,游 颖,姜天翔,等. 面向3D打印过程的产品工艺设计和优化[J].湖北工业大学学报,2020,(4):39.
 WANG Xin,YOU Ying,JIANG Tianxiang,et al. Product Process Design and Optimization for 3D Printing Processes[J].,2020,(2):39.
[8]冉晶晶,文 红,罗雅梅,等. 全自动样品前处理平台及其控制系统[J].湖北工业大学学报,2020,(4):43.
 RAN Jingjing,WEN Hong,LUO Yamei,et al. Research on Automatic Sample Preprocessing Platform and its Control System[J].,2020,(2):43.
[9]杨 磊,马志艳,石 敏,等. 基于模糊PID的小型冷库过热度控制方法[J].湖北工业大学学报,2020,(4):43.
 YANG Lei,MA Zhiyan,SHI Min,et al. Research on Superheat Control Method of Small Cold Storage based on Fuzzy PID[J].,2020,(2):43.
[10]黄 晶,周细枝,周业望. 动态注塑成型模具的设计与实验研究[J].湖北工业大学学报,2020,(4):52.
 HUANG Jing,ZHOU Xizhi,ZHOU Yewang. Design and Experimental Study of Dynamic Injection Molding[J].,2020,(2):52.

备注/Memo

备注/Memo:
 [收稿日期] 2020-10-09
[基金项目] 国家社会科学基金项目(17BJY210)
[第一作者] 任雪妮(1996-), 女,湖北仙桃人,湖北工业大学硕士研究生,研究方向为数据科学与决策
[通信作者] 罗幼喜(1979-), 男,湖北黄冈人,理学博士,湖北工业大学教授,研究方向为计量经济建模,数据挖掘
更新日期/Last Update: 2021-04-23