• Previous Article
    An incremental nonsmooth optimization algorithm for clustering using $ L_1 $ and $ L_\infty $ norms
  • JIMO Home
  • This Issue
  • Next Article
    Bargaining equilibrium in a two-echelon supply chain with a capital-constrained retailer
November  2020, 16(6): 2743-2756. doi: 10.3934/jimo.2019078

Corporate and personal credit scoring via fuzzy non-kernel SVM with fuzzy within-class scatter

1. 

School of Management Science and Engineering, Dongbei University of Finance and Economics, Dalian 116025, China

2. 

School of Business Administration and Collaborative Innovation Center of Financial Security, Southwestern University of Finance and Economics, Chengdu 611130, China

* Corresponding author

Received  August 2018 Revised  March 2019 Published  July 2019

Fund Project: The first author is supported by NNSFC grant # 71701035 and # 71831003

Nowadays, the effective credit scoring becomes a very crucial factor for gaining competitive advantages in credit market for both customers and corporations. In this paper, we propose a credit scoring method which combines the non-kernel fuzzy 2-norm quadratic surface SVM model, T-test feature weighting strategy and fuzzy within-class scatter together. It is worth pointing out that this new method not only saves computational time by avoiding choosing a kernel and corresponding parameters in the classical SVM models, but also addresses the "curse of dimensionality" issue and improves the robustness. Besides, we develop an efficient way to calculate the fuzzy membership of each training point by solving a linear programming problem. Finally, we conduct several numerical tests on two benchmark data sets of personal credit and one real-world data set of corporation credit. The numerical results strongly demonstrate that the proposed method outperforms eight state-of-the-art and commonly-used credit scoring methods in terms of accuracy and robustness.

Citation: Jian Luo, Xueqi Yang, Ye Tian, Wenwen Yu. Corporate and personal credit scoring via fuzzy non-kernel SVM with fuzzy within-class scatter. Journal of Industrial & Management Optimization, 2020, 16 (6) : 2743-2756. doi: 10.3934/jimo.2019078
References:
[1]

W. An and M. Liang, Fuzzy support vector machine based on within-class scatter for classification problems with outliers or noises, Neurocomputing, 110 (2013), 101-110.  doi: 10.1016/j.neucom.2012.11.023.  Google Scholar

[2]

K. Bache and M. Lichman, Uci machine learning repository, http://archive.ics.uci.edu/ml, 2013. Google Scholar

[3]

Y. BaiX. HanT. Chen and H. Yu, Quadratic kernel-free least squares support vector machine for target diseases classification, Journal of Combinatorial Optimization, 30 (2015), 850-870.  doi: 10.1007/s10878-015-9848-z.  Google Scholar

[4]

G. Baudat and F. Anouar, Generalized discriminant analysis using a kernel approach, Neural Computation, 12 (2000), 2385-2404.  doi: 10.1162/089976600300014980.  Google Scholar

[5] S. Boyd and L. Vandenberghe, Convex Optimization, Cambridge University Press, New York, 2004.  doi: 10.1017/CBO9780511804441.  Google Scholar
[6]

I. Dagher, Quadratic kernel-free non-linear support vector machine, Journal of Global Optimization, 41 (2008), 15-30.  doi: 10.1007/s10898-007-9162-0.  Google Scholar

[7]

R. Fisher, The use of multiple measurements in taxonomic problems, Annals of Human Genetics, 7 (1936), 179-188.  doi: 10.1111/j.1469-1809.1936.tb02137.x.  Google Scholar

[8]

T. GestelB. Baesens and J. Garcia, A support vector machine approach to credit scoring, Journal of Bank and Finance, 2 (2003), 73-82.   Google Scholar

[9]

J. Han and M. Kamber, Data Mining: Concepts and Techniques, 2nd edition, Morgan Kaufmann, San Francisco, CA, 2006. Google Scholar

[10]

L. Han and H. Zhao, Orthogonal support vector machine for credit scoring, Engineering Applications of Artificial Intelligence, 26 (2013), 848-862.  doi: 10.1016/j.engappai.2012.10.005.  Google Scholar

[11]

T. Harris, Credit scoring using the clustered support vector machine, Expert Systems with Applications, 42 (2015), 741-750.  doi: 10.1016/j.eswa.2014.08.029.  Google Scholar

[12]

C. HuangM. Chen and C. Wang, Credit scoring with a data mining approach based on support vector machines, Expert Systems with Applications, 33 (2007), 847-856.  doi: 10.1016/j.eswa.2006.07.007.  Google Scholar

[13]

X. JiangY. Zhang and J. Lv, Fuzzy svm with a new fuzzy membership function, Neural Computing and Applications, 15 (2006), 268-276.  doi: 10.1007/s00521-006-0028-z.  Google Scholar

[14]

C. Lin and S. Wang, Fuzzy support vector machines, IEEE Transactions on Neural Networks, 13 (2002), 464-471.   Google Scholar

[15]

F. Liu and X. Xue, Subgradient-based neural network for nonconvex optimization problems in support vector machines with indefinite kernels, Journal of Industrial and Management Optimization, 12 (2016), 285-301.  doi: 10.3934/jimo.2016.12.285.  Google Scholar

[16]

J. LuoS.-C. FangY. Bai and Z. Deng, Fuzzy quadratic surface support vector machine based on fisher discriminant analysis, Journal of Industrial and Management Optimization, 12 (2016), 357-373.  doi: 10.3934/jimo.2016.12.357.  Google Scholar

[17]

J. Luo, S.-C. Fang, Z. Deng and X. Guo, Quadratic surface support vector machine for binary classification, Asia-Pacific Journal Of Operational Research, 33 (2016), 1650046. doi: 10.1142/S0217595916500469.  Google Scholar

[18]

A. MarquesV. Garcia and J. Sanchez, On the suitability of resampling techniques for the class imbalance problem in credit scoring, Journal of the Operational Research Society, 64 (2013), 1060-1070.  doi: 10.1057/jors.2012.120.  Google Scholar

[19]

D. Martin, Early warning of bank failure: a logistic regression approach, Journal of Banking and Finance, 1 (1977), 249-276.   Google Scholar

[20] B. Schölkopf and A. Smola, Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond, MIT Press, Cambridge, MA, 2002.  doi: 10.1016/B978-044451378-6/50001-6.  Google Scholar
[21]

Y. TianM. SunZ. DengJ. Luo and Y. Li, A new fuzzy set and non-kernel svm approach for mislabeled binary classification with applications, IEEE Transactions on Fuzzy Systems, 25 (2017), 1536-1545.   Google Scholar

[22]

W. TungaC. Queka and P. Cheng, Genso-ews: A novel neural-fuzzy based early warning system for predicting bank failures, Neural Networks, 17 (2004), 567-587.  doi: 10.1016/j.neunet.2003.11.006.  Google Scholar

[23]

J. Wiginton, A note on the comparison of logic and discriminant models of customer credit behavior, Journal of Financial and Quantitative Analysis, 15 (1980), 757-770.   Google Scholar

[24]

X. YanY. BaiS.-C. Fang and J. Luo, A kernel-free quadratic surface support vector machine for semi-supervised learning, Journal of the Operational Research Society, 67 (2016), 1001-1011.  doi: 10.1007/s10957-015-0843-4.  Google Scholar

[25]

X. ZhangX. Xiao and G. Xu, Fuzzy support vector machine based on affinity among samples, Journal of Software, 17 (2006), 951-958.  doi: 10.1360/jos170951.  Google Scholar

[26]

H. ZhongC. MiaoZ. Shen and Y. Feng, Comparing the learning effectiveness of BP, ELM, I-ELM, and SVM for corporate credit ratings, Neurocomputing, 128 (2014), 285-295.  doi: 10.1016/j.neucom.2013.02.054.  Google Scholar

[27]

L. ZhouK. Lai and J. Yen, Credit scoring models with auc maximization based on weighted svm, International Journal of Information Technology and Decision Making, 4 (2009), 677-696.  doi: 10.1142/S0219622009003582.  Google Scholar

show all references

References:
[1]

W. An and M. Liang, Fuzzy support vector machine based on within-class scatter for classification problems with outliers or noises, Neurocomputing, 110 (2013), 101-110.  doi: 10.1016/j.neucom.2012.11.023.  Google Scholar

[2]

K. Bache and M. Lichman, Uci machine learning repository, http://archive.ics.uci.edu/ml, 2013. Google Scholar

[3]

Y. BaiX. HanT. Chen and H. Yu, Quadratic kernel-free least squares support vector machine for target diseases classification, Journal of Combinatorial Optimization, 30 (2015), 850-870.  doi: 10.1007/s10878-015-9848-z.  Google Scholar

[4]

G. Baudat and F. Anouar, Generalized discriminant analysis using a kernel approach, Neural Computation, 12 (2000), 2385-2404.  doi: 10.1162/089976600300014980.  Google Scholar

[5] S. Boyd and L. Vandenberghe, Convex Optimization, Cambridge University Press, New York, 2004.  doi: 10.1017/CBO9780511804441.  Google Scholar
[6]

I. Dagher, Quadratic kernel-free non-linear support vector machine, Journal of Global Optimization, 41 (2008), 15-30.  doi: 10.1007/s10898-007-9162-0.  Google Scholar

[7]

R. Fisher, The use of multiple measurements in taxonomic problems, Annals of Human Genetics, 7 (1936), 179-188.  doi: 10.1111/j.1469-1809.1936.tb02137.x.  Google Scholar

[8]

T. GestelB. Baesens and J. Garcia, A support vector machine approach to credit scoring, Journal of Bank and Finance, 2 (2003), 73-82.   Google Scholar

[9]

J. Han and M. Kamber, Data Mining: Concepts and Techniques, 2nd edition, Morgan Kaufmann, San Francisco, CA, 2006. Google Scholar

[10]

L. Han and H. Zhao, Orthogonal support vector machine for credit scoring, Engineering Applications of Artificial Intelligence, 26 (2013), 848-862.  doi: 10.1016/j.engappai.2012.10.005.  Google Scholar

[11]

T. Harris, Credit scoring using the clustered support vector machine, Expert Systems with Applications, 42 (2015), 741-750.  doi: 10.1016/j.eswa.2014.08.029.  Google Scholar

[12]

C. HuangM. Chen and C. Wang, Credit scoring with a data mining approach based on support vector machines, Expert Systems with Applications, 33 (2007), 847-856.  doi: 10.1016/j.eswa.2006.07.007.  Google Scholar

[13]

X. JiangY. Zhang and J. Lv, Fuzzy svm with a new fuzzy membership function, Neural Computing and Applications, 15 (2006), 268-276.  doi: 10.1007/s00521-006-0028-z.  Google Scholar

[14]

C. Lin and S. Wang, Fuzzy support vector machines, IEEE Transactions on Neural Networks, 13 (2002), 464-471.   Google Scholar

[15]

F. Liu and X. Xue, Subgradient-based neural network for nonconvex optimization problems in support vector machines with indefinite kernels, Journal of Industrial and Management Optimization, 12 (2016), 285-301.  doi: 10.3934/jimo.2016.12.285.  Google Scholar

[16]

J. LuoS.-C. FangY. Bai and Z. Deng, Fuzzy quadratic surface support vector machine based on fisher discriminant analysis, Journal of Industrial and Management Optimization, 12 (2016), 357-373.  doi: 10.3934/jimo.2016.12.357.  Google Scholar

[17]

J. Luo, S.-C. Fang, Z. Deng and X. Guo, Quadratic surface support vector machine for binary classification, Asia-Pacific Journal Of Operational Research, 33 (2016), 1650046. doi: 10.1142/S0217595916500469.  Google Scholar

[18]

A. MarquesV. Garcia and J. Sanchez, On the suitability of resampling techniques for the class imbalance problem in credit scoring, Journal of the Operational Research Society, 64 (2013), 1060-1070.  doi: 10.1057/jors.2012.120.  Google Scholar

[19]

D. Martin, Early warning of bank failure: a logistic regression approach, Journal of Banking and Finance, 1 (1977), 249-276.   Google Scholar

[20] B. Schölkopf and A. Smola, Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond, MIT Press, Cambridge, MA, 2002.  doi: 10.1016/B978-044451378-6/50001-6.  Google Scholar
[21]

Y. TianM. SunZ. DengJ. Luo and Y. Li, A new fuzzy set and non-kernel svm approach for mislabeled binary classification with applications, IEEE Transactions on Fuzzy Systems, 25 (2017), 1536-1545.   Google Scholar

[22]

W. TungaC. Queka and P. Cheng, Genso-ews: A novel neural-fuzzy based early warning system for predicting bank failures, Neural Networks, 17 (2004), 567-587.  doi: 10.1016/j.neunet.2003.11.006.  Google Scholar

[23]

J. Wiginton, A note on the comparison of logic and discriminant models of customer credit behavior, Journal of Financial and Quantitative Analysis, 15 (1980), 757-770.   Google Scholar

[24]

X. YanY. BaiS.-C. Fang and J. Luo, A kernel-free quadratic surface support vector machine for semi-supervised learning, Journal of the Operational Research Society, 67 (2016), 1001-1011.  doi: 10.1007/s10957-015-0843-4.  Google Scholar

[25]

X. ZhangX. Xiao and G. Xu, Fuzzy support vector machine based on affinity among samples, Journal of Software, 17 (2006), 951-958.  doi: 10.1360/jos170951.  Google Scholar

[26]

H. ZhongC. MiaoZ. Shen and Y. Feng, Comparing the learning effectiveness of BP, ELM, I-ELM, and SVM for corporate credit ratings, Neurocomputing, 128 (2014), 285-295.  doi: 10.1016/j.neucom.2013.02.054.  Google Scholar

[27]

L. ZhouK. Lai and J. Yen, Credit scoring models with auc maximization based on weighted svm, International Journal of Information Technology and Decision Making, 4 (2009), 677-696.  doi: 10.1142/S0219622009003582.  Google Scholar

Table 1.  Credit Data Sets
data set # of features Class $ C_1 $ Class $ C_2 $
name # of points name # of points
German 20 Creditworthy 700 Non-creditworthy 300
Australian 14 Non-default 383 Default 307
Chinese 7 Good credit 58 Bad credit 48
data set # of features Class $ C_1 $ Class $ C_2 $
name # of points name # of points
German 20 Creditworthy 700 Non-creditworthy 300
Australian 14 Non-default 383 Default 307
Chinese 7 Good credit 58 Bad credit 48
Table 2.  German Credit Data Test
model misclassification rate (%) CPU time (s)
mean std
LOG_REG 23.04 0.35 0.14
FFBP_NN 24.30 0.57 3.83
SVM_GausKer 24.31 0.71 3.30
W2NSVM_GausKer 23.85 0.56 5.72
W2NSVM_QuadKer 23.92 0.81 5.36
FSVMWCS_GausKer 23.42 1.84 6.87
Clu_SVM 24.49 0.71 0.25
Dagher's QSVM 24.26 0.62 4.63
SQSSVM 23.86 0.59 2.82
FNKSVM-FWS 21.36 0.51 4.23
model misclassification rate (%) CPU time (s)
mean std
LOG_REG 23.04 0.35 0.14
FFBP_NN 24.30 0.57 3.83
SVM_GausKer 24.31 0.71 3.30
W2NSVM_GausKer 23.85 0.56 5.72
W2NSVM_QuadKer 23.92 0.81 5.36
FSVMWCS_GausKer 23.42 1.84 6.87
Clu_SVM 24.49 0.71 0.25
Dagher's QSVM 24.26 0.62 4.63
SQSSVM 23.86 0.59 2.82
FNKSVM-FWS 21.36 0.51 4.23
Table 3.  Australian Credit Data Test
model misclassification rate (%) CPU time (s)
mean std
LOG_REG 13.56 0.27 0.12
FFBP_NN 14.42 1.16 2.72
SVM_GausKer 15.00 1.06 1.30
W2NSVM_GausKer 14.87 0.53 2.73
W2NSVM_QuadKer 14.59 0.46 3.01
FSVMWCS_GausKer 14.63 3.68 3.75
Clu_SVM 14.34 0.53 0.16
Dagher's QSVM 26.42 1.23 1.63
SQSSVM 14.57 0.57 0.80
FNKSVM-FWS 11.96 0.43 1.56
model misclassification rate (%) CPU time (s)
mean std
LOG_REG 13.56 0.27 0.12
FFBP_NN 14.42 1.16 2.72
SVM_GausKer 15.00 1.06 1.30
W2NSVM_GausKer 14.87 0.53 2.73
W2NSVM_QuadKer 14.59 0.46 3.01
FSVMWCS_GausKer 14.63 3.68 3.75
Clu_SVM 14.34 0.53 0.16
Dagher's QSVM 26.42 1.23 1.63
SQSSVM 14.57 0.57 0.80
FNKSVM-FWS 11.96 0.43 1.56
Table 4.  Chinese Credit Data Test
model misclassification rate (%) CPU time (s)
mean std
LOG_REG 7.56 0.57 0.235
FFBP_NN 24.01 2.25 4.412
SVM_GausKer 13.75 0.90 0.034
W2NSVM_GausKer 12.13 1.89 0.053
W2NSVM_QuadKer 12.07 2.01 0.062
FSVMWCS_GausKer 21.18 2.88 0.063
Clu_SVM 10.96 0.55 0.048
Dagher's QSVM 11.24 2.33 0.087
SQSSVM 10.87 1.96 0.056
FNKSVM-FWS 8.50 0.51 0.083
model misclassification rate (%) CPU time (s)
mean std
LOG_REG 7.56 0.57 0.235
FFBP_NN 24.01 2.25 4.412
SVM_GausKer 13.75 0.90 0.034
W2NSVM_GausKer 12.13 1.89 0.053
W2NSVM_QuadKer 12.07 2.01 0.062
FSVMWCS_GausKer 21.18 2.88 0.063
Clu_SVM 10.96 0.55 0.048
Dagher's QSVM 11.24 2.33 0.087
SQSSVM 10.87 1.96 0.056
FNKSVM-FWS 8.50 0.51 0.083
Table 5.  Robustness of Models on Australian Credit Data
model mean of misclassification rates (%)
without outliers with outliers
LOG_REG 13.56 17.87
FFBP_NN 14.42 15.94
SVM_GausKer 15.00 15.80
W2NSVM_GausKer 14.87 15.65
W2NSVM_QuadKer 14.59 15.36
FSVMWCS_GausKer 14.63 18.43
Clu_SVM 14.34 17.84
Dagher's QSVM 26.42 53.21
SQSSVM 14.57 15.58
FNKSVM-FWS 11.96 12.61
model mean of misclassification rates (%)
without outliers with outliers
LOG_REG 13.56 17.87
FFBP_NN 14.42 15.94
SVM_GausKer 15.00 15.80
W2NSVM_GausKer 14.87 15.65
W2NSVM_QuadKer 14.59 15.36
FSVMWCS_GausKer 14.63 18.43
Clu_SVM 14.34 17.84
Dagher's QSVM 26.42 53.21
SQSSVM 14.57 15.58
FNKSVM-FWS 11.96 12.61
[1]

Reza Chaharpashlou, Abdon Atangana, Reza Saadati. On the fuzzy stability results for fractional stochastic Volterra integral equation. Discrete & Continuous Dynamical Systems - S, 2020  doi: 10.3934/dcdss.2020432

[2]

Bahaaeldin Abdalla, Thabet Abdeljawad. Oscillation criteria for kernel function dependent fractional dynamic equations. Discrete & Continuous Dynamical Systems - S, 2020  doi: 10.3934/dcdss.2020443

[3]

Mohammed Abdulrazaq Kahya, Suhaib Abduljabbar Altamir, Zakariya Yahya Algamal. Improving whale optimization algorithm for feature selection with a time-varying transfer function. Numerical Algebra, Control & Optimization, 2021, 11 (1) : 87-98. doi: 10.3934/naco.2020017

[4]

Xin Guo, Lexin Li, Qiang Wu. Modeling interactive components by coordinate kernel polynomial models. Mathematical Foundations of Computing, 2020, 3 (4) : 263-277. doi: 10.3934/mfc.2020010

[5]

Yifan Chen, Thomas Y. Hou. Function approximation via the subsampled Poincaré inequality. Discrete & Continuous Dynamical Systems - A, 2021, 41 (1) : 169-199. doi: 10.3934/dcds.2020296

[6]

Xiyou Cheng, Zhitao Zhang. Structure of positive solutions to a class of Schrödinger systems. Discrete & Continuous Dynamical Systems - S, 2020  doi: 10.3934/dcdss.2020461

[7]

Shiqiu Fu, Kanishka Perera. On a class of semipositone problems with singular Trudinger-Moser nonlinearities. Discrete & Continuous Dynamical Systems - S, 2020  doi: 10.3934/dcdss.2020452

[8]

Meilan Cai, Maoan Han. Limit cycle bifurcations in a class of piecewise smooth cubic systems with multiple parameters. Communications on Pure & Applied Analysis, 2021, 20 (1) : 55-75. doi: 10.3934/cpaa.2020257

[9]

Lingfeng Li, Shousheng Luo, Xue-Cheng Tai, Jiang Yang. A new variational approach based on level-set function for convex hull problem with outliers. Inverse Problems & Imaging, , () : -. doi: 10.3934/ipi.2020070

[10]

Yolanda Guerrero–Sánchez, Muhammad Umar, Zulqurnain Sabir, Juan L. G. Guirao, Muhammad Asif Zahoor Raja. Solving a class of biological HIV infection model of latently infected cells using heuristic approach. Discrete & Continuous Dynamical Systems - S, 2020  doi: 10.3934/dcdss.2020431

[11]

Zedong Yang, Guotao Wang, Ravi P. Agarwal, Haiyong Xu. Existence and nonexistence of entire positive radial solutions for a class of Schrödinger elliptic systems involving a nonlinear operator. Discrete & Continuous Dynamical Systems - S, 2020  doi: 10.3934/dcdss.2020436

[12]

Mengni Li. Global regularity for a class of Monge-Ampère type equations with nonzero boundary conditions. Communications on Pure & Applied Analysis, 2021, 20 (1) : 301-317. doi: 10.3934/cpaa.2020267

[13]

Vieri Benci, Marco Cococcioni. The algorithmic numbers in non-archimedean numerical computing environments. Discrete & Continuous Dynamical Systems - S, 2020  doi: 10.3934/dcdss.2020449

[14]

Héctor Barge. Čech cohomology, homoclinic trajectories and robustness of non-saddle sets. Discrete & Continuous Dynamical Systems - A, 2020  doi: 10.3934/dcds.2020381

[15]

Ying Lin, Qi Ye. Support vector machine classifiers by non-Euclidean margins. Mathematical Foundations of Computing, 2020, 3 (4) : 279-300. doi: 10.3934/mfc.2020018

[16]

Sergey Rashkovskiy. Hamilton-Jacobi theory for Hamiltonian and non-Hamiltonian systems. Journal of Geometric Mechanics, 2020, 12 (4) : 563-583. doi: 10.3934/jgm.2020024

[17]

Noufel Frikha, Valentin Konakov, Stéphane Menozzi. Well-posedness of some non-linear stable driven SDEs. Discrete & Continuous Dynamical Systems - A, 2021, 41 (2) : 849-898. doi: 10.3934/dcds.2020302

[18]

Yangrong Li, Shuang Yang, Qiangheng Zhang. Odd random attractors for stochastic non-autonomous Kuramoto-Sivashinsky equations without dissipation. Electronic Research Archive, 2020, 28 (4) : 1529-1544. doi: 10.3934/era.2020080

[19]

Pengyu Chen. Non-autonomous stochastic evolution equations with nonlinear noise and nonlocal conditions governed by noncompact evolution families. Discrete & Continuous Dynamical Systems - A, 2020  doi: 10.3934/dcds.2020383

[20]

Lin Shi, Xuemin Wang, Dingshi Li. Limiting behavior of non-autonomous stochastic reaction-diffusion equations with colored noise on unbounded thin domains. Communications on Pure & Applied Analysis, 2020, 19 (12) : 5367-5386. doi: 10.3934/cpaa.2020242

2019 Impact Factor: 1.366

Metrics

  • PDF downloads (56)
  • HTML views (579)
  • Cited by (0)

Other articles
by authors

[Back to Top]