Credit Risk Assessment Method of P2P Online Loan Borrowers Based on Deep Forest

Computer Science, 2021, Vol. 48, Issue (11A): 429-434. doi: 10.11896/jsjkx.201000013

• Image Processing & Multimedia Technology •

WANG Xiao-xiao1, WANG Ting-wen1, MA Yu-ling2, FAN Jia-yi3, CUI Chao-ran1   

  1 School of Computer Science and Technology, Shandong University of Finance and Economics, Jinan 250014, China
  2 School of Computer Science and Technology, Shandong Jianzhu University, Jinan 250101, China
  3 School of Business, Qingdao University, Qingdao, Shandong 266000, China
  • Online: 2021-11-10 Published: 2021-11-12
  • About author: WANG Xiao-xiao, born in 1996, postgraduate. Her main research interests include data mining.
    CUI Chao-ran, born in 1987, professor, is a member of China Computer Federation. His main research interests include information retrieval, multimedia, recommender systems and machine learning.
  • Supported by:
    National Natural Science Foundation of China (61701281, 62077033).

Abstract: P2P online lending is a financial business model that has emerged in recent years, offering a low investment threshold, convenient transactions and low financing costs. However, alongside its rapid growth, credit risk in the lending process has become increasingly prominent, and the steady stream of borrowers absconding or even committing fraud has cast a heavy shadow over the industry. To address this problem, a credit risk assessment method for P2P online loan borrowers based on deep forest is proposed. First, features are extracted from the borrower's basic information and historical loan information. Then, a deep forest model is constructed from multi-grained scanning and cascade forest modules to predict borrower default. In addition, the Gini index is used to calculate the feature importance scores of each random forest, and the Borda count method is used to rank and fuse these scores, giving a degree of interpretability to the model's predictions. On two public datasets, LendingClub and Paipaidai, the proposed method is compared with methods such as support vector machines, random forests, and wide and deep networks. Experiments show that the proposed method performs better, and that the feature importance scores are consistent with both intuition and objective understanding.

Key words: Credit risk assessment, Deep forest, Feature importance, Peer-to-peer lending, Unbalanced dataset

CLC Number: 

  • TP391
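
The interpretability step described in the abstract (Gini-based importance scores per forest, fused by Borda count) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the feature names and the importance values are hypothetical, and the positional scoring rule (the top of m features earns m-1 points) is one common Borda variant assumed here, since the abstract does not specify which one was used.

```python
from collections import defaultdict


def gini_impurity(labels):
    """Gini impurity of a list of class labels: 1 - sum_k p_k^2."""
    n = len(labels)
    if n == 0:
        return 0.0
    counts = defaultdict(int)
    for y in labels:
        counts[y] += 1
    return 1.0 - sum((c / n) ** 2 for c in counts.values())


def gini_decrease(parent, left, right):
    """Impurity decrease obtained by splitting `parent` into `left` and `right`.

    Summing these decreases over every split that uses a feature is the
    usual basis of a Gini-based feature importance score.
    """
    n = len(parent)
    weighted = (len(left) / n) * gini_impurity(left) \
        + (len(right) / n) * gini_impurity(right)
    return gini_impurity(parent) - weighted


def borda_fuse(score_lists):
    """Fuse several {feature: importance} dicts with a Borda count.

    Assumed rule: within each dict, the highest-scoring of m features
    earns m-1 points and the lowest earns 0. Ties are broken by name.
    Returns (feature, points) pairs sorted by descending points.
    """
    points = defaultdict(int)
    for scores in score_lists:
        ranked = sorted(scores, key=lambda f: (-scores[f], f))
        m = len(ranked)
        for position, feature in enumerate(ranked):
            points[feature] += m - 1 - position
    return sorted(points.items(), key=lambda kv: (-kv[1], kv[0]))


# Hypothetical importance scores from three forests (illustrative values only).
forest_scores = [
    {"interest_rate": 0.40, "loan_amount": 0.35, "age": 0.25},
    {"interest_rate": 0.30, "loan_amount": 0.45, "age": 0.25},
    {"interest_rate": 0.50, "loan_amount": 0.20, "age": 0.30},
]
fused = borda_fuse(forest_scores)
for feature, pts in fused:
    print(feature, pts)
```

Fusing ranks rather than raw scores means that forests whose importance values are on different scales still contribute equally to the final ordering, which is one plausible reason for choosing a Borda count here.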