Data mining methods linked to artificial intelligence applicable to credit risk

Patricia Jimbo Santana
https://orcid.org/0000-0001-7432-1622
Augusto Villa Monte
Laura Lanzarini
https://orcid.org/0000-0001-7027-7564
Aurelio Fernández Bariviera

Abstract

Financial institutions reduce their credit risk when they select their clients properly. Banks use different methodologies to classify clients according to the default risk they present; to do so, a set of personal variables is analyzed together with the financial situation of the credit applicant. The exhaustive analysis and processing of customer information takes a long time, one reason being that the data to be analyzed are not homogeneous. This paper presents an alternative method capable of constructing, from the available information, a set of classification rules with three main characteristics: adequate accuracy, low cardinality and ease of interpretation. The last of these comes from using a reduced number of attributes in the antecedent of each rule. This feature, together with the low cardinality of the rule set, makes it possible to identify patterns that are very useful for understanding the relationships in the data and for making decisions. A tool of this type is extremely useful when deciding whether to grant credit. The simpler the model, the fewer characteristics of the credit applicant must be analyzed, so decisions can be made more quickly. This makes the method attractive to credit officers in financial institutions, since it is possible to respond to the credit applicant in less time, which gives a competitive advantage. The proposed methodology has been applied to two databases known in the literature and to two real databases of Ecuadorian financial institutions, a Savings and Credit Cooperative and a Bank, which grant different types of loans and have branches in the Coast, Sierra and Oriente regions. The results obtained have been satisfactory. Finally, the conclusions are presented and future lines of research are suggested.
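
As a rough illustration of the three properties named in the abstract (accuracy, cardinality of the rule set, and antecedent length), the following minimal sketch evaluates a small ordered rule list on toy data. It does not reproduce the proposed method: the attribute names, thresholds, records, and helper names (`Rule`, `classify`, `summarize`) are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

Record = Dict[str, float]


@dataclass
class Rule:
    # Antecedent: (attribute, test) pairs; every test must hold for the rule to fire.
    antecedent: List[Tuple[str, Callable[[float], bool]]]
    label: str

    def matches(self, record: Record) -> bool:
        return all(test(record[attr]) for attr, test in self.antecedent)


def classify(rules: List[Rule], record: Record, default: str) -> str:
    # Rules are tried in order; the first one that fires decides the class.
    for rule in rules:
        if rule.matches(record):
            return rule.label
    return default


def summarize(rules: List[Rule], records: List[Record],
              labels: List[str], default: str) -> Dict[str, float]:
    hits = sum(classify(rules, r, default) == y for r, y in zip(records, labels))
    return {
        "accuracy": hits / len(records),
        "cardinality": len(rules),  # number of rules in the set
        "mean_antecedent_length": sum(len(r.antecedent) for r in rules) / len(rules),
    }


# Hypothetical rules and toy records (not taken from the paper's databases).
rules = [
    Rule([("debt_ratio", lambda v: v > 0.6)], "default"),
    Rule([("income", lambda v: v >= 1000),
          ("years_employed", lambda v: v >= 2)], "repaid"),
]
records = [
    {"debt_ratio": 0.7, "income": 1500, "years_employed": 5},
    {"debt_ratio": 0.2, "income": 1200, "years_employed": 3},
    {"debt_ratio": 0.3, "income": 800, "years_employed": 1},
    {"debt_ratio": 0.1, "income": 2000, "years_employed": 1},
]
labels = ["default", "repaid", "default", "repaid"]

summary = summarize(rules, records, labels, default="default")
print(summary)
```

In this toy setting a credit officer would need to inspect at most three attributes per applicant, which is the kind of simplicity the abstract associates with faster decisions.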

Article Details

How to Cite
Jimbo Santana, P., Villa Monte, A., Lanzarini, L., & Fernández Bariviera, A. (2017). Data mining methods linked to artificial intelligence applicable to credit risk. FIGEMPA: Investigación Y Desarrollo, 3(1), 98–106. https://doi.org/10.29166/revfig.v1i1.61
Section
Articles
Author Biographies

Patricia Jimbo Santana, Universidad Central del Ecuador. Quito, Ecuador

Facultad de Ciencias Administrativas. Carrera de Contabilidad y Auditoría. Quito – Ecuador.

Orcid: 0000-0001-7432-1622

Augusto Villa Monte, Universidad Nacional de La Plata. Buenos Aires, Argentina

Instituto de Investigación en Informática LIDI. Universidad Nacional de La Plata, Buenos Aires, Argentina.

Orcid: 0000-0002-9854-3083

Laura Lanzarini, Universidad Nacional de La Plata. Buenos Aires, Argentina

Instituto de Investigación en Informática LIDI. Universidad Nacional de La Plata, Buenos Aires, Argentina.

Orcid: 0000-0001-7027-7564

Aurelio Fernández Bariviera, Universitat Rovira i Virgili. Reus, Spain

Department of Business. Universitat Rovira i Virgili. Avenida de la Universitat, 1, Reus, Spain.

Orcid: 0000-0003-1014-1010
