Title Churn prediction in telecommunication industry using machine learning /
Translation of Title Klientų praradimo prognozavimas telekomunikacijų srityje naudojant mašininį mokymąsi.
Authors Latvaitis, Tautvydas
Full Text Download
Pages 23
Keywords [eng] Keywords: Customer churn, churn prediction, machine learning, Random Forest, XGBoost, Support Vector Machine Raktiniai žodžiai: Klientų praradimas, praradimo prognozavimas, mašininis mokymasis, Atsitiktinis Miškas, XGBoost, Atraminiai Vektoriai
Abstract [eng] The digitization and rapid growth of technological advancement has changed the way companies do business. With more and more services to chose from, consumer churning has become one of the biggest threats to all companies. In this paper we analyze a machine learning based churn prediction model for a telecommunication service provider. The main aim is to find the best churn prediction model and implement the model on real data. Three machine learning algorithms are compared: Random Forest classifier, XGBoost classifier and Support Vector Machine classifier. The models are evaluated using the metrics, accuracy, precision, recall, and F-scores. The prediction performance is tested on a data set provided by one of the telecommunication companies in Lithuania. It was found that the data is imbalanced with a majority of non-churners, so we also used two balancing techniques, Random Under-Sampling and Random Over-Sampling, to see if they improve the results of our models. We conclude that machine learning is a very useful process for customer churn prediction and that Random Forest and XGBoost models are better at predicting churn than the Support Vector Machine model.
Dissertation Institution Vilniaus universitetas.
Type Master thesis
Language English
Publication date 2022