An ensemble classification model for detecting voice phishing in telecommunication networks and its integration into a visual analysis tool

Çalık, Hüseyin Eren

An ensemble classification model for detecting voice phishing in telecommunication networks and its integration into a visual analysis tool

buir.advisor	Doğrusöz, Uğur
dc.contributor.author	Çalık, Hüseyin Eren
dc.date.accessioned	2022-09-20T06:29:37Z
dc.date.available	2022-09-20T06:29:37Z
dc.date.copyright	2022-09
dc.date.issued	2022-09
dc.date.submitted	2022-09-19
dc.description	Cataloged from PDF version of article.	en_US
dc.description	Includes bibliographical references (leaves 42-45).	en_US
dc.description.abstract	Voice phishing, a method of social engineering fraud performed over phone calls, has been a major problem globally since the use of phones became widespread. Traditional and modern methods to detect these fraud schemes include visual analysis of the customers’ behaviour, rule-based systems and machine learning models such as clustering, decision trees, shallow classifiers and deep learning models. Visual analysis depends only on human expertise and requires very high labor force to be effective. Rule-based systems are useful for extreme cases but are vulnerable to concept drifts. The-state-of-the-art methods generally utilize machine learning approaches. However, they require one or more of feature engineering done by experts, high computational power and privacy infringements. Therefore, in collaboration with Turkcell Technology, we aimed to develop a system that benefits from the advantages of the traditional methods while exploiting the effectiveness and efficiency of the state-of-the-art ones to tackle this issue. In doing so, we integrated an ensemble learning model to an existing visualization tool for detecting fraud users. This tool visualizes relational data as knowledge graphs, shows the informational data as texts and statistical data with charts and texts. Our ensemble learning model has two deep neural networks and one decision tree classifier. Multiple neural networks are used to reduce the variance and make a more stable model. One of them is composed of an input layer, two hidden layers with 200 nodes using Rectified Linear Unit (ReLU) activation function, each followed by a dropout layer and an output layer of one node with sigmoid activation function. We used dropout layers in this network to prevent over-fitting. The second neural network we built has 3 hidden layers instead with node numbers 64, 64 and 32, respectively, with ReLU as their activation function. To feed these models, a total of 34 features, 20 of which are raw, have been engineered with Turkcell fraud experts. The aggregation of the outputs is done by taking their average. We measured the success of our model by calculating the F1 Score as the class imbalance is high. Our model’s F1 score is 0.82 with a precision of 0.82 and a recall of 0.83. Also, with the integration of our model into this visualization tool, a framework was formed allowing mobile network operators to examine and detect fraud cases more efficiently and act accordingly.	en_US
dc.description.statementofresponsibility	by Hüseyin Eren Çalık	en_US
dc.format.extent	xiii, 50 leaves : charts (some color) ; 30 cm.	en_US
dc.identifier.itemid	B161321
dc.identifier.uri	http://hdl.handle.net/11693/110545
dc.language.iso	English	en_US
dc.rights	info:eu-repo/semantics/openAccess	en_US
dc.subject	Telecommunication networks	en_US
dc.subject	Call graphs	en_US
dc.subject	Visual network analysis	en_US
dc.subject	Graph visualization	en_US
dc.subject	Fraud detection	en_US
dc.subject	Machine learning	en_US
dc.subject	Deep neural networks	en_US
dc.title	An ensemble classification model for detecting voice phishing in telecommunication networks and its integration into a visual analysis tool	en_US
dc.title.alternative	Telekomünkasyon ağlarında sesli olta saldırılarının birleşik sınıflandırma modeli ile tespiti ve görsel analiz aracına entegrasyonu	en_US
dc.type	Thesis	en_US
thesis.degree.discipline	Computer Engineering
thesis.degree.grantor	Bilkent University
thesis.degree.level	Master's
thesis.degree.name	MS (Master of Science)

Files

Original bundle

Now showing 1 - 1 of 1

Name:: B161321.pdf
Size:: 1.5 MB
Format:: Adobe Portable Document Format
Description:: Full printable version

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.69 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Graduate School of Engineering and Science