Aykırı değer tespitinde yoğunluk tabanlı kümeleme yöntemleri

Tekbir, Mennan

dc.contributor.advisor	Varlı, Songül
dc.contributor.author	Tekbir, Mennan
dc.date.accessioned	2020-12-29T09:58:40Z
dc.date.available	2020-12-29T09:58:40Z
dc.date.submitted	2009
dc.date.issued	2018-08-06
dc.identifier.uri	https://acikbilim.yok.gov.tr/handle/20.500.12812/389123
dc.description.abstract	Fraud that causes high amounts of finance loss, has became one of the serious problems. Either proactive efforts that focuses on prevention of fraud or working on fraud detection always use data mining approaches.Outlier detection, which is one of the data mining studies, detects objects that has different behavior in similar elements. These elements are usually nominated to be fraudulent elements. Clustering methods are mostly used for outlier detection. Clustering algorithms that are sensitive to noise or the inconsistent elements, are playing an active role in the detection of fraudulent behavior.Clustering is one of the data mining methods that is used for the unsupervised analysis of the data. Especially, if the data has not enough information(foreknowledge), similar data is grouped by the help of the clustering methods. DBSCAN, which is the one of the density-based clustering methods, does the process of clustering, according to density of data.Although DBSCAN method seems effective in the small data sets, its efficiency decreases with the growing of data volumes. Because of this reason, DBSCAN as a clustering method is not considered a suitable clustering method for large data sets.In the scope of this thesis, R-P-DBSCAN (Recursive-Partitioned DBSCAN) algorithm is proposed. The new algorithm is based on partitioning & combining and DBSCAN algorithm is used for data clustering. Large-volume data sets are divided into smaller pieces and clustered by DBSCAN. Then, combining each clustered piece, until whole set of data is clustered. Each cluster obtained by R-P-DBSCAN, is the same as the clusters obtained with the classical DBSCAN.The results obtained with R-P-DBSCAN have shown that, the proposed algorithm has better clustering performance (until 85%) according to classical DBSCAN algorithm
dc.description.abstract	Fraud that causes high amounts of finance loss, has became one of the serious problems. Either proactive efforts that focuses on prevention of fraud or working on fraud detection always use data mining approaches.Outlier detection, which is one of the data mining studies, detects objects that has different behavior in similar elements. These elements are usually nominated to be fraudulent elements. Clustering methods are mostly used for outlier detection. Clustering algorithms that are sensitive to noise or the inconsistent elements, are playing an active role in the detection of fraudulent behavior.Clustering is one of the data mining methods that is used for the unsupervised analysis of the data. Especially, if the data has not enough information(foreknowledge), similar data is grouped by the help of the clustering methods. DBSCAN, which is the one of the density-based clustering methods, does the process of clustering, according to density of data.Although DBSCAN method seems effective in the small data sets, its efficiency decreases with the growing of data volumes. Because of this reason, DBSCAN as a clustering method is not considered a suitable clustering method for large data sets.In the scope of this thesis, R-P-DBSCAN (Recursive-Partitioned DBSCAN) algorithm is proposed. The new algorithm is based on partitioning & combining and DBSCAN algorithm is used for data clustering. Large-volume data sets are divided into smaller pieces and clustered by DBSCAN. Then, combining each clustered piece, until whole set of data is clustered. Each cluster obtained by R-P-DBSCAN, is the same as the clusters obtained with the classical DBSCAN.The results obtained with R-P-DBSCAN have shown that, the proposed algorithm has better clustering performance (until 85%) according to classical DBSCAN algorithm	en_US
dc.language	Turkish
dc.language.iso	tr
dc.rights	info:eu-repo/semantics/openAccess
dc.rights	Attribution 4.0 United States	tr_TR
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject	Bilgisayar Mühendisliği Bilimleri-Bilgisayar ve Kontrol	tr_TR
dc.subject	Computer Engineering and Computer Science and Control	en_US
dc.title	Aykırı değer tespitinde yoğunluk tabanlı kümeleme yöntemleri
dc.title.alternative	Density-based clustering methods for outlier detection
dc.type	masterThesis
dc.date.updated	2018-08-06
dc.contributor.department	Bilgisayar Mühendisliği Anabilim Dalı
dc.identifier.yokid	352846
dc.publisher.institute	Fen Bilimleri Enstitüsü
dc.publisher.university	YILDIZ TEKNİK ÜNİVERSİTESİ
dc.identifier.thesisid	243932
dc.description.pages	70
dc.publisher.discipline	Diğer

Files in this item

Name:: yokAcikBilim_352846.pdf
Size:: 5.765Mb
Format:: PDF
Description:: File_352846

View/Open

This item appears in the following Collection(s)

TEZLER

Show simple item record

Except where otherwise noted, this item's license is described as info:eu-repo/semantics/openAccess