Aykırı değer tespitinde yoğunluk tabanlı kümeleme yöntemleri
dc.contributor.advisor | Varlı, Songül | |
dc.contributor.author | Tekbir, Mennan | |
dc.date.accessioned | 2020-12-29T09:58:40Z | |
dc.date.available | 2020-12-29T09:58:40Z | |
dc.date.submitted | 2009 | |
dc.date.issued | 2018-08-06 | |
dc.identifier.uri | https://acikbilim.yok.gov.tr/handle/20.500.12812/389123 | |
dc.description.abstract | Fraud that causes high amounts of finance loss, has became one of the serious problems. Either proactive efforts that focuses on prevention of fraud or working on fraud detection always use data mining approaches.Outlier detection, which is one of the data mining studies, detects objects that has different behavior in similar elements. These elements are usually nominated to be fraudulent elements. Clustering methods are mostly used for outlier detection. Clustering algorithms that are sensitive to noise or the inconsistent elements, are playing an active role in the detection of fraudulent behavior.Clustering is one of the data mining methods that is used for the unsupervised analysis of the data. Especially, if the data has not enough information(foreknowledge), similar data is grouped by the help of the clustering methods. DBSCAN, which is the one of the density-based clustering methods, does the process of clustering, according to density of data.Although DBSCAN method seems effective in the small data sets, its efficiency decreases with the growing of data volumes. Because of this reason, DBSCAN as a clustering method is not considered a suitable clustering method for large data sets.In the scope of this thesis, R-P-DBSCAN (Recursive-Partitioned DBSCAN) algorithm is proposed. The new algorithm is based on partitioning & combining and DBSCAN algorithm is used for data clustering. Large-volume data sets are divided into smaller pieces and clustered by DBSCAN. Then, combining each clustered piece, until whole set of data is clustered. Each cluster obtained by R-P-DBSCAN, is the same as the clusters obtained with the classical DBSCAN.The results obtained with R-P-DBSCAN have shown that, the proposed algorithm has better clustering performance (until 85%) according to classical DBSCAN algorithm | |
dc.description.abstract | Fraud that causes high amounts of finance loss, has became one of the serious problems. Either proactive efforts that focuses on prevention of fraud or working on fraud detection always use data mining approaches.Outlier detection, which is one of the data mining studies, detects objects that has different behavior in similar elements. These elements are usually nominated to be fraudulent elements. Clustering methods are mostly used for outlier detection. Clustering algorithms that are sensitive to noise or the inconsistent elements, are playing an active role in the detection of fraudulent behavior.Clustering is one of the data mining methods that is used for the unsupervised analysis of the data. Especially, if the data has not enough information(foreknowledge), similar data is grouped by the help of the clustering methods. DBSCAN, which is the one of the density-based clustering methods, does the process of clustering, according to density of data.Although DBSCAN method seems effective in the small data sets, its efficiency decreases with the growing of data volumes. Because of this reason, DBSCAN as a clustering method is not considered a suitable clustering method for large data sets.In the scope of this thesis, R-P-DBSCAN (Recursive-Partitioned DBSCAN) algorithm is proposed. The new algorithm is based on partitioning & combining and DBSCAN algorithm is used for data clustering. Large-volume data sets are divided into smaller pieces and clustered by DBSCAN. Then, combining each clustered piece, until whole set of data is clustered. Each cluster obtained by R-P-DBSCAN, is the same as the clusters obtained with the classical DBSCAN.The results obtained with R-P-DBSCAN have shown that, the proposed algorithm has better clustering performance (until 85%) according to classical DBSCAN algorithm | en_US |
dc.language | Turkish | |
dc.language.iso | tr | |
dc.rights | info:eu-repo/semantics/openAccess | |
dc.rights | Attribution 4.0 United States | tr_TR |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | |
dc.subject | Bilgisayar Mühendisliği Bilimleri-Bilgisayar ve Kontrol | tr_TR |
dc.subject | Computer Engineering and Computer Science and Control | en_US |
dc.title | Aykırı değer tespitinde yoğunluk tabanlı kümeleme yöntemleri | |
dc.title.alternative | Density-based clustering methods for outlier detection | |
dc.type | masterThesis | |
dc.date.updated | 2018-08-06 | |
dc.contributor.department | Bilgisayar Mühendisliği Anabilim Dalı | |
dc.identifier.yokid | 352846 | |
dc.publisher.institute | Fen Bilimleri Enstitüsü | |
dc.publisher.university | YILDIZ TEKNİK ÜNİVERSİTESİ | |
dc.identifier.thesisid | 243932 | |
dc.description.pages | 70 | |
dc.publisher.discipline | Diğer |