dc.contributor.advisor: Varlı, Songül
dc.contributor.author: Tekbir, Mennan
dc.date.accessioned: 2020-12-29T09:58:40Z
dc.date.available: 2020-12-29T09:58:40Z
dc.date.submitted: 2009
dc.date.issued: 2018-08-06
dc.identifier.uri: https://acikbilim.yok.gov.tr/handle/20.500.12812/389123
dc.description.abstract: Fraud, which causes large financial losses, has become a serious problem. Both proactive efforts that focus on preventing fraud and work on detecting it always use data mining approaches. Outlier detection, one area of data mining, identifies objects that behave differently from otherwise similar elements. Such objects are usually candidates for being fraudulent. Clustering methods are widely used for outlier detection. Clustering algorithms that are sensitive to noise or inconsistent elements play an active role in detecting fraudulent behavior. Clustering is a data mining method used for the unsupervised analysis of data. In particular, when little prior knowledge about the data is available, similar data is grouped with the help of clustering methods. DBSCAN, one of the density-based clustering methods, clusters data according to its density. Although DBSCAN appears effective on small data sets, its efficiency decreases as data volume grows. For this reason, DBSCAN is not considered suitable for large data sets. In this thesis, the R-P-DBSCAN (Recursive-Partitioned DBSCAN) algorithm is proposed. The new algorithm is based on partitioning and combining, and it uses DBSCAN to cluster the data. Large data sets are divided into smaller pieces, and each piece is clustered by DBSCAN. The clustered pieces are then combined until the whole data set is clustered. The clusters obtained by R-P-DBSCAN are the same as those obtained with classical DBSCAN. The results show that the proposed algorithm achieves better clustering performance (up to 85%) than the classical DBSCAN algorithm. [en_US]
dc.language: Turkish
dc.language.iso: tr
dc.rights: info:eu-repo/semantics/openAccess
dc.rights: Attribution 4.0 United States [tr_TR]
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/
dc.subject: Bilgisayar Mühendisliği Bilimleri-Bilgisayar ve Kontrol [tr_TR]
dc.subject: Computer Engineering and Computer Science and Control [en_US]
dc.title: Aykırı değer tespitinde yoğunluk tabanlı kümeleme yöntemleri
dc.title.alternative: Density-based clustering methods for outlier detection
dc.type: masterThesis
dc.date.updated: 2018-08-06
dc.contributor.department: Bilgisayar Mühendisliği Anabilim Dalı
dc.identifier.yokid: 352846
dc.publisher.institute: Fen Bilimleri Enstitüsü
dc.publisher.university: YILDIZ TEKNİK ÜNİVERSİTESİ
dc.identifier.thesisid: 243932
dc.description.pages: 70
dc.publisher.discipline: Diğer


Except where otherwise noted, this item's license is described as info:eu-repo/semantics/openAccess