Kümeleme problemi için geçerlilik indeksleri üzerine

Sarı, Büşra

dc.contributor.advisor	Ordin, Burak
dc.contributor.author	Sarı, Büşra
dc.date.accessioned	2023-09-22T12:26:32Z
dc.date.available	2023-09-22T12:26:32Z
dc.date.submitted	2023-01-03
dc.date.issued	2022
dc.identifier.uri	https://acikbilim.yok.gov.tr/handle/20.500.12812/740456
dc.description.abstract	Veri kümeleme problemi mühendislikte, tıpta, ekonomide vb. pek çok alanda önemli uygulamalara sahip bir kombinatoryal optimizasyon problemidir. Veri kümeleme probleminde amaç küme içi benzerliğin enbüyüklenip, kümeler arası benzerliğin enküçüklenmesidir. Buna görede modellediği gerçek hayat problemi için gözle görülemeyen/gizli kalmış desenlerin ortaya çıkartılması amaçlanır.Geçmişten günümüze veri kümeleme probleminin çözümü için pek çok çözüm yöntemi önerilmiştir. Geliştirilen/geliştirilmekte olan yöntemlerin ilgili alanda ne kadar geçerliliğe sahip sonuçlar ürettiğini değerlendirmek zor ve önemli bir süreçtir. Literatürde bu değerlendirme süreci için çeşitli kümeleme geçerlilik indeksleri önerilmiştir. Herhangi bir kümeleme problemi üzerinde uygulanan çözüm algoritmasının sonucunun bir geçerlilik indeksi tarafından ne kadar geçerli olduğunun belirlenmesi kesin sınırları olmayan açık uçlu bir konudur.Bu tez çalışmasında, kümeleme probleminin çözümü için literatürde yeralan bazı küme geçerlilik indeksleri incelenmiştir. Bunun yanısıra dört gerçek veriseti üzerinde k-means ve global k-means algoritması uygulanarak elde edilen sonuçlar üzerinde içsel değerlendirme ölçülerinden Davies Bouldin ve dışsal değerlendirme ölçülerinden F-Ölçümü kullanılarak hesaplama denemelerinin sonuçları analiz edilmiştir.
dc.description.abstract	Data clustering problem can be found in engineering, medicine, economics etc. It is a combinatorial optimization problem with important applications in many fields. The purpose of the data clustering problem is to maximize the similarity within the cluster and to minimize the similarity between the clusters. Accordingly, it is aimed to reveal invisible/hidden patterns for the real life problem it models. Many solution methods have been proposed to solve the data clustering problem from past to present. It is a difficult and important process to evaluate the validity of the developed/under development methods in the relevant field. Various clustering validity indices have been proposed for this evaluation process in the literature. Determining how valid the result of the solution algorithm applied on any clustering problem is by a validity index is an open-ended issue with no clear boundaries. In this thesis, some clustering validity indexes in the literature were examined to solve the clustering problem. In addition, the results of the calculation trials were analyzed by using Davies Bouldin from internal evaluation measures and F-Measure from external evaluation measures on the results obtained by applying k-means and global k-means algorithm on four real datasets.	en_US
dc.language	Turkish
dc.language.iso	tr
dc.rights	info:eu-repo/semantics/openAccess
dc.rights	Attribution 4.0 United States	tr_TR
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject	Matematik	tr_TR
dc.subject	Mathematics	en_US
dc.title	Kümeleme problemi için geçerlilik indeksleri üzerine
dc.title.alternative	On validity indexes for the clustering problem
dc.type	masterThesis
dc.date.updated	2023-01-03
dc.contributor.department	Matematik Ana Bilim Dalı
dc.subject.ytm	Mathematics
dc.identifier.yokid	10333691
dc.publisher.institute	Fen Bilimleri Enstitüsü
dc.publisher.university	EGE ÜNİVERSİTESİ
dc.identifier.thesisid	760551
dc.description.pages	59
dc.publisher.discipline	Bilgisayar Bilimleri Bilim Dalı

Files in this item

Files	Size	Format	View
There are no files associated with this item.

This item appears in the following Collection(s)

TEZLER

Show simple item record

Except where otherwise noted, this item's license is described as info:eu-repo/semantics/openAccess