Makine öğrenmesi algoritmalarına dayalı yazılım hata tahmini: Çevresel metriklerin etkisi

Odabaşi, Merve

dc.contributor.advisor	Gül, Ensar
dc.contributor.author	Odabaşi, Merve
dc.date.accessioned	2021-05-08T07:33:14Z
dc.date.available	2021-05-08T07:33:14Z
dc.date.submitted	2019
dc.date.issued	2019-11-08
dc.identifier.uri	https://acikbilim.yok.gov.tr/handle/20.500.12812/631408
dc.description.abstract	Bu çalışmada 2012-2019 yılları arasında makine öğrenmesi algoritmaları ile yazılım hata tahminleme konusunda, akademik ve sektörel alanda yapılan çalışmalar incelenmiştir. Yapılan inceleme sonucunda akademik alanda, yazılım test alanında makine öğrenmesi çalışmalarının trend olduğu görülmüştür. Word Quality Report'ta kaliteli verinin olmamasından dolayı sektörde makine öğrenmesi çalışmalarının kullanımının az olduğu belirtilmiştir. İlgili çalışmalarda, tahminlemeler esnasında sadece yazılımsal metriklerin kullanıldığı görülmüştür. Bu çalışmada ise, Yazılımsal metriklerin yanında, çevresel metriklerin de sonuç üzerinde etkili olup olmadığı, çevresel metriklerin kullanılarak makine öğrenmesi algoritmaları ile başarılı sonuç tahminleme yapılıp yapılamayacağı, hangi çevresel etkenlerin sonuç üzerinde ne kadar etkili olduğu araştırılmıştır. Çalışma için veri seti, 4 analist, 8 yazılım mühendisi ve 5 test uzmanından oluşan bir ekibinin yapmış olduğu 10 adet projenin çıktıları ile oluşturulmuştur. 36 adet metrik ile 6676 adet test durumu değerlendirilmiştir. Hatayı sadece hata olarak kabul etmenin yanında, hatanın önem derecesi ve test edilemeyecek olan senaryolarda hesaba katılmıştır. Toplamda dokuz adet sonuç tahminlemesi modelde kullanılmıştır. Ön işleme aşamasında PCA analizi yapılmış, 36 adet metrik içerisinden 12 adedinin sonuca etkili olduğu görülmüştür. Dört farklı algoritma ile modeller oluşturulup; karar ağacı algoritmasında 89/%, en yakın komşular algoritmasında 87/%, rastgele ormanlar algoritmasında 88/% ve naive bayes algoritmasında 91/% başarı elde edilmiştir. Sonuç olarak çevresel metriklerin de yazılım test sonucu tahminlemesinde etikili olduğu ve makine öğrenmesi algoritmaları ile kullanıldığında, yüksek oranda başarılı sonuç tahminlemesi yapılabileceği görülmüştür.
dc.description.abstract	According to the Word Quality report, the use of machine learning studies in the industrial field is rare due to the severe lack of quality data. Software metrics are generally utilized during software fault prediction in this field. In this study, besides the software metrics, the environmental metrics are also explored to see whether they also affect the results of machine learning and if they could, what would be the success rates and which environmental metrics are more effective on the results. The data set for this study was generated from combining various data from 10 projects that were produced a team of 4 analysts, 8 software engineers, and 5 test experts. 36 metrics and 6676 test cases in total were evaluated. The errors occurred in the test cases are not just considered as an error, their priority and cases that can not be tested are also taken into consideration. 9 fault level are employed in models. During the pre-processing phase, the PCA analysis is conducted, out of 36 metrics 12 are effective to the results. Models are created with four different algorithms which have achieved a success rate of; 89/% by the decision tree algorithm, 87/% by the nearest neighbors algorithm, 88/% by the random forests algorithm and 91/% by the Naive Bayes algorithm. In conclusion, it was observed that environmental metrics are indeed effective in software fault prediction and when applied with machine learning algorithms a high rate of success can be achieved.	en_US
dc.language	Turkish
dc.language.iso	tr
dc.rights	info:eu-repo/semantics/openAccess
dc.rights	Attribution 4.0 United States	tr_TR
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject	Bilgisayar Mühendisliği Bilimleri-Bilgisayar ve Kontrol	tr_TR
dc.subject	Computer Engineering and Computer Science and Control	en_US
dc.title	Makine öğrenmesi algoritmalarına dayalı yazılım hata tahmini: Çevresel metriklerin etkisi
dc.title.alternative	Software fault prediction based on machine learning algorithms: The effect of environmental metrics
dc.type	masterThesis
dc.date.updated	2019-11-08
dc.contributor.department	Bilgi Güvenliği Mühendisliği Ana Bilim Dalı
dc.subject.ytm	Machine learning
dc.subject.ytm	Software test
dc.subject.ytm	Software metrics
dc.subject.ytm	Software reliability
dc.subject.ytm	Error estimation
dc.identifier.yokid	10264599
dc.publisher.institute	Fen Bilimleri Enstitüsü
dc.publisher.university	İSTANBUL ŞEHİR ÜNİVERSİTESİ
dc.identifier.thesisid	575792
dc.description.pages	93
dc.publisher.discipline	Diğer

Files in this item

Name:: yokAcikBilim_10264599.pdf
Size:: 1.901Mb
Format:: PDF
Description:: File_10264599

View/Open

This item appears in the following Collection(s)

TEZLER

Show simple item record

Except where otherwise noted, this item's license is described as info:eu-repo/semantics/openAccess