`R` programlama dilinde tahmin edici veri madenciliği algoritmalarının modellenmesi ve performanslarının karşılaştırılması

Can, Şengül

dc.contributor.advisor	Gerşil, Mustafa
dc.contributor.author	Can, Şengül
dc.date.accessioned	2023-09-22T11:37:38Z
dc.date.available	2023-09-22T11:37:38Z
dc.date.submitted	2022-10-19
dc.date.issued	2022
dc.identifier.uri	https://acikbilim.yok.gov.tr/handle/20.500.12812/734429
dc.description.abstract	Günümüz ekonomisi dinamik bir yapıdadır. Gelişen bilgi teknolojileriyle birlikte kayıt altında tutulan veri sayısı da artmıştır. Artan veri miktarı, veriler arasındaki sarmal ilişkileri görmeyi zorlaştırmaktadır. Ham veriden bilgi elde edilmesi ve elde edilen bilginin gelecek tahminlerinde kullanılması işletmeler için kritik öneme sahiptir. Veri tahmini kesinlik içermeyen ve karmaşık bir süreçtir. Ancak doğruya en yakın tahmin işletmelerin stratejik karar almasında oldukça önemlidir. Veri tahmini ekonomi alanında yaygın olarak kullanılmaktadır. Bu çalışmada ekonomik kalkınma için büyük öneme sahip ihracat verileri incelenmiştir. Türkiye İstatistik Kurumu ve Merkez Bankası istatistikleri kullanılarak veri ambarı oluşturulmuştur. İstatistiksel analizlerde sıklıkla tercih edilen R programlama dili kullanılarak algoritmalar geliştirilmiştir. R programlama dilinde yapay sinir ağı, regresyon ve zaman serisi algoritmaları geliştirilmiştir. Çalışmanın birinci aşamasında; R programında yapay sinir ağı algoritması geliştirilmiştir. Bu aşamada farklı ağ topolojileri test edilerek en başarılı yapay sinir ağı belirlenmiştir. Buna göre (5,3) topolojisine sahip ağın en başarılı performansa sahip olduğu görülmüştür. Çalışmanın ikinci aşamasında R programında regresyon algoritması geliştirilmiştir. Çalışmanın son aşamasında R programında zaman serisi algoritması geliştirilmiştir. Naive Bayes ve ARIMA modelleri test edilmiş ve ARIMA(1,0,0)(2,0,0) modelinin daha başarılı olduğu görülmüştür. Yapay sinir ağı (5,3), regresyon ve ARIMA(1,0,0)(2,0,0) algoritmaları veri ambarındaki eğitim verisi üzerinde denenmiştir. Algoritmaların başarıları istatistiksel hata oranları hesaplanarak karşılaştırılmıştır. Buna göre en başarılı tahmin algoritmasının yapay sinir ağı olduğu görülmüştür.
dc.description.abstract	Today's economy is dynamic. With the developing information technologies, the number of recorded data has also increased. The increasing amount of data makes it difficult to see the relationships among the data. Obtaining information from raw data and using the obtained information in future predictions are of critical importance for businesses. Data estimation is an imprecise and complex process. However, estimation which is the closest to right is very important for businesses to make strategic decision. Data forecasting is widely used in economics.In this study, export data, which is of great importance for economic development, was examined. A data warehouse was created using statistics from the Turkish Statistical Institute and The Central Bank of the Republic of Turkey. Algorithms were developed using the R programming language, which is frequently preferred in statistical analysis. Artificial neural network, regression and time series algorithms were developed in R programming language. In the first stage of the study, an artificial neural network algorithm was developed in the R program. At this stage, different network topologies were tested and the most successful artificial neural network was determined. Accordingly, it was detected that the network with the (5,3) topology had the most successful performance. In the second stage of the study, a regression algorithm was developed in the R program. In the last stage of the study, the time series algorithm was developed in the R program. Naive Bayes and ARIMA models were tested and it was detected that the ARIMA(1,0,0)(2,0,0) model was more successful. Artificial neural network (5,3), regression and ARIMA(1,0,0)(2,0,0) algorithms were tested for the training data in the data warehouse. The success of the algorithms was compared by calculating the statistical error rates. Accordingly, it was concluded that the most successful prediction algorithm was the artificial neural network.	en_US
dc.language	Turkish
dc.language.iso	tr
dc.rights	info:eu-repo/semantics/openAccess
dc.rights	Attribution 4.0 United States	tr_TR
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject	İstatistik	tr_TR
dc.subject	Statistics	en_US
dc.subject	İşletme	tr_TR
dc.subject	Business Administration	en_US
dc.title	`R` programlama dilinde tahmin edici veri madenciliği algoritmalarının modellenmesi ve performanslarının karşılaştırılması
dc.title.alternative	Modeling of predictive data mining algorithms in the `R` programming language and comparison of their performances
dc.type	doctoralThesis
dc.date.updated	2022-10-19
dc.contributor.department	İşletme Ana Bilim Dalı
dc.identifier.yokid	10314101
dc.publisher.institute	Sosyal Bilimler Enstitüsü
dc.publisher.university	MANİSA CELÂL BAYAR ÜNİVERSİTESİ
dc.identifier.thesisid	734837
dc.description.pages	151
dc.publisher.discipline	İşletme Bilim Dalı

Files in this item

Files	Size	Format	View
There are no files associated with this item.

This item appears in the following Collection(s)

TEZLER

Show simple item record

Except where otherwise noted, this item's license is described as info:eu-repo/semantics/openAccess