Yapay sinir ağları ile kişilerin ses örneklerinden kimliklerinin tanınması

Cansiz, Murat

dc.contributor.advisor	Karamancıoğlu, Abdurrahman
dc.contributor.author	Cansiz, Murat
dc.date.accessioned	2020-12-02T09:31:47Z
dc.date.available	2020-12-02T09:31:47Z
dc.date.submitted	1997
dc.date.issued	2020-11-18
dc.identifier.uri	https://acikbilim.yok.gov.tr/handle/20.500.12812/25133
dc.description.abstract	ÖZET Bu tezde son yıllarda yaygın olarak kullanılan Yapay Sinir Ağları (YSA) yönteminin konuşan kişinin kimliğinin tanınması (speaker identification) amacıyla kullanımı uygulaması yapılmaktadır. Genelde yapay sinir ağlarının kullanıldığı ses teknolojisi ile yapılan uygulamalarda sesli ve sessiz sesler ile ve daha da ilerisi kelimelerin, cümlelerin ve sürekli konuşmanın tanınması için önceden yapılan çalışmalar temel olarak alınmıştır. Ses örneklerinin YSA da kullanılması için bu referans bilgilerden yararlanılarak sesin Matematiksel Modeli bir algoritma yardımıyla elde edilmiştir. Sesin matematiksel modelini elde ederken Doğrusal Öngörü Ana/izi (IPC -Lineer Predictive Coding-) ve DFT (Discrete Fourier Transfom) analizleri yapılmıştır. Bu analizler için de sesin bilgisayarda bulunan spektrumundan yararlanılmıştır. Bu uygulamada insanların kimliklerinin tanınması için sadece 'a' sesinden yararlanılmıştır, 'a' sesinin periyodik işaretlerin toplamından oluşmasından yararlanılarak her periyot için 'Hamming Pencereleme Metodu' ile 28 adet katsayı elde edilmiştir. Elde ettiğimiz bu 28 katsayıdan 16 tanesi DFT analizi ile 12 tanesi ise LPC analizi ile bulunmuştur. Her kişinin ses örnekleri ile elde edilen bu 28 adet katsayılar grubu YSA yöntemi olarak kullandığımız 'hatanın geriye yayılarak düzeltilmesi' (Backpropagation) algoritması ile çıkışta '0' ve '1' çıktısı verecek şekilde eğitilerek, konuşan kişinin tanınması uygulaması yapılmıştır. r YSA yöntemi ile ilgili bilgi ikinci bölümde verilmiş ve Backpropagation yönteminin algoritması anlatılmaya çalışılmıştır. Sesin matematiksel modellenmesi için kullanılan LPC ve DFT analizleri de ileriki bölümlerde tartışılacak ve yorumlanacaktır. Altıncı bölümde uygulamanın nasıl yapıldığı, son bölümde ise sonuçlar ve öneriler yer almaktadır. Vİİİ
dc.description.abstract	ABSTRACT In this thesis an artificial neural network (ANN) is implemented, which has been used frequently as an implementation model in recent years, to recognize speaker identification. In general, in the implementation through this kind of voice tecnology for the identification of vowel and consonant letters, evenmore words and speech are based on the previous studies. This kind of reference information were applied for the usage of voice patterns in the ANN. The analyses of lineer predictive coding and discrete fourier transform had been made to obtain the mathematical model of voice. For these analyses the sound of spectrums which have been in computer has been used. In this implementation only the letter of 'a' was applied to recognize the identifications of people. Using a backround and the hamming window method 28 coefficents have been obtained for each period. Of those 28 coefficents, 16 were obtained by the analysis of DFT and 12 were obtained by the analysis of LPC. These 28 coefficent groups which had been obtained from various speakers, have been used to have '0' or T as an output, depending on the speakers identity, by using Backpropagatbn algoritm. A backround on ANN have been given and the algoritm of Backpropagation Model's are explained in the second section. The analyses of LPC and DFT, which have been used for mathematical model of sounds, is discussed and interpreted in the following sections. In the sixth section, how the implementation have been made will take place. In the final section conclusions are presented. IX	en_US
dc.language	Turkish
dc.language.iso	tr
dc.rights	info:eu-repo/semantics/embargoedAccess
dc.rights	Attribution 4.0 United States	tr_TR
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject	Elektrik ve Elektronik Mühendisliği	tr_TR
dc.subject	Electrical and Electronics Engineering	en_US
dc.title	Yapay sinir ağları ile kişilerin ses örneklerinden kimliklerinin tanınması
dc.title.alternative	Recognizing speaker identification fron speech samples with artificial neural network
dc.type	masterThesis
dc.date.updated	2020-11-18
dc.contributor.department	Elektrik-Elektronik Mühendisliği Anabilim Dalı
dc.subject.ytm	Voice recognition systems
dc.subject.ytm	Artificial neural networks
dc.identifier.yokid	58162
dc.publisher.institute	Fen Bilimleri Enstitüsü
dc.publisher.university	AFYON KOCATEPE ÜNİVERSİTESİ
dc.identifier.thesisid	58162

Files in this item

Name:: yokAcikBilim_58162.pdf
Size:: 2.165Mb
Format:: PDF
Description:: File_58162

View/Open

This item appears in the following Collection(s)

TEZLER

Show simple item record

Except where otherwise noted, this item's license is described as info:eu-repo/semantics/embargoedAccess