Müzikal enstrümanların doğrusal ayırtaç analizi yöntemiyle ayırt edilmesi

Yavuz Çayci, Çiğdem

View/Open

File_9012977 (3.350Mb)

Date

2007

Author

Yavuz Çayci, Çiğdem

Metadata

Show full item record

Abstract

Bu çalısmada birden fazla enstrümanla çalınan bir eserdeki Türk Müzigienstrümanlarının neler olduguna, insan duyma sisteminin yeteneklerine yakınveya esit ölçüde karar verebilecek bir model olusturulması amaçlanmıstır. Bununiçin dört farklı Türk Müzigi enstrümanın on bes ayrı kombinasyonundanyararlanılmıstır. İnsan algı yapısı örneginde oldugu gibi birden fazla enstrümanlaçalınan eserdeki enstrümanlar birbirinden ayrıstırılmadan, bir bütün olarak elealınmıs; sesin spektral, zamana baglı ve kepstral özelliklerinden yararlanılmıstır.Bu özelliklerin enstrümanları tanımadaki basarısı tek tek, gruplar halinde ve birbütün olarak incelenmis, çıkan sonuçların, enstrümanları ayırt etmedekiperformansları degerlendirilmistir.Ses ayırt etme algoritmalarının uygulanmasında karsılasılan en büyüksorunlardan birisi ayrım isleminde kullanılan çok boyutlu verilerin islenmesisırasında kaçınılmaz olarak ortaya çıkan islemsel yüktür. Bu çalısmada çoksayıdaki özelligin yaratacagı islemsel yükü azaltmak amacıyla en önemliözelliklerin kesfedilip, verimsiz özelliklerin elenmesi için yüksek boyuttan dahadüsük boyuta dogrusal dönüsüm elde edilmesini saglayan Dogrusal Ayırtaç Analizi(Linear Discriminant Analysis - LDA) yöntemi kullanılmıstır. Özellik vektörlerininenstrümanları tanımadaki basarısı LDA'nın uygulandıgı ve uygulanmadıgıdurumlarda ölçülmüs, LDA'nın özellik vektörlerini ayrımın en iyi yapılabilecegiboyuta yansıtabilme kabiliyeti sayesinde ayırt etme isleminde hesaplama kolaylıgıgetirdigi gözlenmistir.Tanıma isleminde sınıflandırma algoritmasının seçimi sistemin performansınıdogrudan etkiler. Bu çalısmada enstrümanların ayırt edilmesinde sınıflandırmaalgoritması olarak, Gauscul Karısım Yogunlugu Modeli (Gaussian Mixture Model -GMM) ve k-En Yakın Komsu (k-Nearest Neighbour ? k-NN) Metodu kullanılmıs vebu iki yöntem verimlilik ve basarı oranı açısından birbiriyle kıyaslanmıstır. Gausculkarısım yogunlugu modelinde, Gauscul bilesenlerin sayısındaki degisimin sistemperformansı üzerindeki etkisi incelenmistir.Egitim asamasında iki farklı eserin herbiri 10'ar saniye uzunlugunda toplam 44parçaya ayrılmıs bölümleri kullanılmıstır. Test asamasında da, test edilecek eser10'ar saniyelik bölümlere ayrılarak herbir bölümün hangi enstrümana ait oldugubelirlenmis ve toplam oran 10'ar saniyelik bölümlerden elde edilen sonuçlarınagırlıklı ortalaması alınarak hesaplanmıstır. MFCC (Mel Frekansı KepstralKatsayıları), delta MFCC ve delta delta MFCC'nin GMM algoritmasıylasınıflandırılması sonucu %88.77 basarı oranı elde edilirken; spektral, zamanbölgesi ve kepstral özellik vektörlerinin, GMM algoritmasıyla sınıflandırmasısonucunda %78.54`e varan ayırt etme performansı gözlenmistir. Yine buözelliklerin, LDA ile daha düsük boyuta indirildikten sonra sınıflandırılması ile eldeedilen sonuçlarda performansın %90.93 oldugu görülmüstür.Anahtar Kelimeler: Enstrüman Tanıma, Polifonik Müzik, Gauscul KarısımYogunlugu Modeli (GMM), k-En Yakın Komsu Metodu (k-NN), Dogrusal AyırtaçAnalizi (LDA), Spektral Özellikler, Zaman Bölgesi Özellikleri, Kepstral Katsayılar.

In this work, it is aimed to make a model that can decide which instruments areplaying in a multi instrumental Turkish Music as good as the human auditorysystem. Fifteen combinations of four different Turkish Music instruments are usedfor this purpose. Like in the example of human perception structure, theinstruments in multi instrumental music are analysed as a whole without beingseperated. Spectral, time domain and cepstral features of sound are used. Therecognition performance of these properties are analysed one by one, group bygroup and as a whole.One of the most important problems in sound recognition algorithms is thecomputational complexity which occurs during the processing of multi dimensionaldata used in recognition of instruments. To decrease the computationalcomplexity, Linear Discriminant Analysis (LDA), which provides the featuretransformation from higher dimension to lower dimension by discovering the mostimportant features and eliminating uneffective ones, is used. The success rates offeature vectors are observed in both cases when LDA is applied and is not appliedthen, as a result of these works, having the capability of transforming featurevectors to a low dimensional vector space where separation can be applied best,LDA provides a good computational performance in recognition process .The choice of classification algorithm affects the system performance directly. Inthis work, in recognition of instruments, Gaussian Mixture Model (GMM) and k ?Nearest Neighbour (k-NN) methods are used as classification algorithms andthese two methods are compared according to their success rates andcomputational efficiency. In Gaussian Mixture Model, the effect of changing thenumber of mixtures of Gaussian components on system performance areobserved.In training stage, 44 different parts of two different songs, having ten secondduration each, is used. In testing stage, the test song is divided into parts of tenseconds and the total success rate is calculated by using weighted average ofsuccess rates of the 10 second parts success rates. While having a success rateof 88.77% as a result of classifying MFCC (Mel Frequency Cepstral Coefficients),delta MFCC and delta delta MFCC with GMM algorithm, recognition performanceof 78.54% is observed by classifying spectral, time domain and cepstral featureswith GMM. Using the same features with LDA, it is observed that the success rateof 90.93% can be obtained after classification.Key Word: Instrument Recognition, Polyphonic Music, Gaussian Mixture Model(GMM), k-Nearest Neighbour (k-NN), Linear Discriminant Analysis (LDA), SpectralFeatures, Time Domain Features, Cepstral Coefficients.

URI

https://acikbilim.yok.gov.tr/handle/20.500.12812/480161

Collections

TEZLER

Except where otherwise noted, this item's license is described as info:eu-repo/semantics/openAccess