Spektral ters entropi özniteliğinin ağırlıklı bulanık kümelenmesi ile ses verilerinin bölütlenmesi

Jabbari, Nasim

View/Open

File_10055570 (3.868Mb)

Date

2014

Author

Jabbari, Nasim

Metadata

Show full item record

Abstract

Müzik sesinden elde edilen sayısal ses verileri çok karmaşık yapılar ve çeşitli stiller içermektedir. Bu çalışmadaki amaç, bulanık kümeleme yöntemiyle ses verisini içeriğine göre bölütlere ayırarak her notanın bölgesini kestirmektir. Literatürde bölütleme için geliştirilen birçok yöntem bulunmaktadır. Bu yöntemlerden bazıları; Mel Frekans Kepstrum Katsayıları, Sıfır Geçiş, Fark Fonksiyonların Ortalama Büyüklüğü, Enerji Eşik Değeri Yöntemi, Entropi ve Ayrık Fourier dönüşümü yöntemleridir. Bu yöntemlerden en çok kullanılan yöntem eşik değeri yardımıyla yapılan bölütleme işlemidir. Fakat bu yöntem birçok ses verisi için uygun değildir. Eğer verideki genlik değerleri artan bir fonksiyon oluşturuyorsa, nota bölütleri art arda çalınmaya başlanmışsa ya da notada herhangi bir boğumlanma olmuşsa bu yöntem istenilen verimde bir bölütleme yapamaz.Bu çalışmada bölütleme için uygulanabilecek yeni bir yöntem önerilmiştir. Bu yöntemde ses verisi üzerinde küçük çerçeveler alınarak bir çeşit öznitelik çıkarımı yapılır. Bu öznitelik katsayılarına göre bir bulanık kümeleme uygulanarak bölütleme bölgeleri belirlenir. Böylece seslilik bölgeleri belirlenmiş olur. Bu durumda sessizlik bölgeleri düşük üyelik değerlerinde seslilik kümlerine katılır. Her seslilik bölgesi bulanık bir kümeyle bölütlere ayrılmış olur.Anahtar Kelimeler: Ses sinyali bölütleme, Bulanık bölütleme, Öznitelik çıkarıma

The numerical audio data obtained from the music volume involve very complex structures and various styles. The purpose of this paper, is both to separate the audio data into segments based on the content of them using fuzzy clustering method and also to predict the region of each note. In the literature, there are many methods developed for segmentation. Some of these methods are: Mel Frequency Campestral Coefficients, Zero-Crossing, Average Magnitude Difference Function, Energy Threshold Value, Entropy and Discrete Fourier transform methods. The most commonly used method is segmentation process using threshold value. This method cannot segment efficiently in some situations provided that the amplitude of values create an increasing function or musical notes and start to play in succession or any made musical notes fade out. This method cannot desire segmentation efficiency. In this paper, a new method is proposed that uses segmentation.In this method, various feature coefficient extraction is performed by taking small frameworks on the audio data. The regions of the segment are determined using fuzzy clustering according to these feature coefficients, therefore, the silence regions are determined. In this case, silence zones join silence clusters taking low membership values. Therefore, each silence region becomes separated from the segment with a fuzzy cluster.Key Words: Audio segmentation, Fuzzy segmentation, Feature extraction

URI

https://acikbilim.yok.gov.tr/handle/20.500.12812/478834

Collections

TEZLER

Except where otherwise noted, this item's license is described as info:eu-repo/semantics/openAccess