Comparative analysis of vector quantization methods used in speech processing

Faraj, Hiba Faraj.Ali

dc.contributor.advisor	Özaydın, Selma
dc.contributor.author	Faraj, Hiba Faraj.Ali
dc.date.accessioned	2020-12-04T11:15:35Z
dc.date.available	2020-12-04T11:15:35Z
dc.date.submitted	2019
dc.date.issued	2020-02-07
dc.identifier.uri	https://acikbilim.yok.gov.tr/handle/20.500.12812/77566
dc.description.abstract	Vektör nicemleme teknikleri konuşma sinyalini sıkıştırmada hayati bir öneme sahiptir. Çok çeşitli vektör nicemleme metotları mevcuttur. Herbir teknik kendine has avantaj ve dezavantajlar içermektedir ve tüm yönleriyle mükemmel sonuçlar veren bir vektör nicemleme metodu henüz yoktur. Bu tez çalışması, mevcut vektör nicemleme tekniklerinin performansını yeni metotlar uygulayarak iyileştirmeyi amaçlamaktadır. Bu tezde mevcut metotlardan hareketle melez vektör nicemleme teknikleri uygulanmıştır. Tasarlanan vektör nicemleyicilerin performansı, spectral distorsiyon, hesapsal karmaşa ve hafıza gereksinimleri bakımından değerlendirilmiştir. Bu tez çalışması kapsamında Çok aşamalı vektör nicemleme (MSVQ) metodu, Split vektör nicemleme (SVQ) metodu, Artık sinyal vektör nicemleme (RVQ) metodu, ve sesli/sessiz artık sinyal vektör nicemleme metodu (VUV_RMSVQ) analiz edilmiştir. VUV_RMSVQ metodu en iyi test sonuçlarını verdiğinden, bu metotla optimum kod tablosu tasarlamada yeni metotlar bulabilmek için araştırma derinleştirilmiştir. Daha sonra, tüm tasarlanan vektör nicemleme metotlarının performansları var olan metotlarla karşılaştırılmıştır. Tüm çalışma standart TIMIT veritabanı kullanılarak ve bu veritabanında temiz ve gürültülü ses verileri kullanılarak yürütülmüştür. Herbir vektör nicemleme metodu için bir Doğrusal öngörülü kodlama (LPC) tabanlı kod tablosu üretim algoritması tasarlanmıştır. Vektör nicemleme LPC analiz ve sentez arasında gerçekleştirilen bir işlemdir. Vektör nicemleme için gerekli konuşma parametreleri çizgi spectrum frekanslarıdır (LSF) ve bunlar LPC katsayılarından elde edilirler. Tez çalışmasının başlangıcında, MSVQ ve SVQ metotları ile kod tabloları tasarladık ve bunları spectral distorsiyon bakımından karşılaştırdık. MSVQ metodu ile tasarlanan kod tablolarının daha iyi sonuçlar verdiğini gördük. Daha sonra, kod tablosu tasarlamak için RMSVQ ve RSVQ metotlarını kullandık. Sonuçlardan görüldü ki en iyi sonuç RMSVQ metodu tarafından verildi. Sonuç olarak, RSMVQ metodu ile devam ettik ve spectral distorsiyon için en iyi performansı başarabilmek için sesli/sessiz karar metodunu RSMVQ metodu ile birleştirdik. Test sonuçlarına göre, en iyi performansın VUV_RMSVQ metodu ile başarıldığı görüldü.
dc.description.abstract	Vector quantization techniques play a vital role in compression of speech signals. There are a variety of vector quantization techniques. Each technique has its own advantages and disadvantages and there is no vector quantization technique presenting perfect results in all aspects till now. This thesis deals with enhancing the performance of the existing vector quantization techniques by using new methods. In this thesis hybrid vector quantization techniques which are produced from the existing methods are proposed. The performance of the designed vector quantizers are evaluated in terms of the spectral distortion measured, computational complexity and memory requirements.In the scope of this thesis, Multistage vector quantization (MSVQ), Split Vector Quantization (SVQ), Residual Vector Quantization (RVQ), Residual Multistage Vector Quantization (R-MSVQ), Residual Split Vector Quantization (R_SVQ) and voiced/unvoiced Residual Multistage Vector Quantization methods (VUV_RMSVQ) are analyzed. Because the VUV_RMSVQ method gave the better test results, further research is directed to find an optimum performance for codebook design with this method. Then, the overall performance of the proposed vector quantization techniques is compared with the existing vector quantization techniques. Whole work is carried out using the standard TIMIT database and both clean and noisy data are tested to evaluate the performance of the designed codebooks against noise. A linear predictive coding (LPC) based codebook generation algorithm is designed for each vector quantization method. Vector quantization is the process done in between LPC analysis and synthesis. The speech parameters required for vector quantization are the line spectral frequencies (LSF) and are obtained from the LPC coefficients.At the beginning of the thesis study, we designed codebooks with MSVQ and SVQ methods and we compared them in terms of spectral distortion. We found that the codebooks with MSVQ method gave better performance. Then, we used the RMSVQ and RSVQ methods to design codebooks. It is seen that the best result was given by RMSVQ. As a result, we continued with RMSVQ and we combined the voiced and unvoiced decision method and RSMVQ technique to achieve better result for spectral distortion. According to the results, it is seen that the best performance is achieved with VUV_RMSVQ method	en_US
dc.language	English
dc.language.iso	en
dc.rights	info:eu-repo/semantics/openAccess
dc.rights	Attribution 4.0 United States	tr_TR
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject	Elektrik ve Elektronik Mühendisliği	tr_TR
dc.subject	Electrical and Electronics Engineering	en_US
dc.title	Comparative analysis of vector quantization methods used in speech processing
dc.title.alternative	Konuşma işlemede kullanılan vektör nicemleme metotlarının karşılaştırmalı analizi
dc.type	doctoralThesis
dc.date.updated	2020-02-07
dc.contributor.department	Elektronik ve Haberleşme Mühendisliği Anabilim Dalı
dc.identifier.yokid	10318151
dc.publisher.institute	Fen Bilimleri Enstitüsü
dc.publisher.university	ÇANKAYA ÜNİVERSİTESİ
dc.identifier.thesisid	608108
dc.description.pages	126
dc.publisher.discipline	Diğer

Files in this item

Name:: yokAcikBilim_10318151.pdf
Size:: 3.400Mb
Format:: PDF
Description:: File_10318151

View/Open

This item appears in the following Collection(s)

TEZLER

Show simple item record

Except where otherwise noted, this item's license is described as info:eu-repo/semantics/openAccess