A single chip solution for text-to-speech synthesis

Aktan, Ozan

dc.contributor.advisor	Dündar, Günhan
dc.contributor.author	Aktan, Ozan
dc.date.accessioned	2020-12-04T11:16:50Z
dc.date.available	2020-12-04T11:16:50Z
dc.date.submitted	2004
dc.date.issued	2018-08-06
dc.identifier.uri	https://acikbilim.yok.gov.tr/handle/20.500.12812/77692
dc.description.abstract	ÖZET METİNDEN KONUŞMA SENTEZİ İÇİN TEK YONGALI ÇÖZÜM Elektronik aygıtlarda bulunan konuşma arayüzleri, insan-makine haberleşme sistemlerinde önemli rol oynamaktadır. Bunun temel sebepleri arasında bilginin küçük taşınabilir aletlerde kullanılabilirliği ve güvenlik sebeplerinden dolayı görsel bir arayüz kullanmanın uygunsuz olması ya da taşınabilir uygulamaların bağlantılanabilirlik ve karmaşıklığım arttırması yer almaktadır. Dolayısıyla metinden konuşma sentezi, düşük bant genişliğine sahip metni kullanıcıya kolay anlaşılabilir bilgi olarak sunan konuşma arayüzünün önemli bir parçasıdır. Bu tezde, metinden konuşma sentezi için tek yongalı bir çözüm sunulmaktadır. Tümleşik devre, ASCII formatmda gelen harfleri sınırsız kelime haznesiyle konuşmaya çevirirken harflerin içinde geçtiği metinden faydalanmaktadır. Sistemin, daha önceden kaydedilmiş insan sesi örneklerinin LPC yöntemiyle kodlanarak saklandığı bir veritabam bulunmaktadır.Tasarlanan sistem, bir ana işlemciyle haberleşerek ASCII formatmdaki metni kabul etmektedir. Yonga, kaydedilmiş insan sesi örneklerini kullanarak konuşma sentezi gerçekleştirmektedir. Önerilen sistem, sınırsız kelime haznesiyle gerçek zamanlı metinden konuşma sentezim, insan sesi elemanlarını art arda birleştirerek gerçekleştirmesi açısından bir ilktir. Yonga, yüksek seviyeli donanım tanımlama dili olan VHDL ile gerçeklenmiş ve AMS 0.35um üç metal teknolojisinde parametrik olarak sentezlenmiştir. Bu gerçek, tümleşik devrenin tasarım sırasında yapılacak küçük değişiklikler ile diğer uygulamalarda bir fikri hak (İP) olarak kullanılmasını mümkün kılmaktadır. Ayrıca yonganın karmaşıklığının son derece düşük olması, düşük güç tüketimi sağlamakla birlikte yonganın FPGA olarak gerçeklenmesi veya daha büyük yongalarda fikri hak (İP) olarak kullanılabilmesine olanak sağlamaktadır. Sunulan sistemin çok lisanda metinden konuşma sentezlemesi mümkündür ve birçok uygulama alanı bulunmaktadır.
dc.description.abstract	IV ABSTRACT A SINGLE CHIP SOLUTION FOR TEXT-TO-SPEECH SYNTHESIS Speech interfaces to electronic devices play an important role in man-machine communication systems. This stems from several factors including the availability of information on small portable devices, an increasing realization of safety factors whereby using a visual interface is inappropriate, and the increasing complexity and connectivity of portable information appliances. Therefore, text-to-speech synthesis is a vital component of a speech interface, which allows low-bandwidth text to supply a user with easy to understand information. A single chip solution for text-to-speech synthesis is presented in this thesis. The integrated circuit converts incoming letters in ASCII format to unlimited vocabulary speech by using clues from the text's context. The system has a language dependent database, which contains pre-recorded human speech samples coded by the LPC method and communicates with a host processor accepting streaming text in ASCII format. The chip generates speech output from incoming text by utilizing recorded samples of natural voice. The proposed system is the first hardware solution for synthesizing unlimited vocabulary Turkish speech in real time by concatenating human speech elements. The chip is implemented using high-level hardware description language VHDL and synthesized in AMS 0.35um triple metal technology parametrically. This fact allows the integrated circuit to be used as an IP in other applications with some minor modifications in the design. Furthermore, the chip has a very low-complexity, resulting in low power and flexibility for FPGA implementation or incorporation into larger chips as IP. The presented system also supports multi-lingual text-to-speech synthesis and has many application areas.	en_US
dc.language	English
dc.language.iso	en
dc.rights	info:eu-repo/semantics/embargoedAccess
dc.rights	Attribution 4.0 United States	tr_TR
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject	Elektrik ve Elektronik Mühendisliği	tr_TR
dc.subject	Electrical and Electronics Engineering	en_US
dc.title	A single chip solution for text-to-speech synthesis
dc.title.alternative	Metinden konuşma sentezi için tek yongalı çözüm
dc.type	masterThesis
dc.date.updated	2018-08-06
dc.contributor.department	Elektrik-Elektronik Mühendisliği Anabilim Dalı
dc.identifier.yokid	169861
dc.publisher.institute	Fen Bilimleri Enstitüsü
dc.publisher.university	BOĞAZİÇİ ÜNİVERSİTESİ
dc.identifier.thesisid	152409
dc.description.pages	57
dc.publisher.discipline	Diğer

Files in this item

Name:: yokAcikBilim_169861.pdf
Size:: 2.065Mb
Format:: PDF
Description:: File_169861

View/Open

This item appears in the following Collection(s)

TEZLER

Show simple item record

Except where otherwise noted, this item's license is described as info:eu-repo/semantics/embargoedAccess