Modelling of Turkey Turkish words by discrete Markov processes

Güventürk, Jale

dc.contributor.advisor	Koltuksuz, Ahmet Hasan
dc.contributor.author	Güventürk, Jale
dc.date.accessioned	2021-05-08T08:08:43Z
dc.date.available	2021-05-08T08:08:43Z
dc.date.submitted	1998
dc.date.issued	2020-12-24
dc.identifier.uri	https://acikbilim.yok.gov.tr/handle/20.500.12812/643129
dc.description.abstract	SUMMARY The Redhouse Turkish Dictionary was transferred to the electronic environment and then gone under a filtration process. The filtration process involved all words starting with capital letters being replaced with small case ones, spaces between idioms and two or more word phrases being deleted to make them appear as a single word, and words that are spelled exactly the same but carry different meanings were eliminated so that only a single one was left in the sample space. Cryptanalytical measures of Turkey Turkish words converted into their corresponding c-v patterns were obtained by Markov processes approach. These measures were obtained for 0, 1st, 2nd, 3rd, and 4th degree approaches each. For each word available in the sample space and/or dictionary; 1 ) The word itself 2) It's c-v pattern 3) Word length 4) Its c-v pattern's conditional probability starting from 0 order to n^ order (n=wordlength~ 1 ) 5) For each order the corresponding entropy values were calculated. The number of words analyzed is 21,395. 35
dc.description.abstract	ÖZET Redhouse Türkçe sözlüğü elektronik ortama aktarılmış ve daha sonra saflaştırma işlemine tabii tutulmuştur. Saflaştırma işleminin aşamaları sırasıyla şöyledir; büyük harfle başlayan kelimeler küçük harflerle değiştirilmiş, deyimler ve birden fazla sözcükten oluşan isimler arasındaki boşluklar silinmiş, aynı şekilde yazılan fakat farklı anlamlar taşıyan kelimelerden yanlızca bir tanesi kalmak üzere diğerleri örnek uzayından silinmiş, son olarak günümüz Türkçesi'nde kullanılmayan sözcükler elimine edilmiştir. Türkiye Türkçesi'nde kullanılan kelimelerin kriptanalitik ölçütleri ayrık Markov yaklaşımlarıyla belirlenmiştir. Bu ölçütler sırasıyla 0, 1., 2., 3. ve 4. derece yaklaşımlarla elde edilmiştir. Örnek uzayında ve/veya sözlükte yer alan tüm kelimeler için; 1 ) Kelimenin kendisi 2) Sesli-sessiz deseni 3) Kelime uzunluğu 4) 0 ile n arası yaklaşımların her biri için sesli-sessiz deseninin koşullu olasılık (n=kelime uzunluğu -1) 5) Her derece için karşılık gelen entropi değerleri belirlenmiştir. Analiz edilen toplam sözcük sayısı 21,395 tanedir. 36	en_US
dc.language	English
dc.language.iso	en
dc.rights	info:eu-repo/semantics/embargoedAccess
dc.rights	Attribution 4.0 United States	tr_TR
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject	Bilgisayar Mühendisliği Bilimleri-Bilgisayar ve Kontrol	tr_TR
dc.subject	Computer Engineering and Computer Science and Control	en_US
dc.title	Modelling of Turkey Turkish words by discrete Markov processes
dc.type	masterThesis
dc.date.updated	2020-12-24
dc.contributor.department	Bilgisayar Mühendisliği Ana Bilim Dalı
dc.subject.ytm	Markov approach
dc.subject.ytm	Turkish
dc.subject.ytm	Cryptanalysis
dc.identifier.yokid	78037
dc.publisher.institute	Fen Bilimleri Enstitüsü
dc.publisher.university	İZMİR YÜKSEK TEKNOLOJİ ENSTİTÜSÜ
dc.identifier.thesisid	78037
dc.description.pages	66
dc.publisher.discipline	Diğer

Files in this item

Name:: yokAcikBilim_78037.pdf
Size:: 1.583Mb
Format:: PDF
Description:: File_78037

View/Open

This item appears in the following Collection(s)

TEZLER

Show simple item record

Except where otherwise noted, this item's license is described as info:eu-repo/semantics/embargoedAccess