Metin dosyalarının birleştirilmesinde yakınlık ölçütlerinin kullanılması

Şahin, Alperen

dc.contributor.advisor	Sencar, Hüsrev Taha
dc.contributor.author	Şahin, Alperen
dc.date.accessioned	2021-05-08T11:21:50Z
dc.date.available	2021-05-08T11:21:50Z
dc.date.submitted	2015
dc.date.issued	2018-08-06
dc.identifier.uri	https://acikbilim.yok.gov.tr/handle/20.500.12812/683178
dc.description.abstract	Parçalanmış dosyaların kurtarılması öncelikli olarak parçalanmış dosyalar arasındaki yakınlığın doğru bir şekilde değerlendirilmesine dayanır. Metin tabanlı dosyalar veriyi oldukça zayıf bir yapıda tuttuğu için parçaların birleştirilmesi işi zorlu bir iştir. Bu çalışmada, metin dosyalarının birleştirilmesinde kullanılan mevcut yakınlık ölçütlerini değerlendirdik. Aldığımız sonuçlara göre mevcut metotların her birinin de tek başına değerlendirildiğinde hedeflenen noktadan uzak olduğu görüldü. Daha sonra PTCR isimli yeni metodu tanıttık. Bu metot metin dosyaları içerisinde gerek dosya tanımı, gerek verilerin sunulması gerekse de verilerin işlenmesinde kullanılan oldukça sınırlı karakteristik yapılardan yakınlık değerleri çıkarmayı hedefleyen bir metottur. Yaklaşımımız, daha verimli yakınlık değerlendirmeleri elde etmek için dosya içerisindeki dosyaya özel olan etiket-kelimelerin sıralamaları üzerine istatiksel bir model kurmaktadır. Sonuçlara göre birleştirme performansı PTCR metodunun da katkısıyla dikkate değer bir iyileşme göstermiştir.
dc.description.abstract	Recovery of fragmented files relies on the ability to accurately evaluate the adjacency of two fragments. Text-based files typically organize data in a very weakly structured manner; therefore, fragment reassembly remains a challenging task. In this work, we evaluate existing adjacency measures that can be used for assembling fragmented test files. Our results show that individual performances of existing measures are far from adequately addressing this need. We then introduce a new approach that attempts to exploit the limited structural characteristics of text files which utilize constructs for description, presentation, and processing of file data. Our approach builds a statistical model of the ordering of file-type specific constructs and incorporates this information into adjacency measures for more reliable fragment reassembly. Results show that reassembly accuracy increases significantly with this approach.	en_US
dc.language	Turkish
dc.language.iso	tr
dc.rights	info:eu-repo/semantics/openAccess
dc.rights	Attribution 4.0 United States	tr_TR
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject	Bilgisayar Mühendisliği Bilimleri-Bilgisayar ve Kontrol	tr_TR
dc.subject	Computer Engineering and Computer Science and Control	en_US
dc.subject	Bilim ve Teknoloji	tr_TR
dc.subject	Science and Technology	en_US
dc.title	Metin dosyalarının birleştirilmesinde yakınlık ölçütlerinin kullanılması
dc.title.alternative	A study on adjajency measures for reassembling text files
dc.type	masterThesis
dc.date.updated	2018-08-06
dc.contributor.department	Bilgisayar Mühendisliği Ana Bilim Dalı
dc.identifier.yokid	10086348
dc.publisher.institute	Fen Bilimleri Enstitüsü
dc.publisher.university	TOBB EKONOMİ VE TEKNOLOJİ ÜNİVERSİTESİ
dc.identifier.thesisid	409927
dc.description.pages	40
dc.publisher.discipline	Diğer

Files in this item

Name:: yokAcikBilim_10086348.pdf
Size:: 3.019Mb
Format:: PDF
Description:: File_10086348

View/Open

This item appears in the following Collection(s)

TEZLER

Show simple item record

Except where otherwise noted, this item's license is described as info:eu-repo/semantics/openAccess