Ottoman-Turkish optical character recognition and latin transcription

Doğru, Mustafa

dc.contributor.advisor	Koyuncu, Fatih
dc.contributor.author	Doğru, Mustafa
dc.date.accessioned	2021-05-08T12:36:36Z
dc.date.available	2021-05-08T12:36:36Z
dc.date.submitted	2016
dc.date.issued	2018-08-06
dc.identifier.uri	https://acikbilim.yok.gov.tr/handle/20.500.12812/702281
dc.description.abstract	Arşivlerde veya çevrim içi kaynaklarda sayısız Osmanlıca belgeler vardır. Bu belgeler maalesef Osmanlıca okuyamayan kişiler tarafından anlaşılamamaktadır. Osmanlı Türkçesi optik karakter tanıma ve Latince transkripsiyonu bu problemin çözümü olabilir. Bu tezde Tesseract optik karakter tanıma motoru Osmanlıca karakterleri tanımak için kullanılmıştır. Ayrıca Osmanlı Türkçesinden Latinceye transkripsiyon için çeşitli metotlar geliştirilmiştir. Bazı Osmanlıca resimlerdeki karakterler optik karakter tanıma metotları ile tanınamamaktadır. Tanınamayan bu karakterleri Osmanlıca alfabesi ile yazmak için Osmanlıca klavye geliştirilmiştir. Transkripsiyon işlemi için sözlük tabloları kullanılmaktadır. Sözlük tablolarındaki veriyi zenginleştirmek transkripsiyon başarısını artıracağından dolayı sözlük tablolarını geliştirmek için bir uygulama geliştirilmiştir.
dc.description.abstract	There are numerous documents in Ottoman-Turkish on the archives or online resources. Unfortunately these documents could not be understood by the people who cannot read Ottoman-Turkish alphabet. Ottoman-Turkish optical character recognition and Latin transcription could be the solution of this problem. In this thesis, Tesseract optical character recognition engine is used in order to recognize Ottoman-Turkish characters. Also, various methods are developed for the transcription from Ottoman Turkish to Latin. Characters on some Ottoman-Turkish images could not be recognized by optical character recognition methods. So, Ottoman-Turkish keyboard was developed for writing unrecognized characters with Ottoman-Turkish alphabet. Dictionary tables are used for transcription process. So enrichment data in the dictionary tables will increase of transcription success. Thus, an application was developed for enrichment data in the dictionary tables.	en_US
dc.language	English
dc.language.iso	en
dc.rights	info:eu-repo/semantics/openAccess
dc.rights	Attribution 4.0 United States	tr_TR
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject	Bilgisayar Mühendisliği Bilimleri-Bilgisayar ve Kontrol	tr_TR
dc.subject	Computer Engineering and Computer Science and Control	en_US
dc.title	Ottoman-Turkish optical character recognition and latin transcription
dc.title.alternative	Osmanlı Türkçesi optik karakter tanıma ve latince transkripsiyonu
dc.type	masterThesis
dc.date.updated	2018-08-06
dc.contributor.department	Bilgisayar Mühendisliği Ana Bilim Dalı
dc.identifier.yokid	10100885
dc.publisher.institute	Fen Bilimleri Enstitüsü
dc.publisher.university	YILDIRIM BEYAZIT ÜNİVERSİTESİ
dc.identifier.thesisid	440087
dc.description.pages	79
dc.publisher.discipline	Diğer

Files in this item

Name:: yokAcikBilim_10100885.pdf
Size:: 3.629Mb
Format:: PDF
Description:: File_10100885

View/Open

This item appears in the following Collection(s)

TEZLER

Show simple item record

Except where otherwise noted, this item's license is described as info:eu-repo/semantics/openAccess