Automatic knowledge extraction for filling in biography forms from Turkish texts

Demirci, İlknur

dc.contributor.advisor	Orhan, Zeynep
dc.contributor.author	Demirci, İlknur
dc.date.accessioned	2021-05-07T11:39:58Z
dc.date.available	2021-05-07T11:39:58Z
dc.date.submitted	2009
dc.date.issued	2018-08-06
dc.identifier.uri	https://acikbilim.yok.gov.tr/handle/20.500.12812/616053
dc.description.abstract	Bu çalışma, Türkçe metinlerden biyografi formları oluşturmak için otomatik bilgi çıkarımı projesinin nasıl yapıldığını anlatmaktadır. Çalışmanın verimini ve sonuçların kalitesini arttırmak için, altı biyografi kategorisi seçilmistir. Bu kategoriler; Cumhurbaşkanları, Devlet adamları, Yazarlar, Şairler, Oyuncular ve Şarkıcılar olmak üzere okuyucular tarafından en sık incelenen biyografi türleridir.Yapılan incelemeler sonucu bu biyografilerde en çok vurgulanan altı tane alan belirlenmiştir. Bu alanlar; Doğum Tarihi. Ölüm tarihi, Eğitim,Tecrübe, Eserler ve Ödüller bilgilerini içermektedir.Belirtilen alanlar için düzenli ifadeler ile kurallar oluşturarak bilgi çıkarımı yapılmıştır. Bu kuralların herbiri belirlenmiş olan alanlar için özel olarak oluşturulup, kuralların Türkçe metinler üzerinde uygulanması ile herbir alan için bilgi çıkarımı yapılmıştır.Çıkarımı yapılan bilginin doğruluğunu ölçmek için özel bir test platformu oluşturulmuştur. Bu platformdan çıkan sonuçlara göre, otomatik biyografi formu oluşturma projesi, özellikle Türkçe ile oluşturulacak formlar için ileri seviyede geliştirilebilir ve gelecek vaadeden bir projedir.
dc.description.abstract	This study represents the idea on building an automatic knowledge extraction for filling in biography forms from Turkish Texts. There are six biography categories, chosen to be analysed in this study: Presidents, Politicians, Authors, Poets, Actors, and Singers, which are found to be the most frequently read biography types by the users.Analyzing these biographies led to the observation that the most important emphasis is put on six particular fields; these fields are Date of Birth, Date of Death, Education, Experience, Contributions, and Rewards. Information for the fields to be filled is extracted by creating rules of regular expressions. The rules are tailored according to the structure of desired data blocks. Information is then extracted for each field by running these regular expression rules on Turkish texts.A separate testing platform is designed to evaluate the accuracy of extracted data. Results of the testing platform have shown this study to be a promising process to be further developed especially for Turkish language forms.	en_US
dc.language	English
dc.language.iso	en
dc.rights	info:eu-repo/semantics/openAccess
dc.rights	Attribution 4.0 United States	tr_TR
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject	Bilgisayar Mühendisliği Bilimleri-Bilgisayar ve Kontrol	tr_TR
dc.subject	Computer Engineering and Computer Science and Control	en_US
dc.title	Automatic knowledge extraction for filling in biography forms from Turkish texts
dc.title.alternative	Türkçe metinlerden biyografi formları doldurmak için otomatik bilgi çıkarımı
dc.type	masterThesis
dc.date.updated	2018-08-06
dc.contributor.department	Bilgisayar Mühendisliği Ana Bilim Dalı
dc.identifier.yokid	341246
dc.publisher.institute	Fen Bilimleri Enstitüsü
dc.publisher.university	FATİH ÜNİVERSİTESİ
dc.identifier.thesisid	342777
dc.description.pages	106
dc.publisher.discipline	Diğer

Files in this item

Name:: yokAcikBilim_341246.pdf
Size:: 1.754Mb
Format:: PDF
Description:: File_341246

View/Open

This item appears in the following Collection(s)

TEZLER

Show simple item record

Except where otherwise noted, this item's license is described as info:eu-repo/semantics/openAccess