Low level image processing tools for optical character recognition of Turkish newspapers

Bariş, Ufuk

View/Open

File_82861 (9.896Mb)

Date

1999

Author

Bariş, Ufuk

Metadata

Show full item record

Abstract

ÖZET Bu tezde yer alan yöntemlerin amacı yazı imge bloğunu ASCII metin haline çevirmektir. İlk önce öncül satır belirleme ve nihai satır ayırma adımlarını içeren geliştirilmiş bir satır bölütleme yöntemi uygulanmıştır. Daha sonra her damganın yeri saptanmış ve bir silme çerçeve içerisine yerleştirilmiştir. Damga bölütlendirme adımım damga önsınıflandırma yöntemi takip etmiş ve damgalar 8 önsınıfa ayrılmıştır. Bu önsımflardan bir tanesi Türkçe noktalı küçük harfleri (i, ö, ü) içerir. Karşılaştırmalı olarak değerlendirilen çift doğrusal ve en yakın komşuluk aradeğerleme yöntemlerinden çift doğrusal aradağerleme yöntemi boyut düzgelemesi için kullanılmıştır. Bir damga tanıma yöntemini geribesleme olarak kullanan bitişik damga bölütlendirme yöntemi bitişik damgaların ayrılmasında kullanılmıştır. Birçok öznitelik arasında, şablonlar, pencere gözleri ve Fourier belirteçleri birbirlerini tamamlama özelliklerine sahip oldukları için öznitelik olarak kullanılmıştır. Damga sınıflandırma için, RBF yapay sinir ağı ve şablon eşleme yöntemleri seçilmiştir. RBF yapay sinir ağı, pencere gözü ve Fourier belirteçleri ile beslenmiştir. Daha sonra basit dilbilimsel kuralları içeren ardişleme yöntemi kullanılmıştır. En son olarak geliştirilen birleştirme adımı yardımı ile sınırlandırıcı çıkışları birleştirilmiş ve nihayi metin elde edilmiştir. Sonuç olarak, % 99.2 doğru tanıma başanmına sahip damga tanıma yazılımı geliştirilmiştir.

IV ABSTRACT The objective of the methods which are included by this thesis is to convert a text image block to ASCII text. Firstly we have implemented a modified line segmentation method that comprises an initial line identification step and a final line segmentation step. Each character was then located, segmented and put into its bounding box. The character segmentation step was followed by a preclassification step in which characters were separated into 8 preclasses (one of them is for Turkish lower case letters with dots (i,ü,ö)). We have comparatively assessed bilinear and nearest neighbour interpolation methods and bilinear was used for size normalization. A merged character segmentation method employing feedback from a character recognition tool was used. Among several possible feature types, we have used templates, zoning and Fourier descriptors. The rationale for their choice was that they were somewhat complementary to each other. For character classification, the RBF neural network and template matching were chosen. The RBF classifier was applied to zoning and Fourier descriptors feature sets. A post processing step that includes some basic linguistic rules was employed. The outputs were combined to get the ultimate OCR text output. In conclusion, we have developed a complete OCR software that archives 99.2 % correct recognition performance.

URI

https://acikbilim.yok.gov.tr/handle/20.500.12812/80000

Collections

TEZLER

Except where otherwise noted, this item's license is described as info:eu-repo/semantics/embargoedAccess