Derin öğrenme yöntemleri ile periferik yayma görüntülerinin analizi ve sınıflandırılması

Baydilli, Yusuf Yargi

View/Open

File_10326504 (5.951Mb)

Date

2020

Author

Baydilli, Yusuf Yargi

Metadata

Show full item record

Abstract

Beyaz kan hücrelerinin insan bağışıklık sisteminden sorumlu olması ve alt-tiplerinin kan hastalıklarının belirlenmesinde önemli bir indikatör olarak görev yapması nedeniyle, beyaz kan hücrelerinin doğru bir şekilde tanımlanması büyük bir önem arz etmektedir. Fakat, hücrelerin hematolojist tarafından manüel olarak incelenmesi süreci uzatmakta, aynı zamanda hataya yatkın bir hale getirmektedir. Dolayısıyla hem bu süreci hızlandırabilecek hem de yüksek başarı oranlarına ulaşabilecek bir makine öğrenmesi modeli gereksinimi oluşmaktadır.Derin öğrenme, ham verilerden herhangi bir ön-işleme sürecine ihtiyaç duymadan çıkarım yapabilmesi ve bu çıkarımları iyi bir şekilde özetleyebilmesi sayesinde, son on yılda oldukça fazla tercih edilen makine öğrenmesi yöntemlerinden biri olmuştur. Fakat, bu yöntemlerin ulaşacağı başarı değerleri, eğitim aşamasında kullanılan örnek sayısı ile doğru orantılıdır. Ne yazık ki, medikal veri toplama sürecinin gerek etik gerekse maliyet açısından meşakkatli olması, bu yöntemler ile kararlı bir sunum ortaya koymanın önündeki en büyük engeli oluşturmaktadır. Bu yüzden derin öğrenme yöntemlerini kullanarak medikal veri analizi gerçekleştirmek isteyen araştırmacılar, veri arttırma tekniklerinden faydalanarak veya farklı koşullarda elde edilen verileri birleştirerek veri seti boyutunu yüksek noktalara taşımaya çalışmaktadır. Fakat, birinci seçenek ön-işleme süreci anlamına gelirken, ikinci seçenek ise alan kayması probleminin ortaya çıkmasına neden olmaktadır.Bu çalışmada medikal verilerin analizi sürecinde sıklıkla karşılaşılan problemlere beyaz kan hücrelerinin alt-tiplerinin sınıflandırılması özelinde çözümler aranmıştır. Bu amaçla çalışmanın ilk bölümünde, az veri ile baştan-sona bir eğitim imkânı sağlayacak derin öğrenme modeli ihtiyacı göz önünde bulundurularak, literatürde sıklıkla kullanılan yöntemler detaylı bir şekilde incelenmiştir. Gerçekleştirilen testler sonunda, Kapsül Ağlarının bu ihtiyacı giderebileceği ulaştığı yüksek doğruluk değerleriyle gösterilmiştir (%96,86). Çalışmanın ikinci aşamasında ise, beyaz kan hücreleri veri setinde gözlemlenen alan kayması probleminin çözümüne yönelik yeni bir model önerilmiştir. Model sayesinde yalnızca bir veri setinden çıkarılan özelliklerin, diğer veri setlerinde de efektif bir şekilde kullanılması sağlanmıştır. Dokuz farklı veri seti üzerinde yapılan testlerde başarılı sonuçlar elde edilmiş, modelin tüm veri setleri üzerinde ulaştığı toplam doğruluk değeri ise %98,09 olarak ölçülmüştür.

Due to the fact that white blood cells are responsible for the human immune system and sub-types of them serve as an important indicator to the detection of blood diseases, it is so important that white blood cells are accurately identified. However, the manual examination of these cells by the hematologist prolongs the process and at the same time makes it prone to error. Therefore, there is a need for a machine learning model that able to accelerate this process and achieve high success rates.Deep learning has become one of the most preferred methods in machine learning in the last decade, since it can extract features from raw data without any pre-process stage and summarize these inferences well. However, the success of these methods is directly proportional to the number of samples used in the training phase. Unfortunately, the fact that the medical data collection process is difficult due to both ethical and cost reasons constitutes the biggest obstacle to making a stable presentation with these methods. Therefore, researchers who want to perform medical data analysis using deep learning methods try to increase the data set size by taking advantage of data augmentation techniques or by combining the data obtained under different conditions. However, while the first option means to pre-process, the second option causes the domain shift problem.In this study, solutions to the problems frequently encountered during the medical data analysis were searched in the basis of classification of white blood cells sub-types. For this purpose, in the first part of the study, by considering the need for a deep learning model that will provide an end-to-end training with few samples, the methods frequently used in the literature are examined in detail. At the end of the tests carried out, it was shown that the Capsule Networks could meet this need with high accuracy values (96.86%). In the second stage of the study, a new model was proposed to solve the domain shift problem observed in the white blood cell data set. Thanks to the model, the features extracted from only one data set was able to be used effectively in other data sets. Tests performed on nine different data sets yielded successful results and the total accuracy of the model on all data sets was measured as 98.09%.

URI

https://acikbilim.yok.gov.tr/handle/20.500.12812/99448

Collections

TEZLER

Except where otherwise noted, this item's license is described as info:eu-repo/semantics/openAccess