Generative vs. discriminative models for vision based hand gesture recognition

Keskin, Cem

View/Open

File_10151272 (14.37Mb)

Date

2017

Author

Keskin, Cem

Metadata

Show full item record

Abstract

Bu tezde, dinamik el hareketleri gibi zaman dizilerinin modellenmesi konusunu inceliyoruz.Modelleme problemine otomata ve simgesel dil teorileri kullanarak yaklaşıyorve el hareketlerinin modellenmesi açısından önemli özelliklerin tanımlanmasını sağlıyoruz.Bu sayede önerilen modellerin başarımları ve yeteneklerini ayrıntılı şekilde inceleyecekbir yaklaşım geliştirmiş oluyoruz. Bu yöntemle varolan modellerin eksikliklerinitanımlıyor ve hangi özelliklere ihtiyaçları olduğunu keşfediyoruz. Özellikle saklıMarkov modelleri, girdi-çıktı Markov modelleri, belirli süre modelleri, saklı koşullu rassalalanlar ve saklı yarı Markov modelleri inceliyor ve karşılaştırıyoruz. Bunların sonucundaaraştırmamızda ortaya çıkan tüm önşartlara uyan bir saklı yarı Markov modelörneği öneriyoruz. Ayrıca sol-sağ yapıdaki bir modelin izole el hareketleri için en uygunmodel olduğunu gösteriyoruz. Son olarak bütün modelleri karşılaştırıyor ve sonuçlarınıbelgeliyoruz. Tezin ikinci kısmında rassal karar ormanları kullanarak el şekli ve pozutanıma problemine yoğunlaşıyoruz. Bilinen bir beden pozu kestirim yöntemini eleuyarlıyoruz, ve aynı yöntemi geliştirerek el şeklini bir defada tanıyan bir yöntem öneriyoruz.Ayrıca çok katmanlı bir uzman karar ormanı ağı kullanarak başarım oranınıartıran veya hafıza kullanımını düşüren bir model öneriyor ve karşılaştırıyoruz.

In this thesis, we focus on the problem of modelling sequential data, and particularlyhand gestures. We approach the modelling problem using automata theory andtheory of formal languages, which allows us to determine the crucial aspects of handgestures. Furthermore, we show how this approach can help us assess the capabilitiesof candidate models. The resulting framework can identify problems of models, and setrequirements for models to properly represent the gestures. We use this approach to examinecommon graphical models such as hidden Markov models (HMM), input-outputHMMs, explicit duration models, hidden conditional random fields, and hidden semiMarkov models (HSMM). We also devise an efficient variant of HSMMs that conformsto all of the requirements set by our previous analysis. We further show that mixturesof left-right models is the most suitable setting for gestures. Finally, we compare allthe mentioned models and report the results. In the second part of the thesis, we focuson modelling hand shape with randomized decision forests (RDF). In particular, weextend a known body pose estimation method to hand pose, and then introduce a novelRDF that directly estimates the hand shape. Furthermore, we propose a multi-layeredexpert network consisting of RDFs that either considerably increases the accuracy, orreduces memory requirements without sacrificing accuracy.

URI

https://acikbilim.yok.gov.tr/handle/20.500.12812/72884

Collections

TEZLER

Except where otherwise noted, this item's license is described as info:eu-repo/semantics/openAccess