A gaze-centered multimodal approach to face-to-face interaction

Arslan Aydin, Ülkü

dc.contributor.advisor	Acartürk, Cengiz
dc.contributor.advisor	Kalkan, Sinan
dc.contributor.author	Arslan Aydin, Ülkü
dc.date.accessioned	2020-12-10T09:11:04Z
dc.date.available	2020-12-10T09:11:04Z
dc.date.submitted	2020
dc.date.issued	2020-02-25
dc.identifier.uri	https://acikbilim.yok.gov.tr/handle/20.500.12812/224761
dc.description.abstract	Yüz yüze iletişim, doğası gereği, etkileşimin, hem sözsel hem de sözsel olmayan sinyallerini içeren çok modlu bir yaklaşımla karakterize edilmesini gerektirir. Bakış, iletişim sürecinde, sosyal hedeflere ulaşmada kilit rol oynayan sözsüz bir ipucudur. Bu çalışmanın amacı iki yönlüdür: (i) bakış davranışını (göz kaçırma ve yüze bakma) yüz yüze iletişimdeki bakış ve konuşma arasındaki ilişkilerle incelemek, (ii) bakış davranışlarını tahmin etmek için, üst seviye konuşma özellikleri kullanan hesaplamalı modeler oluşturmak. Çiftlerin (mülakatı yapan bir profesyonel ve iş başvurusu yapan aday) sahte iş görüşmeleri yaptığı iş görüşmeleri ayarladık. Deneyde anadil konuşanlarından oluşan 28 çift yer aldı. İki göz izleme gözlüğü, çevredeki görüntü, ses ve katılımcıların baktıkları pozisyonları kaydetti. İlk amaca yönelik olarak, yüz izlemede kullanılan görüntü, göz izleme cihazlarından bakış ve konuşma segmentasyonunda kullanılan sesi içeren, çok modlu verilerin analizleri için MAGiC (İletişimde Bakışları Analiz Etmek için Çok Modlu Çerçeve) adlı açık kaynaklı bir çerçeve geliştirdik. Konuşmayı iki yöntemle etiketledik: (i) Diyalog Eylemi Etiketleme için ISO 24617-2 standardı ve (ii) sosyal bağlamda bakış davranışlarını inceleyen önceki çalışmalarda kullanılan etiketleri kullanma. Daha sonra her iki etiketleme yöntemini kullanarak iki CNN mimarisinin, VGGNet ve ResNet, basitleştirilmiş versiyonlarını eğittik.
dc.description.abstract	Face-to-face conversation implies that interaction should be characterized as an inherently multimodal phenomenon involving both verbal and nonverbal signals. Gaze is a nonverbal cue that plays a key role in achieving social goals during the course of conversation. The purpose of this study is twofold: (i) to examine gaze behavior (i.e., aversion and gaze on face) and relations between gaze and speech in face to face interaction, (ii) to construct computational models to predict gaze behavior using high-level speech features. We employed a job interview setting, where pairs (a professional interviewer and an interviewee) conducted mock job interviews. Twenty-eight pairs of native speakers took part in the experiment. Two eye-tracking glasses recorded the scene video, the audio and the eye gaze position of the participants. To achieve the first purpose, we developed an open-source framework, named MAGiC (A Multimodal Framework for Analyzing Gaze in Communication), for the analyses of multimodal data including video recording data for face tracking, gaze data from the eye trackers, and the audio data for speech segmentation. We annotated speech with two methods: (i) ISO 24617-2 Standard for Dialogue Act Annotation and, (ii) using tags employed by the previous studies that examined gaze behavior in a social context. We then trained simplified versions of two CNN architectures (VGGNet and ResNet) by using both speech annotation methods.	en_US
dc.language	English
dc.language.iso	en
dc.rights	info:eu-repo/semantics/openAccess
dc.rights	Attribution 4.0 United States	tr_TR
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject	Bilgisayar Mühendisliği Bilimleri-Bilgisayar ve Kontrol	tr_TR
dc.subject	Computer Engineering and Computer Science and Control	en_US
dc.subject	Mühendislik Bilimleri	tr_TR
dc.subject	Engineering Sciences	en_US
dc.title	A gaze-centered multimodal approach to face-to-face interaction
dc.title.alternative	Yüz yüze iletişime bakış merkezli çok modlu yaklaşım
dc.type	doctoralThesis
dc.date.updated	2020-02-25
dc.contributor.department	Bilişsel Bilim Anabilim Dalı
dc.subject.ytm	Face to face communication
dc.subject.ytm	Eye tracking
dc.subject.ytm	Speech labeling
dc.subject.ytm	Computational methods
dc.subject.ytm	Multivariate time series
dc.identifier.yokid	10322091
dc.publisher.institute	Enformatik Enstitüsü
dc.publisher.university	ORTA DOĞU TEKNİK ÜNİVERSİTESİ
dc.identifier.thesisid	611880
dc.description.pages	185
dc.publisher.discipline	Diğer

Files in this item

Name:: yokAcikBilim_10322091.pdf
Size:: 5.334Mb
Format:: PDF
Description:: File_10322091

View/Open

This item appears in the following Collection(s)

TEZLER

Show simple item record

Except where otherwise noted, this item's license is described as info:eu-repo/semantics/openAccess