Improved probabilistic matrix factorization model for sparse datasets

Ar, Yilmaz

dc.contributor.advisor	Taşkaya Temizel, Tuğba
dc.contributor.author	Ar, Yilmaz
dc.date.accessioned	2020-12-10T09:13:35Z
dc.date.available	2020-12-10T09:13:35Z
dc.date.submitted	2014
dc.date.issued	2018-08-06
dc.identifier.uri	https://acikbilim.yok.gov.tr/handle/20.500.12812/225356
dc.description.abstract	Dünya çapındaki ağ üzerindeki bilgi miktarı, ağ ve bilgi teknolojilerindeki ilerlemeler nedeniyle önemli ölçüde artmıştır. Bu durum kullanıcılar için ilgili ve yararlı bilgiler elde etmeyi zor hale getirmiştir ve bu nedenle bilgi filtreleme ihtiyacı oluşmuştur. Öneri Sistemleri (ÖS) bu probleme bir çözüm olarak ortaya çıkmıştır. Yaygın olarak kullanılan ÖS yaklaşımlarından biri olan Ortak Filtreleme (OF), kullanıcıların bir ürün üzerindeki tercihini tahmin etmeyi amaçlamaktadır. OF ardındaki ana fikir, geçmişte aynı fikirde olan kullanıcıların, gelecekte de aynı fikirde olacaklarıdır. Bir OF tekniği olarak Olasılıksal Matris Çarpanlarına Ayrışımı (OMÇA) genellikle yüksek doğruluk ve ölçeklenebilirlik nedeniyle literatürde tercih edilmektedir. Bu tezde, OMÇA metodunda yer alan kullanıcı ve ürün gizli vektörlerin başlatma tekniklerinin önemi gerçek ve sentetik veri kümeleri ile gösterilerek yeni beş başlatma tekniği önerilmektedir. Önerilen yaklaşımlar literatürdeki diğer başlatma teknikleri ile karşılaştırıldığında çok seyrek veri setleri için daha iyi sonuçlar üretmektedir.
dc.description.abstract	The amount of information on the World Wide Web has increased significantly owing to advancing web and information technologies. This has made it difficult for users to obtain relevant and useful information thus there is a need for information filtering. Recommender Systems (RS) have emerged as a technique to overcome the problem. Collaborative Filtering (CF) that is one of the widely used RS approaches aims to predict users' preference concerning an item. The main idea behind CF is the users who agreed in the past will agree in the future. The Probabilistic Matrix Factorization (PMF) is the preferred CF technique in the literature due to its high accuracy and scalability. This thesis demonstrates the importance of the initialization techniques for the user and the item latent vectors in the PMF algorithm with real and synthetic datasets and proposes five different initialization techniques. The suggested approaches produce better results in comparison with the state-of-the-art techniques in particularly very sparse datasets.	en_US
dc.language	English
dc.language.iso	en
dc.rights	info:eu-repo/semantics/openAccess
dc.rights	Attribution 4.0 United States	tr_TR
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject	Bilgisayar Mühendisliği Bilimleri-Bilgisayar ve Kontrol	tr_TR
dc.subject	Computer Engineering and Computer Science and Control	en_US
dc.title	Improved probabilistic matrix factorization model for sparse datasets
dc.title.alternative	Seyrek veri kümeleri için iyileştirilmiş olasılıksal matris çarpanlarına ayrışımı modeli
dc.type	doctoralThesis
dc.date.updated	2018-08-06
dc.contributor.department	Diğer
dc.identifier.yokid	10056432
dc.publisher.institute	Enformatik Enstitüsü
dc.publisher.university	ORTA DOĞU TEKNİK ÜNİVERSİTESİ
dc.identifier.thesisid	379861
dc.description.pages	101
dc.publisher.discipline	Diğer

Files in this item

Name:: yokAcikBilim_10056432.pdf
Size:: 1.109Mb
Format:: PDF
Description:: File_10056432

View/Open

This item appears in the following Collection(s)

TEZLER

Show simple item record

Except where otherwise noted, this item's license is described as info:eu-repo/semantics/openAccess