Lexicon based opinion mining on twitter data by using hadoop

Alksso, Mohammed Raaed Mahmood

dc.contributor.advisor	Görür, Abdül Kadir
dc.contributor.author	Alksso, Mohammed Raaed Mahmood
dc.date.accessioned	2020-12-04T11:19:21Z
dc.date.available	2020-12-04T11:19:21Z
dc.date.submitted	2017
dc.date.issued	2018-08-06
dc.identifier.uri	https://acikbilim.yok.gov.tr/handle/20.500.12812/77965
dc.description.abstract	Bu tezde, Hadoop tarafından Sanal Makine ile makine öğrenme metodolojilerini kullanarak elde edilen varsayımların bulgularını vurgulayacağız. Pratik kurulum, belirli kelimeleri kullanarak twitler incelemek ve bulmak için deneyler gerçekleştirmek üzere başlatıldı ve bu twitler sadece belirli bir alan içerisinde toplanacak ve veriler Hadoop'ta saklanacaktır. Ardından, ön işleme işlemleri gibi eğitim verileri, gerekli olmayan her şeyi kaldırıp, özellikleri ayıklamaktır. Bundan sonra, twit microblog'un metinlerini analiz etme yeteneğiyle makine öğrenme algoritmaları (gözetim altında ve denetlenmemiş) kullanarak twitlerin sınıflandırılması, farklı türdeki sözlüğün duygularını algılamaktır. Ayrıca Mahout'daki kümesi, aynı kutupta veri toplayıp olumlu ifade edilen veya en iyi hizmetin ne olduğunu bilmek için kullanılmıştır. Sonunda, sınıflamanın doğruluğuna dayanarak elde edilen başarılı sonuçlardan toplanan hedefleri kanıtlıyoruz.
dc.description.abstract	In this thesis, we present findings obtained by applying machine learning methods with Hadoop running on a virtual machine. The experimental setup collects tweets that contain specific words, restricted to a specific domain, and stores the data in Hadoop. The training data is then pre-processed to remove everything that is not needed and to extract features. Next, the tweets are classified using machine learning algorithms (supervised and unsupervised) that analyse the text of tweets in order to detect emotions with different types of lexicon. Furthermore, clustering in Mahout was used to group data of the same polarity in order to identify which service or product was described most positively. Finally, we show that the objectives were met, based on the classification accuracy achieved.	en_US
dc.language	English
dc.language.iso	en
dc.rights	info:eu-repo/semantics/openAccess
dc.rights	Attribution 4.0 United States	tr_TR
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject	Bilim ve Teknoloji	tr_TR
dc.subject	Science and Technology	en_US
dc.title	Lexicon based opinion mining on twitter data by using hadoop
dc.title.alternative	Hadoop kullanarak twitter verileri üzerindeki görüş madenciliği tabanlı veri sözlüğü
dc.type	masterThesis
dc.date.updated	2018-08-06
dc.contributor.department	Matematik Anabilim Dalı
dc.identifier.yokid	10162252
dc.publisher.institute	Fen Bilimleri Enstitüsü
dc.publisher.university	ÇANKAYA ÜNİVERSİTESİ
dc.identifier.thesisid	495504
dc.description.pages	80
dc.publisher.discipline	Bilgi Teknolojileri Bilim Dalı

Files in this item

Name: yokAcikBilim_10162252.pdf
Size: 1.623Mb
Format: PDF
Description: File_10162252

This item appears in the following Collection(s)

TEZLER

Except where otherwise noted, this item's license is described as info:eu-repo/semantics/openAccess