Pemilihan Metode dan Algoritma dalam Analisis Sentimen di Media Sosial : Sistematic Literature Review

This article use Systematic Literature Review (SLR) to classify sentiment analysis based on case studies, methods, social media, and platforms. The coding stage is divided into three stages, namely the open, selective and axial coding. The literature study on sentiment analysis is divided into two parts: identifying gaps based on case studies and data sources and identifying gaps based on the methods or algorithms used. The gap identification results based on case studies and data sources show that popular review topics are synonymous with entertainment, economic and political content. Therefore, the quantity of research with review topics related to the implementation of education, the dynamics of the bureaucracy, health facilities and services, and non-governmental organization’s activities need to be increased. Meanwhile, the most dominant platforms used as data sources are website and mobile-based applications. The results of the gap identification based on the method and algorithm show that the quantity of research with the Naive Bayes Classifier (NBC) and Support Vector Machine (SVM) method or algorithm is more dominant than the k-Nearest Neighbor (k-NN) and Lexicon-based. Thus, it is necessary to increase the number of other classification methods such as Particle Swarm Optimization, BM25, Decision Tree, K-Means, and Neural Networks.

Download Full-text

Aspect Term Extraction for Aspect Based Opinion Mining

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.k2050.0981119 ◽

2019 ◽

Vol 8 (11) ◽

pp. 2228-2233

Keyword(s):

Support Vector Machine ◽

Sentiment Analysis ◽

Random Fields ◽

Opinion Mining ◽

Nearest Neighbor ◽

Conditional Random Fields ◽

International Workshop ◽

Support Vector ◽

K Nearest Neighbor ◽

Term Extraction

Opinion Mining (OM) is also called as Sentiment Analysis (SA). Aspect Based Opinion Mining (ABOM) is also called as Aspect Based Sentiment Analysis (ABSA). In this paper, three new features are proposed to extract the aspect term for Aspect Based Sentiment Analysis (ABSA). The influence of the proposed features is evaluated on five classifiers namely Decision Tree (DT), Naive Bayes (NB), K-Nearest Neighbor (KNN), Support Vector Machine (SVM) and Conditional Random Fields (CRF). The proposed features are evaluated on the Two datasets on Restaurant and Laptop domains available in International Workshop on Semantic Evaluation 2014 i.e. SemEval 2014. The influence of proposed features is evaluated using Precision, Recall and F1 measures. The proposed features are highly influencing for aspect term extraction on classifiers. The performance of SVM and CRF classifiers with proposed features is more influencing for aspect term extraction compared with NB, DT and KNN classifiers.

Download Full-text

Twitter sentiment analysis of the relocation of Indonesia's capital city

Bulletin of Electrical Engineering and Informatics ◽

10.11591/eei.v9i4.2352 ◽

2020 ◽

Vol 9 (4) ◽

pp. 1620-1630

Author(s):

Edi Sutoyo ◽

Ahmad Almaarif

Keyword(s):

Support Vector Machine ◽

Sentiment Analysis ◽

Nearest Neighbor ◽

Point Of View ◽

Capital City ◽

Support Vector ◽

K Nearest Neighbor ◽

Evaluation Algorithm ◽

The Government ◽

The Many

Indonesia has a capital city which is one of the many big cities in the world called Jakarta. Jakarta's role in the dynamics that occur in Indonesia is very central because it functions as a political and government center, and is a business and economic center that drives the economy. Recently the discourse of the government to relocate the capital city has invited various reactions from the community. Therefore, in this study, sentiment analysis of the relocation of the capital city was carried out. The analysis was performed by doing a classification to describe the public sentiment sourced from twitter data, the data is classified into 2 classes, namely positive and negative sentiments. The algorithms used in this study include Naïve Bayes classifier, logistic regression, support vector machine, and K-nearest neighbor. The results of the performance evaluation algorithm showed that support vector machine outperformed as compared to 3 algorithms with the results of Accuracy, Precision, Recall, and F-measure are 97.72%, 96.01%, 99.18%, and 97.57%, respectively. Sentiment analysis of the discourse of relocation of the capital city is expected to provide an overview to the government of public opinion from the point of view of data coming from social media.

Download Full-text

Evaluating Annotated Dataset of Customer Reviews for Aspect Based Sentiment Analysis

Journal of Web Engineering ◽

10.13052/jwe1540-9589.2122 ◽

2021 ◽

Author(s):

Dimple Chehal ◽

Parul Gupta ◽

Payal Gulati

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Sentiment Analysis ◽

Nearest Neighbor ◽

Supervised Machine Learning ◽

Support Vector ◽

Product Reviews ◽

K Nearest Neighbor ◽

Customer Reviews ◽

Percent Accuracy

Sentiment analysis of product reviews on e-commerce platforms aids in determining the preferences of customers. Aspect-based sentiment analysis (ABSA) assists in identifying the contributing aspects and their corresponding polarity, thereby allowing for a more detailed analysis of the customer’s inclination toward product aspects. This analysis helps in the transition from the traditional rating-based recommendation process to an improved aspect-based process. To automate ABSA, a labelled dataset is required to train a supervised machine learning model. As the availability of such dataset is limited due to the involvement of human efforts, an annotated dataset has been provided here for performing ABSA on customer reviews of mobile phones. The dataset comprising of product reviews of Apple-iPhone11 has been manually annotated with predefined aspect categories and aspect sentiments. The dataset’s accuracy has been validated using state-of-the-art machine learning techniques such as Naïve Bayes, Support Vector Machine, Logistic Regression, Random Forest, K-Nearest Neighbor and Multi Layer Perceptron, a sequential model built with Keras API. The MLP model built through Keras Sequential API for classifying review text into aspect categories produced the most accurate result with 67.45 percent accuracy. K- nearest neighbor performed the worst with only 49.92 percent accuracy. The Support Vector Machine had the highest accuracy for classifying review text into aspect sentiments with an accuracy of 79.46 percent. The model built with Keras API had the lowest 76.30 percent accuracy. The contribution is beneficial as a benchmark dataset for ABSA of mobile phone reviews.

Download Full-text

Ensemble of Classifiers and Term Weighting Schemes for Sentiment Analysis in Turkish

10.52460/src.2021.004 ◽

2021 ◽

Vol 1 (1) ◽

pp. 1-12

Author(s):

Aytuğ Onan ◽

Keyword(s):

Sentiment Analysis ◽

Language Processing ◽

Nearest Neighbor ◽

Text Messages ◽

Support Vector ◽

K Nearest Neighbor ◽

Term Weighting ◽

Text Documents ◽

Weighting Schemes ◽

Short Text

With the advancement of information and communication technology, social networking and microblogging sites have become a vital source of information. Individuals can express their opinions, grievances, feelings, and attitudes about a variety of topics. Through microblogging platforms, they can express their opinions on current events and products. Sentiment analysis is a significant area of research in natural language processing because it aims to define the orientation of the sentiment contained in source materials. Twitter is one of the most popular microblogging sites on the internet, with millions of users daily publishing over one hundred million text messages (referred to as tweets). Choosing an appropriate term representation scheme for short text messages is critical. Term weighting schemes are critical representation schemes for text documents in the vector space model. We present a comprehensive analysis of Turkish sentiment analysis using nine supervised and unsupervised term weighting schemes in this paper. The predictive efficiency of term weighting schemes is investigated using four supervised learning algorithms (Naive Bayes, support vector machines, the k-nearest neighbor algorithm, and logistic regression) and three ensemble learning methods (AdaBoost, Bagging, and Random Subspace). The empirical evidence suggests that supervised term weighting models can outperform unsupervised term weighting models.

Download Full-text

Product Review Based Customer Sentiment Analysis using an Ensemble of mRMR and Forest Optimization Algorithm (FOA)

International Journal of Applied Metaheuristic Computing ◽

10.4018/ijamc.2022010107 ◽

2022 ◽

Vol 13 (1) ◽

pp. 0-0

Keyword(s):

Feature Selection ◽

Sentiment Analysis ◽

Optimization Algorithm ◽

Nearest Neighbor ◽

Hybrid Approach ◽

Support Vector ◽

K Nearest Neighbor ◽

Feature Selection Technique ◽

Feature Selection Problem

This research presents a way of feature selection problem for classification of sentiments that use ensemble-based classifier. This includes a hybrid approach of minimum redundancy and maximum relevance (mRMR) technique and Forest Optimization Algorithm (FOA) (i.e. mRMR-FOA) based feature selection. Before applying the FOA on sentiment analysis, it has been used as feature selection technique applied on 10 different classification datasets publically available on UCI machine learning repository. The classifiers for example k-Nearest Neighbor (k-NN), Support Vector Machine (SVM) and Naïve Bayes used the ensemble based algorithm for available datasets. The mRMR-FOA uses the Blitzer’s dataset (customer reviews on electronic products survey) to select the significant features. The classification of sentiments has noticed to improve by 12 to 18%. The evaluated results are further enhanced by the ensemble of k-NN, NB and SVM with an accuracy of 88.47% for the classification of sentiment analysis task.

Download Full-text

Sentiment Analysis of Law Enforcement Performance Using Support Vector Machine and K-Nearest Neighbor

2018 3rd International Conference on Information Technology, Information System and Electrical Engineering (ICITISEE) ◽

10.1109/icitisee.2018.8720969 ◽

2018 ◽

Cited By ~ 1

Author(s):

Sean Semuel Istia ◽

Hindriyanto Dwi Purnomo

Keyword(s):

Support Vector Machine ◽

Law Enforcement ◽

Sentiment Analysis ◽

Nearest Neighbor ◽

Support Vector ◽

K Nearest Neighbor

Download Full-text

Implementation of machine learning to increase productivity in the manufacturing industry: a literature review.

Operations Excellence Journal of Applied Industrial Engineering ◽

10.22441/oe.2021.v13.i2.026 ◽

2021 ◽

Vol 13 (2) ◽

pp. 267

Author(s):

Yulio Agefa Purmala

Keyword(s):

Artificial Intelligence ◽

Machine Learning ◽

Literature Review ◽

Manufacturing Industry ◽

Nearest Neighbor ◽

Support Vector ◽

K Nearest Neighbor ◽

Increase Productivity ◽

And Performance ◽

Goals And Objectives

Industry 4.0 is currently developing quite rapidly, one of the technologies that is currently very popular in the industry is artificial intelligence, where an event can be diagnosed and predicted more quickly and accurately. One of the branches of artificial intelligence that can do this is Machine Learning, and its application can now be found in daily activities. In the manufacturing industry, the application of Machine Learning is one of them is to increase productivity through the results of analysis and predictions given based on the experience gained. This study uses a systematic literature review method, in which several articles were collected from several journal databases such as Elsevier, IEE, Springer, Taylor & Francis and ACM, with the publication period of the articles from 2015 to 2020. A total of 100 articles were collected, then re-validated. suitability based on the main goals and objectives of the research. There were 36 articles that were validated and used as a reference for a more in-depth review and analysis of their boundaries, so that there was a gap for further research. In this literature review study, its application is very helpful in making decisions in improving the quality, efficiency, and performance of companies in the manufacturing industry. The most popular algorithms used in this study include random forest, support vector machine, neural network, linear regression, and k-nearest neighbor. Finally, in this study it was found that the application of Machine Learning in diagnosing or predicting an event is suggested by modeling more than one algorithm to find and determine which algorithm is the most accurate and suitable to be applied to the phenomenon that occurs.

Download Full-text

Supervised Machine Learning Algorithms for Sentiment Analysis of Bangla Newspaper

International Journal of Innovative Computing ◽

10.11113/ijic.v11n2.321 ◽

2021 ◽

Vol 11 (2) ◽

pp. 15-23

Author(s):

Sabrina Jahan Maisha ◽

Nuren Nafisa ◽

Abdul Kadar Muhammad Masum

Keyword(s):

Machine Learning ◽

Sentiment Analysis ◽

Language Processing ◽

Nearest Neighbor ◽

Online News ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Support Vector ◽

K Nearest Neighbor ◽

Aged People

We can state undoubtedly that Bangla language is rich enough to work with and implement various Natural Language Processing (NLP) tasks. Though it needs proper attention, hardly NLP field has been explored with it. In this age of digitalization, large amount of Bangla news contents are generated in online platforms. Some of the contents are inappropriate for the children or aged people. With the motivation to filter out news contents easily, the aim of this work is to perform document level sentiment analysis (SA) on Bangla online news. In this respect, the dataset is created by collecting news from online Bangla newspaper archive. Further, the documents are manually annotated into positive and negative classes. Composite process technique of “Pipeline” class including Count Vectorizer, transformer (TF-IDF) and machine learning (ML) classifiers are employed to extract features and to train the dataset. Six supervised ML classifiers (i.e. Multinomial Naive Bayes (MNB), K-Nearest Neighbor (K-NN), Random Forest (RF), (C4.5) Decision Tree (DT), Logistic Regression (LR) and Linear Support Vector Machine (LSVM)) are used to analyze the best classifier for the proposed model. There has been very few works on SA of Bangla news. So, this work is a small attempt to contribute in this field. This model showed remarkable efficiency through better results in both the validation process of percentage split method and 10-fold cross validation. Among all six classifiers, RF has outperformed others by 99% accuracy. Even though LSVM has shown lowest accuracy of 80%, it is also considered as good output. However, this work has also exhibited surpassing outcome for recent and critical Bangla news indicating proper feature extraction to build up the model.

Download Full-text

Predicting Risk in Sentiment Analysis using Machine Learning

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.a9540.109119 ◽

2019 ◽

Vol 9 (1) ◽

pp. 455-460

Keyword(s):

Sentiment Analysis ◽

Nearest Neighbor ◽

Confusion Matrix ◽

The Other ◽

Risk Level ◽

Support Vector ◽

Risk Modeling ◽

K Nearest Neighbor ◽

Law Enforcement Agencies ◽

Prism Model

The purpose of this research is to do risk modeling after a sentiment analysis of Twitter posts based on a particular or certain sentiment with the help of the PRISM model .The model is named PRISM as the results obtained are an amalgamation of seven different attributes used in the research for comparison and tabulation of quantitative scores. These attributes are Accuracy, Precision, Recall, F1-Score, Support, Confusion Matrix, and Tweets. PRISM model can serve the law enforcement agencies in many ways and help them maintain peace, law and order in society as it is a proactive model. The sub-modules which are part of the PRISM model help to give quantitative values to predict the risk level on the sentiment of interest. After analysis of obtained testing results, it is observed that Support Vector Machine gives better results in accuracy, precision, F1-Score, Support and Recall as compared to the other three classifier models i.e. Naive Bayes, Decision Tree, and K nearest neighbor. It is also observed that with an increase or decrease in data, regarding the number of tweets, the fluctuation in performance of SVM is most stable i.e. it shows the least deviation and variation. The other algorithms show a considerable deviation in their performance.

Download Full-text

Optimizing Error Rate in Intrusion Detection System Using Artificial Neural Network Algorithm

International Journal of Emerging Research in Management and Technology ◽

10.23956/ijermt.v6i9.102 ◽

2018 ◽

Vol 6 (9) ◽

pp. 152

Author(s):

S. Vijaya Rani ◽

G. N. K. Suresh Babu

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Intrusion Detection ◽

Error Rate ◽

Learning Process ◽

Nearest Neighbor ◽

Detection System ◽

Support Vector ◽

K Nearest Neighbor ◽

Artificial Neural

The illegal hackers penetrate the servers and networks of corporate and financial institutions to gain money and extract vital information. The hacking varies from one computing system to many system. They gain access by sending malicious packets in the network through virus, worms, Trojan horses etc. The hackers scan a network through various tools and collect information of network and host. Hence it is very much essential to detect the attacks as they enter into a network. The methods available for intrusion detection are Naive Bayes, Decision tree, Support Vector Machine, K-Nearest Neighbor, Artificial Neural Networks. A neural network consists of processing units in complex manner and able to store information and make it functional for use. It acts like human brain and takes knowledge from the environment through training and learning process. Many algorithms are available for learning process This work carry out research on analysis of malicious packets and predicting the error rate in detection of injured packets through artificial neural network algorithms.

Download Full-text