scholarly journals Pemilihan Metode dan Algoritma dalam Analisis Sentimen di Media Sosial : Sistematic Literature Review

2021 ◽  
Vol 3 (2) ◽  
pp. 278-302
Author(s):  
Yerik Afrianto Singgalen

This article use Systematic Literature Review (SLR) to classify sentiment analysis based on case studies, methods, social media, and platforms. The coding stage is divided into three stages, namely the open, selective and axial coding. The literature study on sentiment analysis is divided into two parts: identifying gaps based on case studies and data sources and identifying gaps based on the methods or algorithms used. The gap identification results based on case studies and data sources show that popular review topics are synonymous with entertainment, economic and political content. Therefore, the quantity of research with review topics related to the implementation of education, the dynamics of the bureaucracy, health facilities and services, and non-governmental organization’s activities need to be increased. Meanwhile, the most dominant platforms used as data sources are website and mobile-based applications. The results of the gap identification based on the method and algorithm show that the quantity of research with the Naive Bayes Classifier (NBC) and Support Vector Machine (SVM) method or algorithm is more dominant than the k-Nearest Neighbor (k-NN) and Lexicon-based. Thus, it is necessary to increase the number of other classification methods such as Particle Swarm Optimization, BM25, Decision Tree, K-Means, and Neural Networks.

Opinion Mining (OM) is also called as Sentiment Analysis (SA). Aspect Based Opinion Mining (ABOM) is also called as Aspect Based Sentiment Analysis (ABSA). In this paper, three new features are proposed to extract the aspect term for Aspect Based Sentiment Analysis (ABSA). The influence of the proposed features is evaluated on five classifiers namely Decision Tree (DT), Naive Bayes (NB), K-Nearest Neighbor (KNN), Support Vector Machine (SVM) and Conditional Random Fields (CRF). The proposed features are evaluated on the Two datasets on Restaurant and Laptop domains available in International Workshop on Semantic Evaluation 2014 i.e. SemEval 2014. The influence of proposed features is evaluated using Precision, Recall and F1 measures. The proposed features are highly influencing for aspect term extraction on classifiers. The performance of SVM and CRF classifiers with proposed features is more influencing for aspect term extraction compared with NB, DT and KNN classifiers.


2020 ◽  
Vol 9 (4) ◽  
pp. 1620-1630
Author(s):  
Edi Sutoyo ◽  
Ahmad Almaarif

Indonesia has a capital city which is one of the many big cities in the world called Jakarta. Jakarta's role in the dynamics that occur in Indonesia is very central because it functions as a political and government center, and is a business and economic center that drives the economy. Recently the discourse of the government to relocate the capital city has invited various reactions from the community. Therefore, in this study, sentiment analysis of the relocation of the capital city was carried out. The analysis was performed by doing a classification to describe the public sentiment sourced from twitter data, the data is classified into 2 classes, namely positive and negative sentiments. The algorithms used in this study include Naïve Bayes classifier, logistic regression, support vector machine, and K-nearest neighbor. The results of the performance evaluation algorithm showed that support vector machine outperformed as compared to 3 algorithms with the results of Accuracy, Precision, Recall, and F-measure are 97.72%, 96.01%, 99.18%, and 97.57%, respectively. Sentiment analysis of the discourse of relocation of the capital city is expected to provide an overview to the government of public opinion from the point of view of data coming from social media. 


Author(s):  
Dimple Chehal ◽  
Parul Gupta ◽  
Payal Gulati

Sentiment analysis of product reviews on e-commerce platforms aids in determining the preferences of customers. Aspect-based sentiment analysis (ABSA) assists in identifying the contributing aspects and their corresponding polarity, thereby allowing for a more detailed analysis of the customer’s inclination toward product aspects. This analysis helps in the transition from the traditional rating-based recommendation process to an improved aspect-based process. To automate ABSA, a labelled dataset is required to train a supervised machine learning model. As the availability of such dataset is limited due to the involvement of human efforts, an annotated dataset has been provided here for performing ABSA on customer reviews of mobile phones. The dataset comprising of product reviews of Apple-iPhone11 has been manually annotated with predefined aspect categories and aspect sentiments. The dataset’s accuracy has been validated using state-of-the-art machine learning techniques such as Naïve Bayes, Support Vector Machine, Logistic Regression, Random Forest, K-Nearest Neighbor and Multi Layer Perceptron, a sequential model built with Keras API. The MLP model built through Keras Sequential API for classifying review text into aspect categories produced the most accurate result with 67.45 percent accuracy. K- nearest neighbor performed the worst with only 49.92 percent accuracy. The Support Vector Machine had the highest accuracy for classifying review text into aspect sentiments with an accuracy of 79.46 percent. The model built with Keras API had the lowest 76.30 percent accuracy. The contribution is beneficial as a benchmark dataset for ABSA of mobile phone reviews.


2021 ◽  
Vol 1 (1) ◽  
pp. 1-12
Author(s):  
Aytuğ Onan ◽  

With the advancement of information and communication technology, social networking and microblogging sites have become a vital source of information. Individuals can express their opinions, grievances, feelings, and attitudes about a variety of topics. Through microblogging platforms, they can express their opinions on current events and products. Sentiment analysis is a significant area of research in natural language processing because it aims to define the orientation of the sentiment contained in source materials. Twitter is one of the most popular microblogging sites on the internet, with millions of users daily publishing over one hundred million text messages (referred to as tweets). Choosing an appropriate term representation scheme for short text messages is critical. Term weighting schemes are critical representation schemes for text documents in the vector space model. We present a comprehensive analysis of Turkish sentiment analysis using nine supervised and unsupervised term weighting schemes in this paper. The predictive efficiency of term weighting schemes is investigated using four supervised learning algorithms (Naive Bayes, support vector machines, the k-nearest neighbor algorithm, and logistic regression) and three ensemble learning methods (AdaBoost, Bagging, and Random Subspace). The empirical evidence suggests that supervised term weighting models can outperform unsupervised term weighting models.


2022 ◽  
Vol 13 (1) ◽  
pp. 0-0

This research presents a way of feature selection problem for classification of sentiments that use ensemble-based classifier. This includes a hybrid approach of minimum redundancy and maximum relevance (mRMR) technique and Forest Optimization Algorithm (FOA) (i.e. mRMR-FOA) based feature selection. Before applying the FOA on sentiment analysis, it has been used as feature selection technique applied on 10 different classification datasets publically available on UCI machine learning repository. The classifiers for example k-Nearest Neighbor (k-NN), Support Vector Machine (SVM) and Naïve Bayes used the ensemble based algorithm for available datasets. The mRMR-FOA uses the Blitzer’s dataset (customer reviews on electronic products survey) to select the significant features. The classification of sentiments has noticed to improve by 12 to 18%. The evaluated results are further enhanced by the ensemble of k-NN, NB and SVM with an accuracy of 88.47% for the classification of sentiment analysis task.


Author(s):  
Yulio Agefa Purmala

Industry 4.0 is currently developing quite rapidly, one of the technologies that is currently very popular in the industry is artificial intelligence, where an event can be diagnosed and predicted more quickly and accurately. One of the branches of artificial intelligence that can do this is Machine Learning, and its application can now be found in daily activities. In the manufacturing industry, the application of Machine Learning is one of them is to increase productivity through the results of analysis and predictions given based on the experience gained. This study uses a systematic literature review method, in which several articles were collected from several journal databases such as Elsevier, IEE, Springer, Taylor & Francis and ACM, with the publication period of the articles from 2015 to 2020. A total of 100 articles were collected, then re-validated. suitability based on the main goals and objectives of the research. There were 36 articles that were validated and used as a reference for a more in-depth review and analysis of their boundaries, so that there was a gap for further research. In this literature review study, its application is very helpful in making decisions in improving the quality, efficiency, and performance of companies in the manufacturing industry. The most popular algorithms used in this study include random forest, support vector machine, neural network, linear regression, and k-nearest neighbor. Finally, in this study it was found that the application of Machine Learning in diagnosing or predicting an event is suggested by modeling more than one algorithm to find and determine which algorithm is the most accurate and suitable to be applied to the phenomenon that occurs.


2021 ◽  
Vol 11 (2) ◽  
pp. 15-23
Author(s):  
Sabrina Jahan Maisha ◽  
Nuren Nafisa ◽  
Abdul Kadar Muhammad Masum

We can state undoubtedly that Bangla language is rich enough to work with and implement various Natural Language Processing (NLP) tasks. Though it needs proper attention, hardly NLP field has been explored with it. In this age of digitalization, large amount of Bangla news contents are generated in online platforms. Some of the contents are inappropriate for the children or aged people. With the motivation to filter out news contents easily, the aim of this work is to perform document level sentiment analysis (SA) on Bangla online news. In this respect, the dataset is created by collecting news from online Bangla newspaper archive.  Further, the documents are manually annotated into positive and negative classes. Composite process technique of “Pipeline” class including Count Vectorizer, transformer (TF-IDF) and machine learning (ML) classifiers are employed to extract features and to train the dataset. Six supervised ML classifiers (i.e. Multinomial Naive Bayes (MNB), K-Nearest Neighbor (K-NN), Random Forest (RF), (C4.5) Decision Tree (DT), Logistic Regression (LR) and Linear Support Vector Machine (LSVM)) are used to analyze the best classifier for the proposed model. There has been very few works on SA of Bangla news. So, this work is a small attempt to contribute in this field. This model showed remarkable efficiency through better results in both the validation process of percentage split method and 10-fold cross validation. Among all six classifiers, RF has outperformed others by 99% accuracy. Even though LSVM has shown lowest accuracy of 80%, it is also considered as good output. However, this work has also exhibited surpassing outcome for recent and critical Bangla news indicating proper feature extraction to build up the model.


The purpose of this research is to do risk modeling after a sentiment analysis of Twitter posts based on a particular or certain sentiment with the help of the PRISM model .The model is named PRISM as the results obtained are an amalgamation of seven different attributes used in the research for comparison and tabulation of quantitative scores. These attributes are Accuracy, Precision, Recall, F1-Score, Support, Confusion Matrix, and Tweets. PRISM model can serve the law enforcement agencies in many ways and help them maintain peace, law and order in society as it is a proactive model. The sub-modules which are part of the PRISM model help to give quantitative values to predict the risk level on the sentiment of interest. After analysis of obtained testing results, it is observed that Support Vector Machine gives better results in accuracy, precision, F1-Score, Support and Recall as compared to the other three classifier models i.e. Naive Bayes, Decision Tree, and K nearest neighbor. It is also observed that with an increase or decrease in data, regarding the number of tweets, the fluctuation in performance of SVM is most stable i.e. it shows the least deviation and variation. The other algorithms show a considerable deviation in their performance.


Author(s):  
S. Vijaya Rani ◽  
G. N. K. Suresh Babu

The illegal hackers  penetrate the servers and networks of corporate and financial institutions to gain money and extract vital information. The hacking varies from one computing system to many system. They gain access by sending malicious packets in the network through virus, worms, Trojan horses etc. The hackers scan a network through various tools and collect information of network and host. Hence it is very much essential to detect the attacks as they enter into a network. The methods  available for intrusion detection are Naive Bayes, Decision tree, Support Vector Machine, K-Nearest Neighbor, Artificial Neural Networks. A neural network consists of processing units in complex manner and able to store information and make it functional for use. It acts like human brain and takes knowledge from the environment through training and learning process. Many algorithms are available for learning process This work carry out research on analysis of malicious packets and predicting the error rate in detection of injured packets through artificial neural network algorithms.


Sign in / Sign up

Export Citation Format

Share Document