scholarly journals Sentiment Analysis of Short Russian Texts Using BERT and Word2Vec Embeddings

Author(s):  
Ekaterina Popova ◽  
Vladimir Spitsyn

This article is devoted to modern approaches for sentiment analysis of short Russian texts from social networks using deep neural networks. Sentiment analysis is the process of detecting, extracting, and classifying opinions, sentiments, and attitudes concerning different topics expressed in texts. The importance of this topic is linked to the growth and popularity of social networks, online recommendation services, news portals, and blogs, all of which contain a significant number of people's opinions on a variety of topics. In this paper, we propose machine-learning techniques with BERT and Word2Vec embeddings for tweets sentiment analysis. Two approaches were explored: (a) a method, of word embeddings extraction and using the DNN classifier; (b) refinement of the pre-trained BERT model. As a result, the fine- tuning BERT outperformed the functional method to solving the problem.

2017 ◽  
Vol 1 (3) ◽  
pp. 83 ◽  
Author(s):  
Chandrasegar Thirumalai ◽  
Ravisankar Koppuravuri

In this paper, we will use deep neural networks for predicting the bike sharing usage based on previous years usage data. We will use because deep neural nets for getting higher accuracy. Deep neural nets are quite different from other machine learning techniques; here we can add many numbers of hidden layers to improve the accuracy of our prediction and the model can be trained in the way we want such that we can achieve the results we want. Nowadays many AI experts will say that deep learning is the best AI technique available now and we can achieve some unbelievable results using this technique. Now we will use that technique to predict bike sharing usage of a rental company to make sure they can take good business decisions based on previous years data.


Proceedings ◽  
2020 ◽  
Vol 54 (1) ◽  
pp. 8
Author(s):  
Julio J. Estévez-Pereira ◽  
Diego Fernández ◽  
Francisco J. Novoa

While traditional network security methods have been proven useful until now, the flexibility of machine learning techniques makes them a solid candidate in the current scene of our networks. In this paper, we assess how well the latter are capable of detecting security threats in a corporative network. To that end, we configure and compare several models to find the one which fits better with our needs. Furthermore, we distribute the computational load and storage so we can handle extensive volumes of data. The algorithms that we use to create our models, Random Forest, Naive Bayes, and Deep Neural Networks (DNN), are both divergent and tested in other papers in order to make our comparison richer. For the distribution phase, we operate with Apache Structured Streaming, PySpark, and MLlib. As for the results, it is relevant to mention that our dataset has been found to be effectively modelable with just a reduced number of features. Finally, given the outcomes obtained, we find this line of research encouraging and, therefore, this approach worth pursuing.


2018 ◽  
Vol 7 (2.32) ◽  
pp. 473
Author(s):  
Dorababu Sudarsa ◽  
Siva Kumar.P ◽  
L Jagajeevan Rao

The tremendous of the overall enormous net has conveyed a present day way of communicating the feelings of individuals. It's additionally a medium with a vast amount of data in which clients can see the assessment of different clients which can be ordered into exceptional entailment summons and are progressively more boom as a key component in decision making. This paper adds to the supposition assessment for customers assessment class that is utilized to analyze the records inside the type of the assortment of tweets wherein investigates are very unstructured and are both high fine or terrible, or somewhere in the middle of these . For this we first pre-prepared the dataset, after that extract the adjective from the dataset that has a couple of significance this is alluded to as capacity vector, at that point decided on the component vector posting and from that point accomplished device examining based write calculations particularly navie bayes, most entropy and svm along the edge of the semantic introduction based absolutely based on word net which extracts synonyms and similarity for the content characteristic. In the end, we measured the performance of the classifier in terms of considering, precision and accuracy. 


Author(s):  
Patrick Krauss ◽  
Claus Metzner ◽  
Nidhi Joshi ◽  
Holger Schulze ◽  
Maximilian Traxdorf ◽  
...  

AbstractAutomatic sleep stage scoring based on deep neural networks has come into focus of sleep researchers and physicians, as a reliable method able to objectively classify sleep stages, would save human resources and thus would simplify clinical routines. Due to novel open-source software libraries for Machine Learning in combination with enormous progress in hardware development in recent years a paradigm shift in the field of sleep research towards automatic diagnostics could be observed. We argue that modern Machine Learning techniques are not just a tool to perform automatic sleep stage classification but are also a creative approach to find hidden properties of sleep physiology. We have already developed and established algorithms to visualize and cluster EEG data, in a way so that we can already make first assessments on sleep health in terms of sleep-apnea and consequently daytime vigilance. In the following study, we further developed our method by the innovative approach to analyze cortical activity during sleep by computing vectorial cross-correlations of different EEG channels represented by hypnodensity graphs. We can show that this measure serves to estimate the period length of sleep cycles and thus can help to find disturbances due to pathological conditions.


Author(s):  
Daniel Röchert ◽  
German Neubaum ◽  
Stefan Stieglitz

Abstract Since social media have increasingly become forums to exchange personal opinions, more and more approaches have been suggested to analyze those sentiments automatically. Neural networks and traditional machine learning methods allow individual adaption by training the data, tailoring the algorithm to the particular topic that is discussed. Still, a great number of methodological combinations involving algorithms (e.g., recurrent neural networks (RNN)), techniques (e.g., word2vec), and methods (e.g., Skip-Gram) are possible. This work offers a systematic comparison of sentiment analytical approaches using different word embeddings with RNN architectures and traditional machine learning techniques. Using German comments of controversial political discussions on YouTube, this study uses metrics such as F1-score, precision and recall to compare the quality of performance of different approaches. First results show that deep neural networks outperform multiclass prediction with small datasets in contrast to traditional machine learning models with word embeddings.


2018 ◽  
Vol 34 (3) ◽  
pp. 569-581 ◽  
Author(s):  
Sujata Rani ◽  
Parteek Kumar

Abstract In this article, an innovative approach to perform the sentiment analysis (SA) has been presented. The proposed system handles the issues of Romanized or abbreviated text and spelling variations in the text to perform the sentiment analysis. The training data set of 3,000 movie reviews and tweets has been manually labeled by native speakers of Hindi in three classes, i.e. positive, negative, and neutral. The system uses WEKA (Waikato Environment for Knowledge Analysis) tool to convert these string data into numerical matrices and applies three machine learning techniques, i.e. Naive Bayes (NB), J48, and support vector machine (SVM). The proposed system has been tested on 100 movie reviews and tweets, and it has been observed that SVM has performed best in comparison to other classifiers, and it has an accuracy of 68% for movie reviews and 82% in case of tweets. The results of the proposed system are very promising and can be used in emerging applications like SA of product reviews and social media analysis. Additionally, the proposed system can be used in other cultural/social benefits like predicting/fighting human riots.


2018 ◽  
Vol 7 (2.32) ◽  
pp. 462
Author(s):  
G Krishna Chaitanya ◽  
Dinesh Reddy Meka ◽  
Vakalapudi Surya Vamsi ◽  
M V S Ravi Karthik

Sentiment or emotion behind a tweet from Twitter or a post from Facebook can help us answer what opinions or feedback a person has. With the advent of growing user-generated blogs, posts and reviews across various social media and online retails, calls for an understanding of these afore mentioned user data acts as a catalyst in building Recommender systems and drive business plans. User reviews on online retail stores influence buying behavior of customers and thus complements the ever-growing need of sentiment analysis. Machine Learning helps us to read between the lines of tweets by proving us with various algorithms like Naïve Bayes, SVM, etc. Sentiment Analysis uses Machine Learning and Natural Language Processing (NLP) to extract, classify and analyze tweets for sentiments (emotions). There are various packages and frameworks in R and Python that aid in Sentiment Analysis or Text Mining in general. 


Sign in / Sign up

Export Citation Format

Share Document