Discovery of rating fraud with real-time streaming visual analytics

Author(s):  
Kodzo Webga ◽  
Aidong Lu
Keyword(s):  
Author(s):  
Bo Beth Sun ◽  
Eric Zielonka ◽  
Aleksandr Fritz ◽  
Matthew Schofield ◽  
Brennan Ringel ◽  
...  

Author(s):  
Dorys Quiroz ◽  
Byron Guanochanga ◽  
Walter Fuertes ◽  
Diego Benítez ◽  
Jenny Torres ◽  
...  

Data ◽  
2020 ◽  
Vol 5 (1) ◽  
pp. 20
Author(s):  
Amir Haghighati ◽  
Kamran Sedig

Through social media platforms, massive amounts of data are being produced. As a microblogging social media platform, Twitter enables its users to post short updates as “tweets” on an unprecedented scale. Once analyzed using machine learning (ML) techniques and in aggregate, Twitter data can be an invaluable resource for gaining insight into different domains of discussion and public opinion. However, when applied to real-time data streams, due to covariate shifts in the data (i.e., changes in the distributions of the inputs of ML algorithms), existing ML approaches result in different types of biases and provide uncertain outputs. In this paper, we describe VARTTA (Visual Analytics for Real-Time Twitter datA), a visual analytics system that combines data visualizations, human-data interaction, and ML algorithms to help users monitor, analyze, and make sense of the streams of tweets in a real-time manner. As a case study, we demonstrate the use of VARTTA in political discussions. VARTTA not only provides users with powerful analytical tools, but also enables them to diagnose and to heuristically suggest fixes for the errors in the outcome, resulting in a more detailed understanding of the tweets. Finally, we outline several issues to be considered while designing other similar visual analytics systems.


Sign in / Sign up

Export Citation Format

Share Document