Health, Psychosocial, and Social issues emanating from COVID-19 pandemic based on Social Media Comments using Natural Language Processing (Preprint)

Mapping Intimacies ◽

10.2196/preprints.22734 ◽

2020 ◽

Author(s):

Oladapo Oyebode ◽

Chinenye Ndulue ◽

Ashfaq Adib ◽

Dinesh Mulchandani ◽

Banuchitra Suruliraj ◽

...

Keyword(s):

Social Media ◽

Natural Language Processing ◽

Language Processing ◽

Social Issues ◽

Public Perceptions ◽

Psychosocial Issues ◽

Social Media Data ◽

Health Related ◽

The Impact ◽

Media Data

BACKGROUND The COVID-19 pandemic has caused a global health crisis that affects many aspects of human lives. In the absence of vaccines and antivirals, several behavioural change and policy initiatives, such as physical distancing, have been implemented to control the spread of the coronavirus. Social media data can reveal public perceptions of how governments and health agencies across the globe are handling the pandemic, as well as the impact of the disease on people regardless of their geographic locations, in line with various factors that hinder or facilitate efforts to control the spread of the pandemic globally. OBJECTIVE This paper aims to investigate the impact of the COVID-19 pandemic on people globally using social media data. METHODS We apply natural language processing (NLP) and thematic analysis to understand public opinions, experiences, and issues with respect to the COVID-19 pandemic using social media data. First, we collect over 47 million COVID-19-related comments from Twitter, Facebook, YouTube, and three online discussion forums. Second, we perform data preprocessing, which involves applying NLP techniques to clean and prepare the data for automated theme extraction. Third, we apply a context-aware NLP approach to extract meaningful keyphrases or themes from over 1 million randomly selected comments, compute sentiment scores for each theme, and assign a sentiment polarity (i.e., positive, negative, or neutral) based on the scores using a lexicon-based technique. Fourth, we categorize related themes into broader themes. RESULTS A total of 34 negative themes emerged, out of which 15 are health-related issues, psychosocial issues, and social issues related to the COVID-19 pandemic from the public perspective. Some of the health-related issues are increased mortality, health concerns, struggling health systems, and fitness issues, while some of the psychosocial issues include frustrations due to life disruptions, panic shopping, and expression of fear. Social issues include harassment, domestic violence, and wrong societal attitude. In addition, 20 positive themes emerged from our results. Some of the positive themes include public awareness, encouragement, gratitude, cleaner environment, online learning, charity, spiritual support, and innovative research. CONCLUSIONS We uncover various negative and positive themes representing public perceptions toward the COVID-19 pandemic and recommend interventions that can help address the health, psychosocial, and social issues based on the positive themes and other remedial ideas rooted in research. These interventions will help governments, health professionals and agencies, institutions, and individuals in their efforts to curb the spread of COVID-19 and minimize its impact, as well as in reacting to any future pandemics.
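The lexicon-based polarity step described in the METHODS above can be illustrated with a minimal sketch. It is not the authors' implementation: the toy `LEXICON`, the averaging rule, and the `threshold` value are all assumptions made for illustration, and a real study would use an established sentiment lexicon.

```python
import re

# Hypothetical toy lexicon (word -> sentiment weight); values are illustrative only.
LEXICON = {
    "fear": -1.0,
    "panic": -1.0,
    "death": -1.0,
    "grateful": 1.0,
    "hope": 1.0,
    "support": 0.5,
}


def sentiment_score(text, lexicon=LEXICON):
    """Average the lexicon weights of the words found in the text (0.0 if none match)."""
    tokens = re.findall(r"[a-z']+", text.lower())
    hits = [lexicon[t] for t in tokens if t in lexicon]
    return sum(hits) / len(hits) if hits else 0.0


def polarity(score, threshold=0.05):
    """Map a numeric score to positive, negative, or neutral (threshold is an assumption)."""
    if score > threshold:
        return "positive"
    if score < -threshold:
        return "negative"
    return "neutral"
```

For example, `polarity(sentiment_score("Panic shopping and fear everywhere"))` matches two negative lexicon entries and is labelled negative, while a comment with no lexicon words falls back to neutral.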


A Pipeline to Understand Emerging Illness Via Social Media Data Analysis: Case Study on Breast Implant Illness (Preprint)

10.2196/preprints.29768 ◽

2021 ◽

Author(s):

Vishal Dey ◽

Peter Krasniak ◽

Minh Nguyen ◽

Clara Lee ◽

Xia Ning

Keyword(s):

Mental Health ◽

Social Media ◽

Natural Language Processing ◽

Data Analysis ◽

Natural Language ◽

Language Processing ◽

Breast Implant ◽

Public Attention ◽

Social Media Data ◽

Media Data

BACKGROUND A new illness can come to public attention through social media before it is medically defined, formally documented, or systematically studied. One example is a condition known as breast implant illness (BII), which has been extensively discussed on social media, although it is vaguely defined in the medical literature. OBJECTIVE The objective of this study is to construct a data analysis pipeline to understand emerging illnesses using social media data and to apply the pipeline to understand the key attributes of BII. METHODS We constructed a pipeline of social media data analysis using natural language processing and topic modeling. Mentions related to signs, symptoms, diseases, disorders, and medical procedures were extracted from social media data using the clinical Text Analysis and Knowledge Extraction System. We mapped the mentions to standard medical concepts and then summarized these mapped concepts as topics using latent Dirichlet allocation. Finally, we applied this pipeline to understand BII from several BII-dedicated social media sites. RESULTS Our pipeline identified topics related to toxicity, cancer, and mental health issues that were highly associated with BII. Our pipeline also showed that cancers, autoimmune disorders, and mental health problems were emerging concerns associated with breast implants, based on social media discussions. Furthermore, the pipeline identified mentions such as rupture, infection, pain, and fatigue as common self-reported issues among the public, as well as concerns about toxicity from silicone implants. CONCLUSIONS Our study could inspire future studies on the suggested symptoms and factors of BII. Our study provides the first analysis and derived knowledge of BII from social media using natural language processing techniques and demonstrates the potential of using social media information to better understand similar emerging illnesses.


Mining Consumer Brand Relationship from Social Media Data: A Natural Language Processing Approach

Lecture Notes in Computer Science - Artificial Intelligence and Security ◽

10.1007/978-3-030-78609-0_47 ◽

2021 ◽

pp. 553-565

Author(s):

Di Shang ◽

Zhenda Hu ◽

Zhaoxia Wang

Keyword(s):

Social Media ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Social Media Data ◽

Brand Relationship ◽

Processing Approach ◽

Media Data


Ontology-Based Natural Language Processing of Social Media Data in the Assessment of Health Information Sought During Pregnancy

10.3233/shti210668 ◽

2021 ◽

Author(s):

Joo Yun Lee

Keyword(s):

Social Media ◽

Natural Language Processing ◽

South Korea ◽

Natural Language ◽

Family Support ◽

Health Information ◽

Language Processing ◽

Social Media Data ◽

Media Data

This study analyzed collected social media data from South Korea containing keywords related to “pregnancy” using ontology-based natural language processing. Of the 504,725 documents, those containing concepts related to “maternal emotion” were the most frequent, followed by “family support”. Social media were used as a means of exchanging information and expressing emotions.


Text mining and Natural Language Processing on Social Media Data giving Insights for Pharmacovigilance: A Case Study with Fentanyl

Indian Journal of Pharmaceutical Sciences ◽

10.4172/pharmaceutical-sciences.1000418 ◽

2018 ◽

Vol 80 (4) ◽

Author(s):

R Paulose ◽

B Gopal Samy ◽

K Jegatheesan

Keyword(s):

Social Media ◽

Natural Language Processing ◽

Text Mining ◽

Natural Language ◽

Language Processing ◽

Social Media Data ◽

Media Data


A Pipeline to Understand Emerging Illness Via Social Media Data Analysis: Case Study on Breast Implant Illness

JMIR Medical Informatics ◽

10.2196/29768 ◽

2021 ◽

Vol 9 (11) ◽

pp. e29768

Author(s):

Vishal Dey ◽

Peter Krasniak ◽

Minh Nguyen ◽

Clara Lee ◽

Xia Ning

Keyword(s):

Mental Health ◽

Social Media ◽

Natural Language Processing ◽

Data Analysis ◽

Natural Language ◽

Language Processing ◽

Breast Implant ◽

Public Attention ◽

Social Media Data ◽

Media Data

Background A new illness can come to public attention through social media before it is medically defined, formally documented, or systematically studied. One example is a condition known as breast implant illness (BII), which has been extensively discussed on social media, although it is vaguely defined in the medical literature. Objective The objective of this study is to construct a data analysis pipeline to understand emerging illnesses using social media data and to apply the pipeline to understand the key attributes of BII. Methods We constructed a pipeline of social media data analysis using natural language processing and topic modeling. Mentions related to signs, symptoms, diseases, disorders, and medical procedures were extracted from social media data using the clinical Text Analysis and Knowledge Extraction System. We mapped the mentions to standard medical concepts and then summarized these mapped concepts as topics using latent Dirichlet allocation. Finally, we applied this pipeline to understand BII from several BII-dedicated social media sites. Results Our pipeline identified topics related to toxicity, cancer, and mental health issues that were highly associated with BII. Our pipeline also showed that cancers, autoimmune disorders, and mental health problems were emerging concerns associated with breast implants, based on social media discussions. Furthermore, the pipeline identified mentions such as rupture, infection, pain, and fatigue as common self-reported issues among the public, as well as concerns about toxicity from silicone implants. Conclusions Our study could inspire future studies on the suggested symptoms and factors of BII. Our study provides the first analysis and derived knowledge of BII from social media using natural language processing techniques and demonstrates the potential of using social media information to better understand similar emerging illnesses.


Design and implementation of natural language processing with syntax and semantic analysis for extract traffic conditions from social media data

2015 5th IEEE International Conference on System Engineering and Technology (ICSET) ◽

10.1109/icsengt.2015.7412443 ◽

2015 ◽

Cited By ~ 2

Author(s):

Mochamad Vicky Ghani Aziz ◽

Ary Setijadi Prihatmanto ◽

Diotra Henriyan ◽

Rifki Wijaya

Keyword(s):

Social Media ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Semantic Analysis ◽

Social Media Data ◽

Traffic Conditions ◽

Design And Implementation ◽

Media Data


Perceiving Residents’ Festival Activities Based on Social Media Data: A Case Study in Beijing, China

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi10070474 ◽

2021 ◽

Vol 10 (7) ◽

pp. 474

Author(s):

Bingqing Wang ◽

Bin Meng ◽

Juan Wang ◽

Siyu Chen ◽

Jian Liu

Keyword(s):

Social Media ◽

Language Processing ◽

Topic Model ◽

Central Area ◽

Classification Model ◽

Social Media Data ◽

Ring Road ◽

Different Types ◽

Spatial Differences ◽

Media Data

Social media data contain information expressed in real time, including text and geographical location. As a new data source for crowd behavior research in the era of big data, they can reflect some aspects of the behavior of residents. In this study, a text classification model based on the BERT and Transformers framework was constructed and used to classify and extract more than 210,000 residents’ festival activities from 1.13 million Sina Weibo (Chinese “Twitter”) records collected from Beijing in 2019. On this basis, word frequency statistics, part-of-speech analysis, topic modeling, sentiment analysis, and other methods were used to perceive different types of festival activities and quantitatively analyze the spatial differences between different types of festivals. The results show that traditional culture significantly influences residents’ festivals, reflecting residents’ motivation to participate in festivals, how they participate, and how they express their emotions. There are apparent spatial differences among residents in participating in festival activities. The main festival activities are distributed in the central area within the Fifth Ring Road in Beijing. In contrast, expressions of feeling during the festivals are mainly distributed outside the Fifth Ring Road in Beijing. The research integrates natural language processing technology, topic model analysis, spatial statistical analysis, and other technologies. It can also broaden the application field of social media data, especially text data, providing a new research paradigm for studying residents’ festival activities and adding residents’ perception of the festivals. The research results provide a basis for the design and management of the Chinese festival system.
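The word frequency statistics mentioned in the abstract above can be sketched as a per-category word count over already classified posts. This is a minimal illustration, not the study's code: the function name `word_frequencies_by_label`, the `(label, text)` input shape, and whitespace tokenization are assumptions (Chinese Weibo text would need a proper word segmenter).

```python
from collections import Counter


def word_frequencies_by_label(posts):
    """Count word occurrences separately for each classifier label.

    `posts` is an iterable of (label, text) pairs; the labels stand in for
    the output of a text classifier, which is not implemented here.
    """
    counts = {}
    for label, text in posts:
        # Whitespace tokenization is a simplifying assumption for this sketch.
        counts.setdefault(label, Counter()).update(text.lower().split())
    return counts
```

Calling `word_frequencies_by_label([("activity", "temple fair"), ("emotion", "miss home")])` returns one `Counter` per label, which can then be queried with `most_common()` to compare vocabulary across festival activity types.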


Exploring public perceptions on alternative meat in China from social media data using transfer learning method

Food Quality and Preference ◽

10.1016/j.foodqual.2022.104530 ◽

2022 ◽

pp. 104530

Author(s):

Yuan Chen ◽

Zhisheng Zhang

Keyword(s):

Social Media ◽

Transfer Learning ◽

Public Perceptions ◽

Learning Method ◽

Social Media Data ◽

Media Data


Mining Social Media Data for Biomedical Signals and Health-Related Behavior

Annual Review of Biomedical Data Science ◽

10.1146/annurev-biodatasci-030320-040844 ◽

2020 ◽

Vol 3 (1) ◽

pp. 433-458 ◽

Cited By ~ 1

Author(s):

Rion Brattig Correia ◽

Ian B. Wood ◽

Johan Bollen ◽

Luis M. Rocha

Keyword(s):

Mental Health ◽

Social Media ◽

Population Level ◽

Data Access ◽

Health Conditions ◽

Social Phenomena ◽

Medical Treatments ◽

Social Media Data ◽

Health Related ◽

Media Data

Social media data have been increasingly used to study biomedical and health-related phenomena. From cohort-level discussions of a condition to population-level analyses of sentiment, social media have provided scientists with unprecedented amounts of data to study human behavior associated with a variety of health conditions and medical treatments. Here we review recent work in mining social media for biomedical, epidemiological, and social phenomena information relevant to the multilevel complexity of human health. We pay particular attention to topics where social media data analysis has shown the most progress, including pharmacovigilance and sentiment analysis, especially for mental health. We also discuss a variety of innovative uses of social media data for health-related applications as well as important limitations of social media data access and use.


Using social media data to map the areas most affected by ISIS in Syria

Proceedings of the International conference “InterCarto/InterGIS” ◽

10.35595/2414-9179-2020-1-26-464-470 ◽

2020 ◽

Vol 26 (1) ◽

pp. 464-470

Author(s):

Mohamad Hasan

Keyword(s):

Social Media ◽

Language Processing ◽

Geographical Information ◽

Data Mapping ◽

Islamic State ◽

Social Media Data ◽

The Social ◽

Mapping Process ◽

Data Source ◽

Media Data

This paper presents a model to collect, save, geocode, and analyze social media data. The model is used to collect and process social media data concerning the ISIS terrorist group (the Islamic State in Iraq and Syria) and to map the areas in Syria most affected by ISIS according to those data. The mapping process is the automated compilation of a density map from the geocoded tweets. Data mined from social media (e.g., Twitter and Facebook) are recognized as dynamic and easily accessible resources that can serve as a data source in spatial analysis and geographical information systems. Social media data can be represented as topic data and geocoding data based on the text mined from social media and processed using natural language processing (NLP) methods. NLP is a subdomain of artificial intelligence concerned with programming computers to analyze natural human language and texts. NLP makes it possible to identify the words that the developed geocoding algorithm uses as its initial data. In this study, the needed words were identified using two corpora. The first corpus contained the names of populated places in Syria. The second corpus was compiled from a statistical analysis of the number of tweets, picking the words that have a location meaning (e.g., schools, temples). After identifying the words, the algorithm used the Google Maps geocoding API to obtain the coordinates for posts.
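The corpus-based word identification and density counting described above can be sketched as follows. This is a simplified illustration under stated assumptions, not the paper's algorithm: a small local lookup table (`GAZETTEER`, with approximate coordinates) stands in for both the place-name corpus and the Google Maps geocoding API call, and the function names are hypothetical.

```python
import re
from collections import Counter

# Hypothetical mini-gazetteer: lowercase place name -> approximate (lat, lon).
# A local table replaces the external geocoding API so the sketch runs offline.
GAZETTEER = {
    "aleppo": (36.20, 37.16),
    "raqqa": (35.95, 39.01),
}


def geocode_posts(posts, gazetteer=GAZETTEER):
    """Return one (lat, lon) point per place-name mention found in the posts."""
    points = []
    for text in posts:
        for token in re.findall(r"[a-z]+", text.lower()):
            if token in gazetteer:
                points.append(gazetteer[token])
    return points


def density(points):
    """Count how many mentions fall on each coordinate (a crude density map)."""
    return Counter(points)
```

Passing the output of `geocode_posts` to `density` gives a count per location, which is the raw input a mapping tool would need to render a density map; posts without a known place name contribute nothing.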
