Using Application Programming Interfaces to Access Google Data for Health Research: Protocol for a Methodological Framework (Preprint)

Mapping Intimacies ◽

10.2196/preprints.16543 ◽

2019 ◽

Author(s):

Anne Zepecki ◽

Sylvia Guendelman ◽

John DeNero ◽

Ndola Prata

Keyword(s):

Real Time ◽

Birth Control ◽

Google Trends ◽

Search Term ◽

Health Trends ◽

Search Queries ◽

Search Volume ◽

Application Programming ◽

Relative Search Volume ◽

Google Search

BACKGROUND Individuals are increasingly turning to search engines like Google to obtain health information and access resources. Analysis of Google search queries offers a novel approach, which is part of the methodological toolkit for infodemiology or infoveillance researchers, to understanding population health concerns and needs in real time or near-real time. While searches predominantly have been examined with the Google Trends website tool, newer application programming interfaces (APIs) are now available to academics to draw a richer landscape of searches. These APIs allow users to write code in languages like Python to retrieve sample data directly from Google servers. OBJECTIVE The purpose of this paper is to describe a novel protocol to determine the top queries, volume of queries, and the top sites reached by a population searching on the web for a specific health term. The protocol retrieves Google search data obtained from three Google APIs: Google Trends, Google Health Trends (also referred to as Flu Trends), and Google Custom Search. METHODS Our protocol consisted of four steps: (1) developing a master list of top search queries for an initial search term using Google Trends, (2) gathering information on relative search volume using Google Health Trends, (3) determining the most popular sites using Google Custom Search, and (4) calculating estimated total search volume. We tested the protocol following key procedures at each step and verified its usefulness by examining search traffic on birth control in 2017 in the United States. Two separate programmers working independently achieved similar results with insignificant variation due to sample variability. RESULTS We successfully tested the methodology on the initial search term birth control. 
We identified top search queries for birth control, of which birth control pill was the most popular, and obtained the relative and estimated total search volume for the top queries: relative search volume was 0.54 for the pill, corresponding to an estimated 9.3-10.7 million searches. We used the estimates of the proportion of search activity for the top queries to generate a list of the most popular websites: for the pill, the Planned Parenthood website was the top site. CONCLUSIONS The proposed methodological framework demonstrates how to retrieve Google query data from multiple Google APIs and provides the thorough documentation required to systematically identify search queries and websites, as well as estimate relative and total search volume of queries in real time or near-real time in specific locations and time periods. Although the protocol needs further testing, it allows researchers to replicate the steps and shows promise in advancing our understanding of population-level health concerns. INTERNATIONAL REGISTERED REPORT RR1-10.2196/16543

Using Application Programming Interfaces to Access Google Data for Health Research: Protocol for a Methodological Framework

JMIR Research Protocols ◽

10.2196/16543 ◽

2020 ◽

Vol 9 (7) ◽

pp. e16543

Author(s):

Anne Zepecki ◽

Sylvia Guendelman ◽

John DeNero ◽

Ndola Prata

Keyword(s):

Real Time ◽

Birth Control ◽

Google Trends ◽

Search Term ◽

Health Trends ◽

Search Queries ◽

Search Volume ◽

Application Programming ◽

Relative Search Volume ◽

Google Search

Background Individuals are increasingly turning to search engines like Google to obtain health information and access resources. Analysis of Google search queries offers a novel approach, which is part of the methodological toolkit for infodemiology or infoveillance researchers, to understanding population health concerns and needs in real time or near-real time. While searches predominantly have been examined with the Google Trends website tool, newer application programming interfaces (APIs) are now available to academics to draw a richer landscape of searches. These APIs allow users to write code in languages like Python to retrieve sample data directly from Google servers. Objective The purpose of this paper is to describe a novel protocol to determine the top queries, volume of queries, and the top sites reached by a population searching on the web for a specific health term. The protocol retrieves Google search data obtained from three Google APIs: Google Trends, Google Health Trends (also referred to as Flu Trends), and Google Custom Search. Methods Our protocol consisted of four steps: (1) developing a master list of top search queries for an initial search term using Google Trends, (2) gathering information on relative search volume using Google Health Trends, (3) determining the most popular sites using Google Custom Search, and (4) calculating estimated total search volume. We tested the protocol following key procedures at each step and verified its usefulness by examining search traffic on birth control in 2017 in the United States. Two separate programmers working independently achieved similar results with insignificant variation due to sample variability. Results We successfully tested the methodology on the initial search term birth control. 
We identified top search queries for birth control, of which birth control pill was the most popular, and obtained the relative and estimated total search volume for the top queries: relative search volume was 0.54 for the pill, corresponding to an estimated 9.3-10.7 million searches. We used the estimates of the proportion of search activity for the top queries to generate a list of the most popular websites: for the pill, the Planned Parenthood website was the top site. Conclusions The proposed methodological framework demonstrates how to retrieve Google query data from multiple Google APIs and provides the thorough documentation required to systematically identify search queries and websites, as well as estimate relative and total search volume of queries in real time or near-real time in specific locations and time periods. Although the protocol needs further testing, it allows researchers to replicate the steps and shows promise in advancing our understanding of population-level health concerns. International Registered Report Identifier (IRRID) RR1-10.2196/16543
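The abstract above does not give the formulas behind step 4, so the following is only a rough sketch of the kind of arithmetic involved. It assumes that a relative search volume can be treated as a fraction of a reference volume; the function names, the query labels other than the pill, and all numbers except 0.54 are hypothetical.

```python
def query_shares(rsv_by_query):
    """Normalize relative search volumes so the shares sum to 1."""
    total = sum(rsv_by_query.values())
    if total <= 0:
        raise ValueError("relative search volumes must sum to a positive value")
    return {query: rsv / total for query, rsv in rsv_by_query.items()}


def estimated_range(rsv, low_total, high_total):
    """Scale an assumed reference volume range by a relative search volume.

    Assumption: rsv is a fraction of the reference term's total volume.
    """
    return rsv * low_total, rsv * high_total


# Hypothetical inputs: only the 0.54 value appears in the abstract.
rsv = {"birth control pill": 0.54, "hypothetical query B": 0.30,
       "hypothetical query C": 0.16}
shares = query_shares(rsv)

# Hypothetical reference range of 10 to 20 million searches.
low, high = estimated_range(0.5, 10_000_000, 20_000_000)
```

Shares computed this way could then be used to weight how many site results are collected per query, which is one possible reading of how the proportions feed into the website list.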

Seasonal variations and public search interests in Toxoplasma: a 16-year retrospective analysis of big data on Google Trends

Transactions of the Royal Society of Tropical Medicine and Hygiene ◽

10.1093/trstmh/traa147 ◽

2020 ◽

Author(s):

Lei Liu ◽

Peng Wang ◽

Su-Qin Jiang ◽

Zi-Rong Zhong ◽

Ting-Zheng Zhan ◽

...

Keyword(s):

New Zealand ◽

Google Trends ◽

Seasonal Patterns ◽

Search Term ◽

Internet Search ◽

Search Volume ◽

The USA ◽

Search Data ◽

Relative Search Volume ◽

The UK

Abstract Background This study aims to understand whether there is a seasonal change in internet search interest in Toxoplasma by using data derived from Google Trends (GT). Methods The present study retrieved the relative search volume (RSV) for the search term ‘Toxoplasma’ in GT within six major English-speaking countries (Australia and New Zealand [Southern Hemisphere]; Canada, Ireland, the UK and the USA [Northern Hemisphere]) from 1 January 2004 to 31 December 2019, utilizing the category of ‘health’. Data regarding the RSV of Toxoplasma were obtained and further statistical analysis was performed in R software using the ‘season’ package. Results There were significant seasonal patterns for the RSV of the search term ‘Toxoplasma’ in five countries (all p<0.05), except for the UK. A peak in December–March and a trough in July–September (Canada, Ireland, the UK and the USA) were observed, while a peak in June/August and a trough in December/February (Australia, New Zealand) were also found. Moreover, seasonal patterns in RSV for ‘Toxoplasma’ were found in both the Southern and Northern Hemispheres (both p<0.05), with the meteorological months reversed. Conclusions Overall, our study revealed seasonal variation in searches for Toxoplasma using internet search data from GT, providing additional evidence on seasonal patterns in Toxoplasma.
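The study ran its analysis in R with the ‘season’ package. As a simpler illustration of the peak and trough comparison described above, here is a small Python sketch that averages RSV by calendar month and shifts months by six to align the two hemispheres. The function names and the sample values are hypothetical and do not reproduce the study's method.

```python
from collections import defaultdict


def monthly_means(series):
    """Average RSV per calendar month from (month, rsv) pairs."""
    buckets = defaultdict(list)
    for month, rsv in series:
        buckets[month].append(rsv)
    return {m: sum(v) / len(v) for m, v in sorted(buckets.items())}


def peak_and_trough(means):
    """Return the months with the highest and lowest mean RSV."""
    return max(means, key=means.get), min(means, key=means.get)


def shift_hemisphere(month, offset=6):
    """Map a month (1-12) to the seasonally equivalent month six months on."""
    return (month - 1 + offset) % 12 + 1


# Hypothetical Northern Hemisphere observations: (month, rsv).
sample = [(1, 80), (1, 90), (4, 60), (7, 40), (7, 50)]
means = monthly_means(sample)
peak, trough = peak_and_trough(means)
```

Applying `shift_hemisphere` to Southern Hemisphere months before comparing puts both hemispheres on the same seasonal calendar.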

Increase in public interest concerning alternative medicine during the COVID-19 pandemic in Indonesia: a Google Trends study

F1000Research ◽

10.12688/f1000research.25525.2 ◽

2021 ◽

Vol 9 ◽

pp. 1201

Author(s):

Dewi Rokhmah ◽

Khaidar Ali ◽

Serius Miliyani Dwi Putri ◽

Khoiron Khoiron

Keyword(s):

Alternative Medicine ◽

Public Interest ◽

Time Lag ◽

Google Trends ◽

Rank Test ◽

Search Term ◽

Alternative Medicines ◽

Search Volume ◽

Relative Search Volume ◽

Time Lag Correlation

Background: The COVID-19 pandemic has prompted individuals to adopt healthier behaviours in order to prevent transmission, including trying to improve their immunity through the use of alternative medicines. This study aimed to examine public interest in alternative medicine during the COVID-19 pandemic in Indonesia using Google Trends. Methods: In this quantitative study, the Spearman rank test was used to analyze the correlation between Google Relative Search Volume (RSV) of various search terms, within the categories of alternative medicine, herbal medicine and practical activity, and COVID-19 cases. In addition, time lag correlation was also investigated. Results: Public interest in alternative medicine during the COVID-19 pandemic in Indonesia has escalated dramatically. All search term categories (alternative medicine, herbal medicine, and alternative medicine activities) were positively associated with COVID-19 cases (p<0.05). The terms ‘ginger’ (r=0.6376), ‘curcumin’ (r=0.6550) and ‘planting ginger’ (r=0.6713) had the strongest correlations. Furthermore, the time lag correlation between COVID-19 cases and Google RSV was also positive and significant (p<0.05). Conclusion: Public interest in alternative medicine related terms increased dramatically after the first confirmed COVID-19 case was reported in Indonesia. Time lag correlation showed good performance using weekly data. The Indonesian Government will play an important role in providing and monitoring information related to alternative medicine so that the population receives the maximum benefit.
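The Spearman rank and time lag correlations mentioned above can be sketched in plain Python as follows. This is a minimal illustration, not the authors' code: the function names are hypothetical, ties receive average ranks, and the lag convention (cases at week t paired with RSV at week t + lag) is an assumption.

```python
def ranks(values):
    """Assign 1-based ranks, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


def lagged_spearman(cases, rsv, lag):
    """Correlate cases at week t with RSV at week t + lag (lag >= 0)."""
    if lag == 0:
        return spearman(cases, rsv)
    return spearman(cases[:-lag], rsv[lag:])
```

Scanning `lag` over a small range of weeks and keeping the lag with the strongest coefficient is one simple way to explore whether search interest trails case counts.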

Public Awareness of Uterine Power Morcellation Through US Food and Drug Administration Communications: Analysis of Google Trends Search Term Patterns (Preprint)

10.2196/preprints.9913 ◽

2018 ◽

Author(s):

Lauren N Wood ◽

Juzar Jamnagerwalla ◽

Melissa A Markowitz ◽

D Joseph Thum ◽

Philip McCarty ◽

...

Keyword(s):

Minimally Invasive ◽

Drug Administration ◽

Public Awareness ◽

Food And Drug Administration ◽

Google Trends ◽

Search Term ◽

Search Volume ◽

Power Morcellation ◽

The Mean ◽

Google Search

BACKGROUND Uterine power morcellation, where the uterus is shredded into smaller pieces, is a widely used technique for removal of uterine specimens in patients undergoing minimally invasive abdominal hysterectomy or myomectomy. Complications related to power morcellation of uterine specimens led to US Food and Drug Administration (FDA) communications in 2014 ultimately recommending against the use of power morcellation for women undergoing minimally invasive hysterectomy. Subsequently, practitioners drastically decreased the use of morcellation. OBJECTIVE We aimed to determine the effect of increased patient awareness on the decrease in use of the morcellator. Google Trends is a public tool that provides data on temporal patterns of search terms, and we correlated these data with the timing of the FDA communication. METHODS Weekly relative search volume (RSV) was obtained from Google Trends using the term “morcellation.” Higher RSV corresponds to increases in weekly search volume. Search volumes were divided into 3 groups: the 2 years prior to the FDA communication, a 1-year period following, and thereafter, with the distribution of the weekly RSV over the 3 periods tested using 1-way analysis of variance. Additionally, we analyzed the total number of websites containing the term “morcellation” over this time. RESULTS The mean RSV prior to the FDA communication was 12.0 (SD 15.8), with the RSV being 60.3 (SD 24.7) in the 1 year after and 19.3 (SD 5.2) thereafter (P<.001). The mean number of webpages containing the term “morcellation” in 2011 was 10,800, rising to 18,800 during 2014 and 36,200 in 2017. CONCLUSIONS Google search activity about morcellation of uterine specimens increased significantly after the FDA communications. This trend indicates an increased public awareness regarding morcellation and its complications. More extensive preoperative counseling and alteration of surgical technique and clinician practice may be necessary.
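The 1-way analysis of variance across the three periods compares between-group and within-group variation in weekly RSV. A minimal sketch of the F statistic in plain Python is shown below; the function name and the sample values are hypothetical, and obtaining a P value would additionally require the F distribution.

```python
def one_way_anova_f(groups):
    """Return the one-way ANOVA F statistic for a list of samples."""
    k = len(groups)
    n = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n
    means = [sum(g) / len(g) for g in groups]
    # Variation of group means around the grand mean.
    ss_between = sum(len(g) * (m - grand_mean) ** 2
                     for g, m in zip(groups, means))
    # Variation of observations around their own group mean.
    ss_within = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)
    return (ss_between / (k - 1)) / (ss_within / (n - k))


# Hypothetical weekly RSV values for the three periods.
before, year_after, thereafter = [1, 2, 3], [5, 6, 7], [2, 3, 4]
f_stat = one_way_anova_f([before, year_after, thereafter])
```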

Google trends to identify public’s Interest in bowel cancer, A Web Based Analysis.

Journal of Clinical Research and Reports ◽

10.31579/2690-1919/120 ◽

2020 ◽

Vol 5 (4) ◽

pp. 01-04

Author(s):

T Manzoor

Keyword(s):

Bowel Cancer ◽

Google Trends ◽

Volume Index ◽

Search Term ◽

Health Campaigns ◽

Cancer Awareness ◽

Search Volume ◽

Search Data ◽

Public Health Campaigns ◽

Relative Search Volume

Aims: Bowel cancer is one of the commonest cancers in the UK. Google Trends was used to evaluate the public’s search interest in bowel cancer. We hypothesize that the search data in Google Trends may be influenced by the “Bowel cancer awareness month” campaign and that in future this might be a useful surrogate to monitor the effectiveness of public health campaigns. Methods: Google Trends was used to extract data presented as a “Relative search volume index” (SVI) ranging from 0 to 100. “Bowel Cancer” was used as the search term to collect the relevant data for the last 5 years (January 2015 to December 2019). All the peaks were assessed and their correlation with bowel cancer awareness month was noted. Results: We noticed an upward trend for the search term during April in most years, when the search peak reached 90%. This corresponds with the bowel cancer awareness month campaign. A downward trend was also noticed during December in all years, when it went down to 53%. This may represent avoidance of health-related searches during the holiday period. Conclusion: Our study shows an encouraging association between the bowel cancer awareness month campaign and the public’s search interest. The results can be used in future to design effective awareness strategies and to inform future interventions.
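Google Trends reports interest on a 0 to 100 scale in which the point of highest interest in the selected period and region is assigned 100. The sketch below shows that final scaling step only; the function name is hypothetical, and the input is assumed to be already normalized search interest (for example, a share of all searches), which Google does not expose directly.

```python
def to_index(raw_interest):
    """Scale a series so that its maximum becomes 100 (rounded to integers)."""
    peak = max(raw_interest)
    if peak <= 0:
        raise ValueError("series needs at least one positive value")
    return [round(100 * value / peak) for value in raw_interest]


# Hypothetical monthly interest values.
index = to_index([5, 10, 20])
```

Because every series is rescaled to its own peak, index values from separate queries are not directly comparable unless the terms were requested together.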

Determination of the Popularity of Dietary Supplements Using Google Search Rankings

Nutrients ◽

10.3390/nu12040908 ◽

2020 ◽

Vol 12 (4) ◽

pp. 908 ◽

Cited By ~ 2

Author(s):

Mikołaj Kamiński ◽

Matylda Kręgielska-Narożna ◽

Paweł Bogdański

Keyword(s):

Dietary Supplements ◽

Google Trends ◽

Access To Information ◽

Search Volume ◽

Regional Interest ◽

Trends Over Time ◽

Relative Search Volume ◽

Google Search ◽

Over Time

The internet provides access to information about dietary supplements and allows their easy purchase. We aimed to rank the interest of Google users in dietary supplements and to determine the changes that occurred in their popularity from 2004 to 2019. We used Google Trends to generate data over time on regional interest in dietary supplements (n = 200). We categorized each included supplement and calculated the interest in all topics in proportion to the relative search volume (RSV) of “lutein”. We analyzed the trends over time of all topics and categories. Globally, the topics with the highest popularity were “magnesium”, which was 23.72 times more popular than “lutein”, “protein” (15.22 times more popular), and “iron” (15.12). The categories of supplements receiving most interest were protein (9.64), mineral (5.24), and vitamin (3.47). The RSV of seven categories of topics (amino acid, bacterial, botanical, fiber, mineral, protein, and vitamin) increased over time while two categories (enzyme and fat or fatty acid) saw a drop in their RSV. Overall, 119 topics saw an increase in interest over time, 19 remained stable, and 62 saw interest in them decrease. Google Trends provides insights into e-discourse and enables analysis of the differences in popularity of certain topics across countries and over time.
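Expressing each topic's interest in proportion to a reference topic, as done with “lutein” above, can be sketched as a ratio of mean RSV values taken from the same comparison. The abstract does not specify the exact calculation, so the use of means, the function name, and the numbers below are assumptions.

```python
def relative_popularity(topic_rsv, reference_rsv):
    """Ratio of a topic's mean RSV to the reference topic's mean RSV.

    Assumption: both series come from the same Google Trends comparison,
    so they share one 0-100 scale.
    """
    topic_mean = sum(topic_rsv) / len(topic_rsv)
    reference_mean = sum(reference_rsv) / len(reference_rsv)
    if reference_mean == 0:
        raise ValueError("reference topic has zero mean RSV")
    return topic_mean / reference_mean


# Hypothetical series for one topic and the reference topic.
ratio = relative_popularity([40, 60], [2, 3])
```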

Increase in public interest concerning alternative medicine during the COVID-19 pandemic in Indonesia: a Google Trends study

F1000Research ◽

10.12688/f1000research.25525.1 ◽

2020 ◽

Vol 9 ◽

pp. 1201

Author(s):

Dewi Rokhmah ◽

Khaidar Ali ◽

Serius Miliyani Dwi Putri ◽

Khoiron Khoiron

Keyword(s):

Alternative Medicine ◽

Public Interest ◽

Time Lag ◽

Google Trends ◽

Rank Test ◽

Search Term ◽

Alternative Medicines ◽

Search Volume ◽

Relative Search Volume ◽

Time Lag Correlation

Seasonal Variation and Global Public Interest in the Internet Searches for Osteoporosis

BioMed Research International ◽

10.1155/2021/6663559 ◽

2021 ◽

Vol 2021 ◽

pp. 1-8

Author(s):

Chao Wang ◽

Xiong Shu ◽

Jianfeng Tao ◽

Yanzhuo Zhang ◽

Yue Yuan ◽

...

Keyword(s):

Seasonal Variation ◽

Public Interest ◽

Seasonal Pattern ◽

Late Winter ◽

Search Term ◽

Ibandronic Acid ◽

Search Volume ◽

Significant Seasonal Variation ◽

Relative Search Volume ◽

Google Search

Background. To ascertain the seasonal pattern and global public interest in osteoporosis by evaluating changes in search term popularity for the disease over a decade. Methods. We used Google Trends to retrieve search popularity scores for the term “osteoporosis” between January 01, 2004, and December 31, 2019. Cosinor analyses were conducted to examine the seasonality of osteoporosis, and analysis of osteoporosis-related topics, including hot topics and rising related topics, was also performed. Results. The cosinor analyses demonstrated a statistically significant seasonal variation in relative search volume of “osteoporosis” in the world (p = 0.0083), USA (p < 0.001), UK (p < 0.001), Canada (p < 0.001), Ireland (p < 0.001), Australia (p < 0.001), and New Zealand (p < 0.001), with a peak in the late winter months and a trough in the summer months. The late winter peaks and summer troughs showed an approximately 6-month difference between hemispheres. The top 11 rising topics were denosumab, FRAX, hypocalcaemia, zoledronic acid, ibandronic acid, osteomyelitis, osteopenia, osteoarthritis, bone, calcium, and bone density. Conclusions. Google search query volumes related to osteoporosis follow strong seasonal patterns with late winter peaks and summer troughs. Further studies aimed at elucidating the possible mechanisms behind seasonality in osteoporosis are needed. Moreover, Internet data including the top rising topics may alert physicians to strengthen public education about osteoporosis in a timely manner, so as to further promote the development of public health interventions.
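A cosinor analysis fits a cosine curve with a fixed period to the series and reads off its mean level, amplitude, and peak timing. The sketch below is a simplified single-component version that assumes evenly spaced monthly data covering whole years, in which case the least-squares estimates have a closed form. The function name and test series are hypothetical, and no significance test is included.

```python
import math


def cosinor_fit(values, period=12):
    """Fit y = mesor + amplitude * cos(2*pi*(t - peak) / period).

    Assumes evenly spaced observations spanning a whole number of periods,
    so the cosine and sine regressors are orthogonal.
    Returns (mesor, amplitude, peak position in 0..period).
    """
    n = len(values)
    if n == 0 or n % period != 0:
        raise ValueError("series must span a whole number of periods")
    omega = 2 * math.pi / period
    mesor = sum(values) / n
    a = 2 / n * sum(v * math.cos(omega * t) for t, v in enumerate(values))
    b = 2 / n * sum(v * math.sin(omega * t) for t, v in enumerate(values))
    amplitude = math.hypot(a, b)
    peak = (math.atan2(b, a) / omega) % period
    return mesor, amplitude, peak


# Synthetic two-year series peaking at index 1 (hypothetical values).
series = [50 + 10 * math.cos(2 * math.pi * (t - 1) / 12) for t in range(24)]
mesor, amplitude, peak = cosinor_fit(series)
```

With index 0 taken as January, a peak position near 1 would correspond to February, and a roughly 6-month offset between hemispheres would show up as peak positions about 6 apart.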

Global Internet Data on the Interest in Antibiotics and Probiotics Generated by Google Trends

Antibiotics ◽

10.3390/antibiotics8030147 ◽

2019 ◽

Vol 8 (3) ◽

pp. 147 ◽

Cited By ~ 8

Author(s):

Mikołaj Kamiński ◽

Igor Łoniewski ◽

Wojciech Marlicz

Keyword(s):

Health Expenditure ◽

Rank Correlation ◽

Antibiotic Consumption ◽

Google Trends ◽

Related Information ◽

Search Volume ◽

Spearman Rank Correlation Analysis ◽

The Mean ◽

Relative Search Volume ◽

Google Search

Data from the Google search engine enable the assessment of Google users’ interest in a specific topic. We analyzed the world trends in searches associated with the topics “antibiotics” and “probiotics” from January 2004 to June 2019, using Google Trends. We analyzed the yearly trends and seasonal variation. We performed a Spearman rank correlation analysis of the relative search volume (RSV) of the topics in 2015 with antibiotic consumption, health expenditure per capita, and the 2015 Human Development Index (HDI) of the country. The mean interest in the topic of antibiotics was equal to RSV = 57.5 ± 17.9, rising by 3.7 RSV/year (6.5%/year), while that of probiotics was RSV = 14.1 ± 7.9, which rose by 1.7 RSV/year (12.1%/year). The seasonal amplitude of antibiotics was equal to RSV = 9.8, while that of probiotics was RSV = 2.7. The seasonal peaks for both topics were observed in the cold months. The RSV of probiotics, but not antibiotics, was associated with antibiotic consumption (Rs = 0.35; p < 0.01), health expenditure (Rs = 0.41; p < 0.001), and HDI (Rs = 0.44; p < 0.001). Google users’ interest in antibiotic- and probiotic-related information increases from year to year, and peaks in cold months. The interest in probiotic-related information might be associated with antibiotic consumption, health expenditure, and the development status of the Google users’ country.
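A yearly trend such as "rising by 3.7 RSV/year" can be estimated as the slope of a least-squares line through yearly RSV values. The sketch below shows that calculation; expressing the slope as a percentage of the mean RSV is an assumption, since the abstract does not say how its percentages were derived. Function names and data are hypothetical.

```python
def linear_slope(x, y):
    """Ordinary least-squares slope of y on x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    return sxy / sxx


def slope_as_percent_of_mean(x, y):
    """Slope divided by the mean of y, in percent per unit of x (assumption)."""
    return 100 * linear_slope(x, y) / (sum(y) / len(y))


# Hypothetical yearly mean RSV values.
years = [2004, 2005, 2006, 2007]
rsv = [10, 12, 14, 16]
slope = linear_slope(years, rsv)
percent = slope_as_percent_of_mean(years, rsv)
```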

"Dr. Google" consultations on psoriasis: trends and seasonality in a digital era. (Preprint)

10.2196/preprints.21709 ◽

2020 ◽

Author(s):

Fernando Garcia-Souto ◽

Jose Juan Pereyra-Rodriguez

Keyword(s):

Google Trends ◽

Percentage Change ◽

Volume Index ◽

Treatment Modalities ◽

Search Term ◽

Annual Percentage Change ◽

Digital Era ◽

Search Volume ◽

Relative Search Volume ◽

Join Point

BACKGROUND In recent years, the Internet has become an essential tool where people seek information about health care. OBJECTIVE The aim of this study is to use data from Google Trends to analyze worldwide public interest in psoriasis and its different treatment modalities, and to analyze the possible seasonality of searches. METHODS A worldwide search was carried out through Google Trends from 2004 to 2019. A combination of terms related to psoriasis treatments was entered. Join-point regression was performed. Google Trends assigns a relative search volume index to the search terms. Annual relative search volume, annual percentage change, and average annual percentage change (AAPC) were analyzed to assess loss or gain of interest. RESULTS Our study reflects an increased interest in secukinumab (AAPC: 33.7), ixekizumab (AAPC: 23.3) and apremilast (AAPC: 21.4). It shows less interest in methotrexate (AAPC: -3.6), retinoids (AAPC: -9.8), cyclosporine (AAPC: -9.8), phototherapy (AAPC: -6.3), etanercept (AAPC: -14.9), infliximab (AAPC: -14) and adalimumab (AAPC: -5.8). Seasonality was found in the search term “psoriasis”. CONCLUSIONS Secukinumab, followed by ixekizumab and apremilast, have been the treatments that aroused the most interest. Our results show current search trends for psoriasis and its different treatments based on Google Trends analysis.
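In join-point regression, the annual percentage change of a segment is usually derived from the slope of a log-linear fit, and the AAPC is a length-weighted average of the segment slopes converted back to a percentage. The sketch below follows that common definition; it is not the authors' code, the function names are hypothetical, and the segment values are made up.

```python
import math


def apc(log_slope):
    """Annual percentage change from the slope of ln(value) on year."""
    return (math.exp(log_slope) - 1) * 100


def aapc(segments):
    """Average annual percentage change.

    segments: list of (length_in_years, log_slope) pairs, one per
    join-point segment; slopes are averaged with segment length as weight.
    """
    total_years = sum(length for length, _ in segments)
    weighted = sum(length * slope for length, slope in segments) / total_years
    return (math.exp(weighted) - 1) * 100


# Hypothetical fit: 3 years growing 10% per year, then 2 flat years.
example = aapc([(3, math.log(1.10)), (2, 0.0)])
```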
