scholarly journals Protecting against researcher bias in secondary data analysis: challenges and potential solutions

Author(s):  
Jessie R. Baldwin ◽  
Jean-Baptiste Pingault ◽  
Tabea Schoeler ◽  
Hannah M. Sallis ◽  
Marcus R. Munafò

AbstractAnalysis of secondary data sources (such as cohort studies, survey data, and administrative records) has the potential to provide answers to science and society’s most pressing questions. However, researcher biases can lead to questionable research practices in secondary data analysis, which can distort the evidence base. While pre-registration can help to protect against researcher biases, it presents challenges for secondary data analysis. In this article, we describe these challenges and propose novel solutions and alternative approaches. Proposed solutions include approaches to (1) address bias linked to prior knowledge of the data, (2) enable pre-registration of non-hypothesis-driven research, (3) help ensure that pre-registered analyses will be appropriate for the data, and (4) address difficulties arising from reduced analytic flexibility in pre-registration. For each solution, we provide guidance on implementation for researchers and data guardians. The adoption of these practices can help to protect against researcher bias in secondary data analysis, to improve the robustness of research based on existing data.

2018 ◽  
Author(s):  
Sara J Weston ◽  
Stuart James Ritchie ◽  
Julia Marie Rohrer ◽  
Andrew K Przybylski

Secondary data analysis, or the analysis of pre-existing data, can be a powerful tool for the resourceful researcher. Never has this been more true than now, when technological advances allow for easier sharing of data across labs and continents and the mining of large sources of “pre-existing data”. However, secondary data analysis is often ignored as a methodological tool, either when developing new open science practices or improving analytic methods for robust data analysis. In this paper, we hope to provide researchers with the knowledge necessary to incorporate secondary data analysis into their toolbox. Specifically, we define secondary data analysis as a tool and in relation to other common forms of analysis (including exploratory and confirmatory, observational and experimental). We highlight the advantages and disadvantages of this tool. We describe how engagement in transparency can improve and alter our interpretations of results from secondary data analysis and provide resources for robust data analysis. We close by suggesting ways in which subfields and institutions could address and improve the use of secondary data analysis.


2020 ◽  
Author(s):  
Jessie Baldwin ◽  
Jean-Baptiste Pingault ◽  
Tabea Schoeler ◽  
Hannah Sallis ◽  
Marcus Robert Munafo

Protecting against researcher biases – both conscious and unconscious – can help to ensure robust findings and correct inferences in epidemiology. While pre-registration can be an effective way to achieve this, it brings several challenges for researchers analysing existing datasets. Here we describe these challenges, and propose solutions and alternatives. For each solution, we provide guidance, and highlight practical considerations for researchers. The adoption of these practices will allow researchers to effectively pre-register secondary data analysis studies, or use an alternative approach, in order to protect themselves against common human biases. In turn, this will increase the robustness and credibility of epidemiological research based on secondary data.


2009 ◽  
Vol 23 (3) ◽  
pp. 203-215 ◽  
Author(s):  
Daniel M. Doolan ◽  
Erika S. Froelicher

The vast majority of the research methods literature assumes that the researcher designs the study subsequent to determining research questions. This assumption is not met for the many researchers involved in secondary data analysis. Researchers doing secondary data analysis need not only understand research concepts related to designing a new study, but additionally must be aware of challenges specific to conducting research using an existing data set. Techniques are discussed to determine if secondary data analysis is appropriate. Suggestions are offered on how to best identify, obtain, and evaluate a data set; refine research questions; manage data; calculate power; and report results. Examples from nursing research are provided. If an existing data set is suitable for answering a new research question, then a secondary analysis is preferable since it can be completed in less time, for less money, and with far lower risks to subjects. The researcher must carefully consider if the existing data set’s available power and data quality are adequate to answer the proposed research questions.


2021 ◽  
pp. 107780122110139
Author(s):  
Jodie Murphy-Oikonen ◽  
Lori Chambers ◽  
Karen McQueen ◽  
Alexa Hiebert ◽  
Ainsley Miller

Rates of sexual victimization among Indigenous women are 3 times higher when compared with non-Indigenous women. The purpose of this secondary data analysis was to explore the experiences and recommendations of Indigenous women who reported sexual assault to the police and were not believed. This qualitative study of the experiences of 11 Indigenous women reflects four themes. The women experienced (a) victimization across the lifespan, (b) violent sexual assault, (c) dismissal by police, and (d) survival and resilience. These women were determined to voice their experience and make recommendations for change in the way police respond to sexual assault.


1989 ◽  
Vol 3 (2) ◽  
pp. 66-69
Author(s):  
Dorothy G. Herron

Sign in / Sign up

Export Citation Format

Share Document