RNA-Sequencing Improves Diagnostics and Treatment of Pediatric Hematological Malignancies

Marco J. Koudijs; Lennart A. Kester; Jayne Y. Hehir-Kwa; Eugene T.P. Verwiel; Erik Strengman; Marc van Tuil; Douwe van der Leest; Edwin Sonneveld; Patrick Kemmeren; Valerie De Haas; Josef Vormoor; Bas B.J. Tops

doi:10.1182/blood-2021-147692

RNA-Sequencing Improves Diagnostics and Treatment of Pediatric Hematological Malignancies

Blood ◽

10.1182/blood-2021-147692 ◽

2021 ◽

Vol 138 (Supplement 1) ◽

pp. 107-107

Author(s):

Marco J. Koudijs ◽

Lennart A. Kester ◽

Jayne Y. Hehir-Kwa ◽

Eugene T.P. Verwiel ◽

Erik Strengman ◽

...

Keyword(s):

The Netherlands ◽

Rna Sequencing ◽

Gene Fusion ◽

Hematological Malignancies ◽

Diagnostic Procedures ◽

Molecular Techniques ◽

Imatinib Treatment ◽

Rna Seq ◽

Routine Diagnostics ◽

Fusion Detection

Abstract Background Diagnosis and treatment of hematological malignancies relies increasingly on the detection of underlying genetic abnormalities. Various laboratory techniques, including karyotyping, SNP-array, FISH, MLPA and RT-PCR are typically required to detect the full spectrum of clinically relevant genetic aberrations. These techniques are also hampered in their sensitivity by their targeted approach or lack of resolution. Ideally, an unbiased genome wide approach like RNA sequencing (RNA-seq) as a one-test-fits-all, could save costs and efforts and streamline diagnostic procedures. In the Netherlands, the care for all children with oncological disorders has been concentrated in a single, national center. Within the Laboratory of Childhood Cancer Pathology, we aim for a comprehensive diagnostic pipeline by implementing RNA-seq to aid diagnosis, prognosis and treatment of all children with cancer in the Netherlands. Methods We have established an RNA-seq based diagnostic pipeline, primarily aimed at detecting gene fusion events. Library prep is performed on 50-300 ng total RNA isolated from fresh (frozen) samples, followed by ribo-depletion and subsequent paired-end sequencing (2x150 nt) using the Illumina NovaSeq platform. Data is analyzed using the StarFusion algorithm for gene-fusion detection. We are prospectively comparing the results with routine diagnostic procedures. In addition, we are validating the detection of single nucleotide variants (SNVs) from RNA-seq data and developing a diagnostic classifier, using a nearest neighbor network approach. Results Based on RNA-seq profiling in diagnostics for all patients entering the Princess Maxima Center, there are several use-cases that highlight the value of RNA-seq. 1) In a prospective cohort of 244 patients (pan-cancer, including 97 hematological malignancies) we have shown that the diagnostic yield for detecting gene fusion events increased by approximately 40% compared to classical methods. An example is the TNIP1--PDGFRB gene fusion in a patient with pre B-ALL, making this patient eligible for imatinib treatment, which was not detected by other methods. 2) Variant calling on RNA-seq shows that activating mutations in e.g. KRAS are detected with high sensitivity, stratifying patients for therapeutic MEK intervention. 3) By expression outlier analysis, we were able to detect various promotor exchanges, e.g. IGH-MYC or IGH--DUX4, which are typically hard to detect by molecular techniques since the genomic breakpoint is highly variable and no chimeric transcript is formed. 4) Preliminary results from our diagnostic classifier show its potential to predict subclasses of hematological malignancies, e.g. high-hyperdiploid or bi-phenotypic ALL patients. 5) Fusion gene breakpoints detected by RNA-seq serve as a target for MRD analysis, allowing us to monitor disease progression and therapy response in individual patients. Currently, RNA-seq data is available for more than 1500 pediatric tumor samples. At the upcoming conference we will present an update of our results and some typical cases highlighting the added value of RNA-seq in routine diagnostics. Conclusion We show that RNA-seq on pediatric cancer samples is feasible and of great value for routine diagnostics. It has a higher sensitivity to detect gene fusion events compared to targeted assays. RNA-seq based gene fusion detection, in combination with mutation and expression analysis, is also promising to improve classification of malignancies, prognosis and stratification of patients for targeted therapies. Disclosures No relevant conflicts of interest to declare.

Download Full-text

Gene Fusion Detection By RNA-Seq in Acute Myeloid Leukemia (AML)

Blood ◽

10.1182/blood-2019-125869 ◽

2019 ◽

Vol 134 (Supplement_1) ◽

pp. 4655-4655

Author(s):

Paul Kerbs ◽

Aarif Mohamed Nazeer Batcha ◽

Sebastian Vosberg ◽

Dirk Metzler ◽

Tobias Herold ◽

...

Keyword(s):

Chromosomal Aberrations ◽

Gene Fusion ◽

Fusion Transcript ◽

Clinical Diagnostics ◽

Fusion Genes ◽

Rna Seq ◽

Fusion Event ◽

Routine Diagnostics ◽

Partner Gene ◽

Fusion Detection

Accurate and complete genetic classification of AML is crucial for the prediction of clinical outcome and treatment stratification. Deciphering the spectrum of genetic abnormalities by polymerase chain reaction (PCR), karyotyping and fluorescence in situ hybridization (FISH) in routine diagnostics is the current gold standard, however, fusion genes might potentially be missed by these assays. Recently, several methods have been developed to improve the detection of gene fusion transcripts based on RNA sequencing data, providing robust results. To test the detection power and assess the applicability of RNA-Seq based methods in clinical diagnostics we applied two different algorithms, namely FusionCatcher (Nicorici D et al., bioRxiv, 2014) and Arriba (Uhrig S et al., DKFZ, https://github.com/suhrig/arriba), to the transcriptomes of 895 well-characterized AML samples from three independently sequenced cohorts: AMLCG (Herold T et al., Haematologica, 2018, n=261), DKTK (Greif PA et al., Clin Cancer Res, 2018 and unpublished data, n=166), BeatAML (Tyner JW et al., Nature 2018, n=468) and publicly available healthy control samples (SRA studies: SRP018028, SRP047126, SRP050146, SRP105369, SRP115911, SRP133442, n=38). According to karyotyping, 31% (277/895) of samples harbored chromosomal aberrations putatively causing gene fusions (i.e. translocations, interstitial deletions, duplications, inversions, insertions). Analyses by FISH and/or PCR confirmed these rearrangements in 51.3% (142/277) of samples, whereas fusion detection by the means of RNA-Seq showed evidence for fusion genes corresponding to these rearrangements in 60.3% (167/277) of samples. Chromosomal aberrations, identified by karyotyping, which are known to result in clinically relevant fusions (e.g. RUNX1-RUNX1T1, KMT2A fusions) were confirmed by FISH/PCR (AMLCG: n=27/27, DKTK: n=21/21, BeatAML: n=54/57) and RNA-Seq based methods (AMLCG: n=17/27, DKTK: n=21/21, BeatAML: n=56/57) in most of the cases. Of note, the AMLCG cohort was sequenced using the SENSE mRNA Library Prep Kit from Lexogen which seems to be not optimal for fusion detection. Furthermore, 19 samples (AMLCG: n=12, DKTK: n=4, BeatAML: n=3) were found to harbor known pathogenic fusions, described in previous studies, which were not reported by routine diagnostics: NUP98-NSD1 (n=11); CBFB-MYH11, RUNX1-RUNX1T1 and DEK-NUP214 (n=2 each); RUNX1-CBFA2T2 and RUNX1-CBFA2T3 (n=1 each). Reanalysis of six of these samples by PCR confirmed three fusions which were initially missed by routine diagnostics. In general, the amount of reported fusion events by RNA-Seq is high (on average 69 and 39 per sample as detected by FusionCatcher and Arriba respectively), even after applying the built-in filters, indicating a high false positive rate. To robustly identify putative novel fusions, we developed a filtering pipeline and incorporated two new filtering steps. The promiscuity score (PS) of a fusion measures the amount of further distinct fusion partners which were detected in the respective cohort for the 5' and 3' gene. The fusion transcript score (FTS) measures the relative abundance of a fusion transcript to its 5' and 3' partner gene. PS and FTS of known, clinically relevant fusions confirmed by FISH/PCR were used to define cut-offs. To further maximize specificity while maintaining sensitivity, we excluded fusion events which we detected in publicly available healthy samples and subsequently filtered for overlapping calls from FusionCatcher and Arriba (Fig. 1A). Additionally, we obtained further evidence for a fusion event by an elevated transcription of the 3' fusion partner. In case of a fusion event, the transcription of the 3' partner gene likely gets under the control of the promoter of the 5' partner gene. This results in an elevated transcription of genes which are otherwise transcribed at low levels (Fig. 1B-C). Thus, we identified five putatively novel recurrent fusion genes which were detected in two cohorts independently: NRIP1-MIR99AHG, LATS2-ZMYM2, ATP11A-ING1, MBP-SLC66A2, PRDM16-SKI (Fig. 1D-F). Although these events were called with high evidence, we aim at independent validation by complementary methods. In our study, we have not only demonstrated that the application of RNA-Seq to the detection of fusion genes is a valuable complement to diagnostic routine but also has the potential to discover novel putatively pathogenic fusions. Disclosures No relevant conflicts of interest to declare.

Download Full-text

Abstract 177: RNA sequencing based gene fusion detection with oncomine comprehensive assay plus

10.1158/1538-7445.am2020-177 ◽

2020 ◽

Author(s):

Amir Marcovitz ◽

Rajesh K. Gottimukkala ◽

Gary G. Bee ◽

Jennifer M. Kilzer ◽

Vinay K. Mital ◽

...

Keyword(s):

Rna Sequencing ◽

Gene Fusion ◽

Fusion Detection

Download Full-text

Statistical assessment of gene fusion detection algorithms using RNA Sequencing Data

2012 IEEE Statistical Signal Processing Workshop (SSP) ◽

10.1109/ssp.2012.6319801 ◽

2012 ◽

Cited By ~ 1

Author(s):

Vinay Varadan ◽

Angel Janevski ◽

Sitharthan Kamalakaran ◽

Nilanjana Banerjee ◽

Nevenka Dimitrova ◽

...

Keyword(s):

Rna Sequencing ◽

Gene Fusion ◽

Sequencing Data ◽

Statistical Assessment ◽

Detection Algorithms ◽

Fusion Detection

Download Full-text

SimFuse: A Novel Fusion Simulator for RNA Sequencing (RNA-Seq) Data

BioMed Research International ◽

10.1155/2015/780519 ◽

2015 ◽

Vol 2015 ◽

pp. 1-5 ◽

Cited By ~ 2

Author(s):

Yuxiang Tan ◽

Yann Tambouret ◽

Stefano Monti

Keyword(s):

Sample Size ◽

Rna Sequencing ◽

High Throughput Sequencing ◽

Performance Metrics ◽

Simulated Data ◽

Real Data ◽

Rna Seq ◽

Sequencing Data ◽

Detection Algorithms ◽

Fusion Detection

The performance evaluation of fusion detection algorithms from high-throughput sequencing data crucially relies on the availability of data with known positive and negative cases of gene rearrangements. The use of simulated data circumvents some shortcomings of real data by generation of an unlimited number of true and false positive events, and the consequent robust estimation of accuracy measures, such as precision and recall. Although a few simulated fusion datasets from RNA Sequencing (RNA-Seq) are available, they are of limited sample size. This makes it difficult to systematically evaluate the performance of RNA-Seq based fusion-detection algorithms. Here, we present SimFuse to address this problem. SimFuse utilizes real sequencing data as the fusions’ background to closely approximate the distribution of reads from a real sequencing library and uses a reference genome as the template from which to simulate fusions’ supporting reads. To assess the supporting read-specific performance, SimFuse generates multiple datasets with various numbers of fusion supporting reads. Compared to an extant simulated dataset, SimFuse gives users control over the supporting read features and the sample size of the simulated library, based on which the performance metrics needed for the validation and comparison of alternative fusion-detection algorithms can be rigorously estimated.

Download Full-text

Development and Validation of an RNA Sequencing Assay for Gene Fusion Detection in Formalin-Fixed, Paraffin-Embedded Tumors

Journal of Molecular Diagnostics ◽

10.1016/j.jmoldx.2020.11.005 ◽

2020 ◽

Author(s):

Hao Peng ◽

Rong Huang ◽

Kui Wang ◽

Cuiyun Wang ◽

Bin Li ◽

...

Keyword(s):

Rna Sequencing ◽

Gene Fusion ◽

Formalin Fixed Paraffin ◽

Formalin Fixed Paraffin Embedded ◽

Development And Validation ◽

Fusion Detection ◽

Formalin Fixed

Download Full-text

FusionQ: a novel approach for gene fusion detection and quantification from paired-end RNA-Seq

BMC Bioinformatics ◽

10.1186/1471-2105-14-193 ◽

2013 ◽

Vol 14 (1) ◽

pp. 193 ◽

Cited By ~ 20

Author(s):

Chenglin Liu ◽

Jinwen Ma ◽

ChungChe Chang ◽

Xiaobo Zhou

Keyword(s):

Gene Fusion ◽

Rna Seq ◽

Novel Approach ◽

Fusion Detection ◽

Detection And Quantification

Download Full-text

Clinical implementation of RNA sequencing for Mendelian disease diagnostics

10.1101/2021.04.01.21254633 ◽

2021 ◽

Author(s):

Vicente A. Yepez ◽

Mirjana Gusic ◽

Robert Kopajtich ◽

Christian Mertes ◽

Nicholas H. Smith ◽

...

Keyword(s):

Gene Expression ◽

Rna Sequencing ◽

Genetic Diagnosis ◽

Allelic Expression ◽

Disease Genes ◽

Mendelian Disease ◽

Rna Seq ◽

Aberrant Splicing ◽

Aberrant Expression ◽

Routine Diagnostics

Lack of functional evidence hampers variant interpretation, leaving a large proportion of cases with a suspected Mendelian disorder without genetic diagnosis after genome or whole exome sequencing (WES). Research studies advocate to further sequence transcriptomes to directly and systematically probe gene expression defects. However, collection of additional biopsies, and establishment of lab workflows, analytical pipelines, and defined concepts in clinical interpretation of aberrant gene expression are still needed for adopting RNA-sequencing (RNA-seq) in routine diagnostics. To address these issues, we implemented an automated RNA-seq protocol and a computational workflow with which we analyzed skin fibroblasts of 303 individuals with a suspected mitochondrial disease. We detected on average 12,500 genes per sample including around 60% disease genes - a coverage substantially higher than with whole blood, supporting the use of skin biopsies. We prioritized genes demonstrating aberrant expression, aberrant splicing, or mono-allelic expression. The pipeline required less than one week from sample preparation to result reporting and provided a median of eight disease genes per patient for inspection. A genetic diagnosis was established for 16% of the WES-inconclusive cases. Detection of aberrant expression was a major contributor to diagnosis including instances of 50% reduction, which, together with mono-allelic expression, allowed for the diagnosis of dominant disorders caused by haploinsufficiency. Moreover, calling aberrant splicing and variants from RNA-seq data enabled detecting and validating splice-disrupting variants, of which the majority fell outside WES-covered regions. Together, these results show that streamlined experimental and computational processes can accelerate the implementation of RNA-seq in routine diagnostics.

Download Full-text

LongGF: computational algorithm and software tool for fast and accurate detection of gene fusions by long-read transcriptome sequencing

BMC Genomics ◽

10.1186/s12864-020-07207-4 ◽

2020 ◽

Vol 21 (S11) ◽

Author(s):

Qian Liu ◽

Yu Hu ◽

Andres Stucky ◽

Li Fang ◽

Jiang F. Zhong ◽

...

Keyword(s):

Candidate Gene ◽

Gene Fusion ◽

Superior Performance ◽

Gene Fusions ◽

Rna Seq ◽

Cdna Sequencing ◽

Sequencing Data ◽

Mrna Sequencing ◽

Long Read ◽

Fusion Detection

Abstract Background Long-read RNA-Seq techniques can generate reads that encompass a large proportion or the entire mRNA/cDNA molecules, so they are expected to address inherited limitations of short-read RNA-Seq techniques that typically generate < 150 bp reads. However, there is a general lack of software tools for gene fusion detection from long-read RNA-seq data, which takes into account the high basecalling error rates and the presence of alignment errors. Results In this study, we developed a fast computational tool, LongGF, to efficiently detect candidate gene fusions from long-read RNA-seq data, including cDNA sequencing data and direct mRNA sequencing data. We evaluated LongGF on tens of simulated long-read RNA-seq datasets, and demonstrated its superior performance in gene fusion detection. We also tested LongGF on a Nanopore direct mRNA sequencing dataset and a PacBio sequencing dataset generated on a mixture of 10 cancer cell lines, and found that LongGF achieved better performance to detect known gene fusions over existing computational tools. Furthermore, we tested LongGF on a Nanopore cDNA sequencing dataset on acute myeloid leukemia, and pinpointed the exact location of a translocation (previously known in cytogenetic resolution) in base resolution, which was further validated by Sanger sequencing. Conclusions In summary, LongGF will greatly facilitate the discovery of candidate gene fusion events from long-read RNA-Seq data, especially in cancer samples. LongGF is implemented in C++ and is available at https://github.com/WGLab/LongGF.

Download Full-text

Comprehensive Multi-Omics Analysis of Gene Fusions in a Large Multiple Myeloma Cohort

Blood ◽

10.1182/blood-2018-99-117245 ◽

2018 ◽

Vol 132 (Supplement 1) ◽

pp. 1898-1898

Author(s):

Steven M. Foltz ◽

Qingsong Gao ◽

Christopher J. Yoon ◽

Amila Weerasinghe ◽

Hua Sun ◽

...

Keyword(s):

Multiple Myeloma ◽

Board Of Directors ◽

Research Funding ◽

Gene Fusion ◽

Gene Fusions ◽

Rna Seq ◽

Advisory Committees ◽

Time Points ◽

Detection Algorithms ◽

Fusion Detection

Abstract Introduction: Gene fusions are the result of genomic rearrangements that create hybrid protein products or bring the regulatory elements of one gene into close proximity of another. Fusions often dysregulate gene function or expression through oncogene overexpression or tumor suppressor underexpression (Gao, Liang, Foltz, et al. Cell Rep 2018). Some fusions such as EML4--ALK in lung adenocarcinoma are known druggable targets. Fusion detection algorithms utilize discordantly mapped RNA-seq reads. Careful consideration of detection and filtering procedures is vital for large-scale fusion detection because current methods are prone to reporting false positives and show poor concordance. Multiple myeloma (MM) is a blood cancer in which rapidly expanding clones of plasma cells spread in the bone marrow. Translocations that juxtapose the highly-expressed IGH enhancer with potential oncogenes are associated with overexpression of partner genes, although they may not lead to a detectable gene fusion in RNA-seq data. Previous studies have explored the fusion landscape of multiple myeloma cohorts (Cleynen, et al. Nat Comm 2017; Nasser, et al. Blood 2017). In this study, we developed a novel gene fusion detection pipeline and post-processing strategy to analyze 742 patient samples at the primary time point and 64 samples at follow-up time points (806 total samples) from the Multiple Myeloma Research Foundation (MMRF) CoMMpass Study using RNA-seq, WGS, and clinical data. Methods and Results: We overlapped five fusion detection algorithms (EricScript, FusionCatcher, INTEGRATE, PRADA, and STAR-Fusion) to report fusion events. Our filtered call set consisted of 2,817 fusions with a median of 3 fusions per sample (mean 3.8), similar to glioblastoma, breast, ovarian, and prostate cancers in TCGA. Major recurrent fusions involving immunoglobulin genes included IGH--WHSC1 (88 primary samples), IGL--BMI1 (29), and the upstream neighbor of MYC, PVT1, paired with IGH (6), IGK (3), and IGL (11). For each event, we used WGS data when available to determine if there was genomic support of the gene fusion (based on discordant WGS reads, SV event detection, and MMRF CoMMpass Seq-FISH WGS results) (Miller, et al. Blood 2016). WGS validation rates varied by the level of RNA-seq evidence supporting each fusion, with an overall rate of 24.1%, which is comparable to previously observed pan-cancer validation rates using low-pass WGS. We calculated the association between fusion status and gene expression and identified genes such as BCL2L11, CCND1/2, LTBR, and TXNDC5 that showed significant overexpression (t-test). We explored the clinical connections of fusion events through survival analysis and clinical data correlations, and by mining potentially druggable targets from our Database of Evidence for Precision Oncology (dinglab.wustl.edu/depo) (Sun, Mashl, Sengupta, et al. Bioinformatics 2018). Major examples of upregulated fusion kinases that could potentially be targeted with off-label drug use include FGFR3 and NTRK1. We examined the evolution of fusion events over multiple time points. In one MMRF patient with a t(8;14) translocation joining the IGH locus and transcription factor MAFA, we observed IGH fusions with TOP1MT (neighbor of MAFA) at all four time points with corresponding high expression of TOP1MT and MAFA. Using non-MMRF single-cell RNA data from different patients, we were able to track cell-type composition over time as well as detect subpopulations of cells harboring fusions at different time points with potential treatment implications. Discussion: Gene fusions offer potential targets for alternative MM therapies. Careful implementation of gene fusion detection algorithms and post-processing are essential in large cohort studies to reduce false positives and enrich results for clinically relevant information. Clinical fusion detection from untargeted RNA-seq remains a challenge due to poor sensitivity, specificity, and usability. By combining MMRF CoMMpass data from multiple platforms, we have produced a comprehensive fusion profile of 742 MM patients. We have shown novel gene fusion associations with gene expression and clinical data, and we identified candidates for druggability studies. Disclosures Vij: Bristol-Myers Squibb: Honoraria, Membership on an entity's Board of Directors or advisory committees, Research Funding; Celgene: Honoraria, Membership on an entity's Board of Directors or advisory committees, Research Funding; Jazz Pharmaceuticals: Honoraria, Membership on an entity's Board of Directors or advisory committees; Jansson: Honoraria, Membership on an entity's Board of Directors or advisory committees; Amgen: Honoraria, Membership on an entity's Board of Directors or advisory committees; Karyopharma: Honoraria, Membership on an entity's Board of Directors or advisory committees; Takeda: Honoraria, Membership on an entity's Board of Directors or advisory committees, Research Funding.

Download Full-text

PO-400 Arriba – fast and accurate gene fusion detection from RNA-seq data

10.1136/esmoopen-2018-eacr25.426 ◽

2018 ◽

Cited By ~ 6

Author(s):

S Uhrig ◽

M Fröhlich ◽

B Hutter ◽

B Brors

Keyword(s):

Gene Fusion ◽

Rna Seq ◽

Fusion Detection

Download Full-text