Genome Biology
Latest Publications


TOTAL DOCUMENTS

2678
(FIVE YEARS 964)

H-INDEX

136
(FIVE YEARS 35)

Published By Springer Science And Business Media LLC

1474-760x

2022 ◽  
Vol 23 (1) ◽  
Author(s):  
Thomas H. Jonkman ◽  
Koen F. Dekkers ◽  
Roderick C. Slieker ◽  
Crystal D. Grant ◽  
M. Arfan Ikram ◽  
...  

Abstract Background Epigenetic clocks use DNA methylation (DNAm) levels of specific sets of CpG dinucleotides to accurately predict individual chronological age. A popular application of these clocks is to explore whether the deviation of predicted age from chronological age is associated with disease phenotypes, where this deviation is interpreted as a potential biomarker of biological age. This wide application, however, contrasts with the limited insight in the processes that may drive the running of epigenetic clocks. Results We perform a functional genomics analysis on four epigenetic clocks, including Hannum’s blood predictor and Horvath’s multi-tissue predictor, using blood DNA methylome and transcriptome data from 3132 individuals. The four clocks result in similar predictions of individual chronological age, and their constituting CpGs are correlated in DNAm level and are enriched for similar histone modifications and chromatin states. Interestingly, DNAm levels of CpGs from the clocks are commonly associated with gene expression in trans. The gene sets involved are highly overlapping and enriched for T cell processes. Further analysis of the transcriptome and methylome of sorted blood cell types identifies differences in DNAm between naive and activated T and NK cells as a probable contributor to the clocks. Indeed, within the same donor, the four epigenetic clocks predict naive cells to be up to 40 years younger than activated cells. Conclusions The ability of epigenetic clocks to predict chronological age involves their ability to detect changes in proportions of naive and activated immune blood cells, an established feature of immuno-senescence. This finding may contribute to the interpretation of associations between clock-derived measures and age-related health outcomes.


2022 ◽  
Vol 23 (1) ◽  
Author(s):  
Yanyu Liang ◽  
Milton Pividori ◽  
Ani Manichaikul ◽  
Abraham A. Palmer ◽  
Nancy J. Cox ◽  
...  

Abstract Background Polygenic risk scores (PRS) are valuable to translate the results of genome-wide association studies (GWAS) into clinical practice. To date, most GWAS have been based on individuals of European-ancestry leading to poor performance in populations of non-European ancestry. Results We introduce the polygenic transcriptome risk score (PTRS), which is based on predicted transcript levels (rather than SNPs), and explore the portability of PTRS across populations using UK Biobank data. Conclusions We show that PTRS has a significantly higher portability (Wilcoxon p=0.013) in the African-descent samples where the loss of performance is most acute with better performance than PRS when used in combination.


2022 ◽  
Vol 23 (1) ◽  
Author(s):  
Gaoyang Li ◽  
Shaliu Fu ◽  
Shuguang Wang ◽  
Chenyu Zhu ◽  
Bin Duan ◽  
...  

AbstractHere, we present a multi-modal deep generative model, the single-cell Multi-View Profiler (scMVP), which is designed for handling sequencing data that simultaneously measure gene expression and chromatin accessibility in the same cell, including SNARE-seq, sci-CAR, Paired-seq, SHARE-seq, and Multiome from 10X Genomics. scMVP generates common latent representations for dimensionality reduction, cell clustering, and developmental trajectory inference and generates separate imputations for differential analysis and cis-regulatory element identification. scMVP can help mitigate data sparsity issues with imputation and accurately identify cell groups for different joint profiling techniques with common latent embedding, and we demonstrate its advantages on several realistic datasets.


2022 ◽  
Vol 23 (1) ◽  
Author(s):  
Andre L. M. Reis ◽  
Ira W. Deveson ◽  
Bindu Swapna Madala ◽  
Ted Wong ◽  
Chris Barker ◽  
...  

Abstract Background Next-generation sequencing (NGS) can identify mutations in the human genome that cause disease and has been widely adopted in clinical diagnosis. However, the human genome contains many polymorphic, low-complexity, and repetitive regions that are difficult to sequence and analyze. Despite their difficulty, these regions include many clinically important sequences that can inform the treatment of human diseases and improve the diagnostic yield of NGS. Results To evaluate the accuracy by which these difficult regions are analyzed with NGS, we built an in silico decoy chromosome, along with corresponding synthetic DNA reference controls, that encode difficult and clinically important human genome regions, including repeats, microsatellites, HLA genes, and immune receptors. These controls provide a known ground-truth reference against which to measure the performance of diverse sequencing technologies, reagents, and bioinformatic tools. Using this approach, we provide a comprehensive evaluation of short- and long-read sequencing instruments, library preparation methods, and software tools and identify the errors and systematic bias that confound our resolution of these remaining difficult regions. Conclusions This study provides an analytical validation of diagnosis using NGS in difficult regions of the human genome and highlights the challenges that remain to resolve these difficult regions.


2022 ◽  
Vol 23 (1) ◽  
Author(s):  
Ping Lin ◽  
Kailiang Wang ◽  
Yupeng Wang ◽  
Zhikang Hu ◽  
Chao Yan ◽  
...  

Abstract Background As a perennial crop, oil-Camellia possesses a long domestication history and produces high-quality seed oil that is beneficial to human health. Camellia oleifera Abel. is a sister species to the tea plant, which is extensively cultivated for edible oil production. However, the molecular mechanism of the domestication of oil-Camellia is still limited due to the lack of sufficient genomic information. Results To elucidate the genetic and genomic basis of evolution and domestication, here we report a chromosome-scale reference genome of wild oil-Camellia (2.95 Gb), together with transcriptome sequencing data of 221 cultivars. The oil-Camellia genome, assembled by an integrative approach of multiple sequencing technologies, consists of a large proportion of repetitive elements (76.1%) and high heterozygosity (2.52%). We construct a genetic map of high-density corrected markers by sequencing the controlled-pollination hybrids. Genome-wide association studies reveal a subset of artificially selected genes that are involved in the oil biosynthesis and phytohormone pathways. Particularly, we identify the elite alleles of genes encoding sugar-dependent triacylglycerol lipase 1, β-ketoacyl-acyl carrier protein synthase III, and stearoyl-acyl carrier protein desaturases; these alleles play important roles in enhancing the yield and quality of seed oil during oil-Camellia domestication. Conclusions We generate a chromosome-scale reference genome for oil-Camellia plants and demonstrate that the artificial selection of elite alleles of genes involved in oil biosynthesis contributes to oil-Camellia domestication.


2022 ◽  
Vol 23 (1) ◽  
Author(s):  
Maria Mircea ◽  
Mazène Hochane ◽  
Xueying Fan ◽  
Susana M. Chuva de Sousa Lopes ◽  
Diego Garlaschelli ◽  
...  

AbstractThe ability to discover new cell phenotypes by unsupervised clustering of single-cell transcriptomes has revolutionized biology. Currently, there is no principled way to decide whether a cluster of cells contains meaningful subpopulations that should be further resolved. Here, we present phiclust (ϕclust), a clusterability measure derived from random matrix theory that can be used to identify cell clusters with non-random substructure, testably leading to the discovery of previously overlooked phenotypes.


2022 ◽  
Vol 23 (1) ◽  
Author(s):  
Navchetan Kaur ◽  
Boris Oskotsky ◽  
Atul J. Butte ◽  
Zicheng Hu

Abstract Background Angiotensin-converting enzyme 2 (ACE2) is the cell-entry receptor for SARS-CoV-2. It plays critical roles in both the transmission and the pathogenesis of COVID-19. Comprehensive profiling of ACE2 expression patterns could reveal risk factors of severe COVID-19 illness. While the expression of ACE2 in healthy human tissues has been well characterized, it is not known which diseases and drugs might be associated with ACE2 expression. Results We develop GENEVA (GENe Expression Variance Analysis), a semi-automated framework for exploring massive amounts of RNA-seq datasets. We apply GENEVA to 286,650 publicly available RNA-seq samples to identify any previously studied experimental conditions that could be directly or indirectly associated with ACE2 expression. We identify multiple drugs, genetic perturbations, and diseases that are associated with the expression of ACE2, including cardiomyopathy, HNF1A overexpression, and drug treatments with RAD140 and itraconazole. Our joint analysis of seven datasets confirms ACE2 upregulation in all cardiomyopathy categories. Using electronic health records data from 3936 COVID-19 patients, we demonstrate that patients with pre-existing cardiomyopathy have an increased mortality risk than age-matched patients with other cardiovascular conditions. GENEVA is applicable to any genes of interest and is freely accessible at http://genevatool.org. Conclusions This study identifies multiple diseases and drugs that are associated with the expression of ACE2. The effect of these conditions should be carefully studied in COVID-19 patients. In particular, our analysis identifies cardiomyopathy patients as a high-risk group, with increased ACE2 expression in the heart and increased mortality after SARS-COV-2 infection.


2022 ◽  
Vol 23 (1) ◽  
Author(s):  
Xiang Gao ◽  
Xu-Kai Ma ◽  
Xiang Li ◽  
Guo-Wei Li ◽  
Chu-Xiao Liu ◽  
...  

AbstractMany circular RNAs (circRNAs) are produced from back-splicing of exons of precursor mRNAs and are generally co-expressed with cognate linear RNAs. Methods for circRNA-specific knockout are lacking, largely due to sequence overlaps between forms. Here, we use base editors (BEs) for circRNA depletion. By targeting splice sites involved in both back-splicing and canonical splicing, BEs can repress circular and linear RNAs. Targeting sites predominantly for circRNA biogenesis, BEs could efficiently repress the production of circular but not linear RNAs. As hundreds of exons are predominantly back-spliced to produce circRNAs, this provides an efficient method to deplete circRNAs for functional study.


2022 ◽  
Vol 23 (1) ◽  
Author(s):  
Jonathan Tyler ◽  
Yining Lu ◽  
Jay Dunlap ◽  
Daniel B. Forger

Abstract Background Circadian (daily) timekeeping is essential to the survival of many organisms. An integral part of all circadian timekeeping systems is negative feedback between an activator and repressor. However, the role of this feedback varies widely between lower and higher organisms. Results Here, we study repression mechanisms in the cyanobacterial and eukaryotic clocks through mathematical modeling and systems analysis. We find a common mathematical model that describes the mechanism by which organisms generate rhythms; however, transcription’s role in this has diverged. In cyanobacteria, protein sequestration and phosphorylation generate and regulate rhythms while transcription regulation keeps proteins in proper stoichiometric balance. Based on recent experimental work, we propose a repressor phospholock mechanism that models the negative feedback through transcription in clocks of higher organisms. Interestingly, this model, when coupled with activator phosphorylation, allows for oscillations over a wide range of protein stoichiometries, thereby reconciling the negative feedback mechanism in Neurospora with that in mammals and cyanobacteria. Conclusions Taken together, these results paint a picture of how circadian timekeeping may have evolved.


Sign in / Sign up

Export Citation Format

Share Document