genomic locations Latest Research Papers

MGcount: a total RNA-seq quantification tool to address multi-mapping and multi-overlapping alignments ambiguity in non-coding transcripts

BMC Bioinformatics ◽

10.1186/s12859-021-04544-3 ◽

2022 ◽

Vol 23 (1) ◽

Author(s):

Andrea Hita ◽

Gilles Brocart ◽

Ana Fernandez ◽

Marc Rehmsmeier ◽

Anna Alemany ◽

...

Keyword(s):

Rna Sequencing ◽

Genomic Region ◽

Simultaneous Estimation ◽

Rna Seq ◽

Protein Coding ◽

Total Rna ◽

Simultaneous Study ◽

Downstream Analysis ◽

And Function ◽

Genomic Locations

Abstract Background Total-RNA sequencing (total-RNA-seq) allows the simultaneous study of both the coding and the non-coding transcriptome. Yet, computational pipelines have traditionally focused on particular biotypes, making assumptions that are not fullfilled by total-RNA-seq datasets. Transcripts from distinct RNA biotypes vary in length, biogenesis, and function, can overlap in a genomic region, and may be present in the genome with a high copy number. Consequently, reads from total-RNA-seq libraries may cause ambiguous genomic alignments, demanding for flexible quantification approaches. Results Here we present Multi-Graph count (MGcount), a total-RNA-seq quantification tool combining two strategies for handling ambiguous alignments. First, MGcount assigns reads hierarchically to small-RNA and long-RNA features to account for length disparity when transcripts overlap in the same genomic position. Next, MGcount aggregates RNA products with similar sequences where reads systematically multi-map using a graph-based approach. MGcount outputs a transcriptomic count matrix compatible with RNA-sequencing downstream analysis pipelines, with both bulk and single-cell resolution, and the graphs that model repeated transcript structures for different biotypes. The software can be used as a python module or as a single-file executable program. Conclusions MGcount is a flexible total-RNA-seq quantification tool that successfully integrates reads that align to multiple genomic locations or that overlap with multiple gene features. Its approach is suitable for the simultaneous estimation of protein-coding, long non-coding and small non-coding transcript concentration, in both precursor and processed forms. Both source code and compiled software are available at https://github.com/hitaandrea/MGcount.

Download Full-text

H2A.Z deposition by SWR1C involves multiple ATP-dependent steps

10.1101/2022.01.11.475888 ◽

2022 ◽

Author(s):

Jiayi Fan ◽

Andrew T Moreno ◽

Alexander S Baier ◽

Joseph J Loparo ◽

Craig L Peterson

Keyword(s):

Single Molecule ◽

Reaction Pathway ◽

Histone H2a ◽

Time Analysis ◽

Rapid Loss ◽

Protein Coding ◽

Nucleosome Remodeling ◽

Real Time Analysis ◽

Dna Unwrapping ◽

Genomic Locations

The histone variant H2A.Z is a conserved feature of nucleosomes flanking protein-coding genes. Deposition of H2A.Z requires ATP-dependent replacement of nucleosomal H2A by a chromatin remodeler related to the multi-subunit enzyme, yeast SWR1C. How these enzymes use ATP to promote this nucleosome editing reaction remains unclear. Here we use single-molecule and ensemble methodologies to identify three ATP-dependent phases in the H2A.Z deposition reaction. Real-time analysis of single nucleosome remodeling events reveals an initial, priming step that occurs after ATP addition that likely involves transient DNA unwrapping from the nucleosome. Priming is followed by rapid loss of histone H2A, which is subsequently released from the H2A.Z nucleosomal product. Surprisingly, the rates of both priming and the release of the H2A/H2B dimer are sensitive to ATP concentration. This complex reaction pathway provides multiple opportunities to regulate the timely and accurate deposition of H2A.Z at key genomic locations.

Download Full-text

Cut-and-Paste DNA Insertion with Engineered Type V-K CRISPR-associated Transposases

10.1101/2022.01.07.475005 ◽

2022 ◽

Author(s):

Connor J Tou ◽

Benno Orr ◽

Benjamin P Kleinstiver

Keyword(s):

Homing Endonuclease ◽

Type I ◽

Insertion Product ◽

Dna Insertion ◽

Transposition Mechanism ◽

Type V ◽

Genomic Locations ◽

Distinct Features ◽

Robust Integration ◽

Replicative Transposition

CRISPR-associated transposases (CASTs) enable recombination-independent, multi-kilobase DNA insertions at RNA-programmed genomic locations. Type V-K CASTs offer distinct technological advantages over type I CASTs given their smaller coding size, fewer components, and unidirectional insertions. However, the utility of type V-K CASTs is hindered by a replicative transposition mechanism that results in a mixture of desired simple cargo insertions and undesired plasmid co-integrate products. Here, we overcome this limitation by engineering new CASTs with dramatically improved product purity. To do so, we compensate for the absence of the TnsA subunit in multiple type V-K CASTs by engineering a Homing Endonuclease-assisted Large-sequence Integrating CAST compleX, or HELIX system. HELIX utilizes a nicking homing endonuclease (nHE) fused to TnsB to restore the 5-prime nicking capability needed for dual-nicking of the DNA donor. By leveraging distinct features of both type V-K and type I systems, HELIX enables cut-and-paste DNA insertion with up to 99.3% simple insertion product purity, while retaining robust integration efficiencies on genomic targets. Furthermore, we demonstrate the versatility of this approach by generating HELIX systems for other CAST orthologs. We also establish the feasibility of creating a minimal, 3-component HELIX, simplifying the number of proteins that must be expressed. Together, HELIX streamlines and improves the application of CRISPR-based transposition technologies, eliminating barriers for efficient and specific RNA-guided DNA insertions.

Download Full-text

Mapping of crown rust (Puccinia coronata f. sp. avenae) resistance gene Pc54 and a novel quantitative trait locus effective against powdery mildew (Blumeria graminis f. sp. avenae) in the oat (Avena sativa) line Pc54

Phytopathology ◽

10.1094/phyto-10-21-0445-r ◽

2022 ◽

Author(s):

Belayneh Admassu Yimer ◽

Kathy Esvelt Klos ◽

Irene Griffiths ◽

Alexander Cowan ◽

Catherine Howarth

Keyword(s):

Molecular Markers ◽

Powdery Mildew ◽

Resistance Gene ◽

Interval Mapping ◽

Crown Rust ◽

The Novel ◽

Unknown Gene ◽

Trait Locus ◽

Separate Identity ◽

Genomic Locations

The Pc54 oat line carries the crown rust resistance gene ‘Pc54’ and an unknown gene effective against powdery mildew. In this study two recombinant inbred line populations were developed to identify the genomic locations of the two genes and producing lists of molecular markers with a potential for marker assisted selection. The RILs and parents were phenotyped for crown rust and powdery mildew in a controlled environment. They were also genotyped using the 6K Illumina Infinium iSelect oat SNP chip. Multiple interval mapping placed Pc54 on the linkage group Mrg02 (chromosome 7D) and the novel powdery mildew QTL ‘QPm.18’ on Mrg18 (chromosome 1A) both in the mapping and validating population. A total of nine and 31 significant molecular markers were identified linked with the Pc54 gene and QPm.18, respectively. Reactions to crown rust inoculations have justified separate identity of Pc54 from other genes and QTL that have previously been reported on Mrg02 except for ’qPCRFd’. Pm3 is the only powdery mildew resistance gene previously mapped on Mrg18. However, the pm3 differential line, Mostyn was susceptible to the powdery mildew race used in this study suggesting that Pm3 and QPm.18 are different genes. Determining the chromosomal locations of Pc54 and QPm.18 is helpful for better understanding the molecular mechanism of resistance to crown rust and powdery mildew in oats. Furthermore, SNPs and SSRs that are closely linked with the genes could be valuable for developing PCR based molecular markers and facilitating the utilization of these genes in oat breeding programs.

Download Full-text

Genomic environments scale the activities of diverse core promoters

Genome Research ◽

10.1101/gr.276025.121 ◽

2021 ◽

Author(s):

Clarice K.Y. Hong ◽

Barak A. Cohen

Keyword(s):

Core Promoter ◽

Classical Model ◽

Regulatory Elements ◽

Sequence Motifs ◽

Core Promoters ◽

The Core ◽

Genomic Location ◽

Basal Transcriptional Machinery ◽

Promoter Specificity ◽

Genomic Locations

A classical model of gene regulation is that enhancers provide specificity whereas core promoters provide a modular site for the assembly of the basal transcriptional machinery. However, examples of core promoter specificity have led to an alternate hypothesis in which specificity is achieved by core promoters with different sequence motifs that respond differently to genomic environments containing different enhancers and chromatin landscapes. To distinguish between these models, we measured the activities of hundreds of diverse core promoters in four different genomic locations and, in a complementary experiment, six different core promoters at thousands of locations across the genome. Although genomic locations had large effects on expression, the intrinsic activities of different classes of promoters were preserved across genomic locations, suggesting that core promoters are modular regulatory elements whose activities are independently scaled up or down by different genomic locations. This scaling of promoter activities is nonlinear and depends on the genomic location and the strength of the core promoter. Our results support the classical model of regulation in which diverse core promoter motifs set the intrinsic strengths of core promoters, which are then amplified or dampened by the activities of their genomic environments.

Download Full-text

BSImp: imputing partially observed methylation patterns for evaluating methylation heterogeneity

10.1101/2021.12.07.471020 ◽

2021 ◽

Author(s):

Ya-Ting Chang ◽

Ming-Ren Yen ◽

Pao-Yang Chen

Keyword(s):

Cellular Heterogeneity ◽

Accurate Estimation ◽

Genome Wide ◽

Partially Observed ◽

Genome Screening ◽

Single Base Pair ◽

Downstream Analysis ◽

Genomic Locations ◽

User Friendly ◽

Methylation Patterns

DNA methylation is one of the most studied epigenetic modifications that has applications ranging from transcriptional regulation to aging, and can be assessed by bisulfite sequencing (BS-seq) at single base-pair resolution. The permutations of methylation statuses at bisulfite converted reads reflect the methylation patterns of individual cells. These patterns at specific genomic locations are sought to be indicative of cellular heterogeneity within a cellular population, which are predictive of developments and diseases; therefore, methylation heterogeneity has potentials in early detection of these changes. Computational methods have been developed to assess methylation heterogeneity using methylation patterns formed by four CpGs, but the nature of shotgun sequencing often give partially observed patterns, which makes very limited data available for downstream analysis. While many programs are developed to impute methylation levels genomewide, currently there is only one method developed for recovering partially observed methylation patterns; however, the program needs lots of data to train and cannot be used directly; therefore, we developed a probabilistic-based imputation method that uses information from neighbouring sites to recover partially observed methylation patterns speedily. It is demonstrated to allow for the evaluation of methylation heterogeneity at three times more regions genome-wide with high accuracy for data with moderate depth. To make it more user-friendly we also provide a computational pipeline for genome-screening, which can be used in both evaluating methylation levels and profiling methylation patterns genomewide for all cytosine contexts, which is the first of its kind. Our method allows for accurate estimation of methylation levels and makes evaluating methylation heterogeneity available for much more data with reasonable coverage, which has important implications in using methylation heterogeneity for monitoring changes within the cellular populations that were impossible to detect for the assessment of development and diseases.

Download Full-text

BXD Recombinant Inbred Mice as a Model to Study Neurotoxicity

Biomolecules ◽

10.3390/biom11121762 ◽

2021 ◽

Vol 11 (12) ◽

pp. 1762

Author(s):

Airton C. Martins ◽

Caridad López-Granero ◽

Beatriz Ferrer ◽

Alexey A. Tinkov ◽

Anatoly V. Skalny ◽

...

Keyword(s):

Recombinant Inbred ◽

Inbred Mice ◽

Reference Population ◽

Recombinant Inbred Mice ◽

Complex Phenotypes ◽

Pesticide Exposures ◽

Underlying Mechanisms ◽

Genomic Locations ◽

Segregating Population ◽

Ri Lines

BXD recombinant inbred (RI) lines represent a genetic reference population derived from a cross between C57BL/6J mice (B6) and DBA/2J mice (D2), which through meiotic recombination events possesses recombinant chromosomes containing B6 or D2 haplotype segments. The quantitative trait loci (QTLs) are the locations of segregating genetic polymorphisms and are fundamental to understanding genetic diversity in human disease susceptibility and severity. QTL mapping represents the typical approach for identifying naturally occurring polymorphisms that influence complex phenotypes. In this process, genotypic values at markers of known genomic locations are associated with phenotypic values measured in a segregating population. Indeed, BXD RI strains provide a powerful tool to study neurotoxicity induced by different substances. In this review, we describe the use of BXD RI lines to understand the underlying mechanisms of neurotoxicity in response to ethanol and cocaine, as well as metals and pesticide exposures.

Download Full-text

The Expanding Constellation of Histone Post-Translational Modifications in the Epigenetic Landscape

Genes ◽

10.3390/genes12101596 ◽

2021 ◽

Vol 12 (10) ◽

pp. 1596

Author(s):

Vincenzo Cavalieri

Keyword(s):

Environmental Influence ◽

Epigenetic Landscape ◽

Adaptive Responses ◽

Post Translational Modification ◽

Exciting Field ◽

Post Translational Modifications ◽

Linker Histones ◽

Nucleosome Dynamics ◽

Epigenetic Machinery ◽

Genomic Locations

The emergence of a nucleosome-based chromatin structure accompanied the evolutionary transition from prokaryotes to eukaryotes. In this scenario, histones became the heart of the complex and precisely timed coordination between chromatin architecture and functions during adaptive responses to environmental influence by means of epigenetic mechanisms. Notably, such an epigenetic machinery involves an overwhelming number of post-translational modifications at multiple residues of core and linker histones. This review aims to comprehensively describe old and recent evidence in this exciting field of research. In particular, histone post-translational modification establishing/removal mechanisms, their genomic locations and implication in nucleosome dynamics and chromatin-based processes, as well as their harmonious combination and interdependence will be discussed.

Download Full-text

Genomic and Epigenetic Foundations of Neocentromere Formation

Annual Review of Genetics ◽

10.1146/annurev-genet-071719-020924 ◽

2021 ◽

Vol 55 (1) ◽

Author(s):

Evon M. DeBose-Scarlett ◽

Beth A. Sullivan

Keyword(s):

Genome Instability ◽

Genome Project ◽

Model Organisms ◽

Annual Review ◽

Publication Date ◽

Open Chromatin ◽

Developmental Defects ◽

Active Transcription ◽

Genomic Locations ◽

The Human Genome Project

Centromeres are essential to genome inheritance, serving as the site of kinetochore assembly and coordinating chromosome segregation during cell division. Abnormal centromere function is associated with birth defects, infertility, and cancer. Normally, centromeres are assembled and maintained at the same chromosomal location. However, ectopic centromeres form spontaneously at new genomic locations and contribute to genome instability and developmental defects as well as to acquired and congenital human disease. Studies in model organisms have suggested that certain regions of the genome, including pericentromeres, heterochromatin, and regions of open chromatin or active transcription, support neocentromere activation. However, there is no universal mechanism that explains neocentromere formation. This review focuses on recent technological and intellectual advances in neocentromere research and proposes future areas of study. Understanding neocentromere biology will provide a better perspective on chromosome and genome organization and functional context for information generated from the Human Genome Project, ENCODE, and other large genomic consortia. Expected final online publication date for the Annual Review of Genetics, Volume 55 is November 2021. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.

Download Full-text

DNA Replication-transcription conflicts do not significantly contribute to spontaneous mutations due to replication errors in Escherichia coli

10.1101/2021.08.23.457454 ◽

2021 ◽

Author(s):

Patricia L Foster ◽

Brittany A Niccum ◽

Heewook Lee

Keyword(s):

Escherichia Coli ◽

Dna Replication ◽

Base Pair ◽

Point Mutations ◽

Trna Genes ◽

Small Indels ◽

Replication Errors ◽

Highly Expressed Genes ◽

Genomic Locations ◽

Orientation Bias

Encounters between DNA replication and transcription can cause genomic disruption, particularly when the two meet head-on. Whether these conflicts produce point mutations is debated. This paper presents detailed analyses of a large collection of mutations generated during mutation accumulation experiments with mismatch-repair (MMR) defective Escherichia coli. With MMR absent, mutations are primarily due to DNA replication errors. Overall, there were no differences in the frequencies of base-pair substitutions or small indels (insertion and deletions ≤ 4 bp) in the coding sequences or promoters of genes oriented codirectionally versus head-on to replication. Among a subset of highly expressed genes there was a 2- to 3-fold bias for indels in genes oriented head-on to replication, but this difference was almost entirely due to the asymmetrical genomic locations of tRNA genes containing mononucleotide runs, which are hotspots for indels.No additional orientation bias in mutation frequencies occurred when MMR-strains were also defective for transcription-coupled repair (TCR). However, in contrast to other reports, loss of TCR slightly increased the overall mutation rate, meaning that TCR is antimutagenic. There was no orientation bias in mutation frequencies among the stress-response genes that are regulated by RpoS or induced by DNA damage. Thus, biases in the locations of mutational targets can account for most, if not all, apparent biases in mutation frequencies between genes oriented head-on versus co-directional to replication. In addition, the data revealed a strong correlation of the frequency of base-pair substitutions with gene length, but no correlation with gene expression levels.

Download Full-text

genomic locations
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

MGcount: a total RNA-seq quantification tool to address multi-mapping and multi-overlapping alignments ambiguity in non-coding transcripts

H2A.Z deposition by SWR1C involves multiple ATP-dependent steps

Cut-and-Paste DNA Insertion with Engineered Type V-K CRISPR-associated Transposases

Mapping of crown rust (Puccinia coronata f. sp. avenae) resistance gene Pc54 and a novel quantitative trait locus effective against powdery mildew (Blumeria graminis f. sp. avenae) in the oat (Avena sativa) line Pc54

Genomic environments scale the activities of diverse core promoters

BSImp: imputing partially observed methylation patterns for evaluating methylation heterogeneity

BXD Recombinant Inbred Mice as a Model to Study Neurotoxicity

The Expanding Constellation of Histone Post-Translational Modifications in the Epigenetic Landscape

Genomic and Epigenetic Foundations of Neocentromere Formation

DNA Replication-transcription conflicts do not significantly contribute to spontaneous mutations due to replication errors in Escherichia coli

Export Citation Format

genomic locationsRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

MGcount: a total RNA-seq quantification tool to address multi-mapping and multi-overlapping alignments ambiguity in non-coding transcripts

H2A.Z deposition by SWR1C involves multiple ATP-dependent steps

Cut-and-Paste DNA Insertion with Engineered Type V-K CRISPR-associated Transposases

Mapping of crown rust (Puccinia coronata f. sp. avenae) resistance gene Pc54 and a novel quantitative trait locus effective against powdery mildew (Blumeria graminis f. sp. avenae) in the oat (Avena sativa) line Pc54

Genomic environments scale the activities of diverse core promoters

BSImp: imputing partially observed methylation patterns for evaluating methylation heterogeneity

BXD Recombinant Inbred Mice as a Model to Study Neurotoxicity

The Expanding Constellation of Histone Post-Translational Modifications in the Epigenetic Landscape

Genomic and Epigenetic Foundations of Neocentromere Formation

DNA Replication-transcription conflicts do not significantly contribute to spontaneous mutations due to replication errors in Escherichia coli

genomic locations
Recently Published Documents