Tài liệu miễn phí Sinh học
Download Tài liệu học tập miễn phí Sinh học
Accurate variant calling and genotyping represent major limiting factors for downstream applications of single-cell genomics. Here, we report Conbase for the identification of somatic mutations in single-cell DNA sequencing data.
4/6/2023 10:26:20 PM +00:00
Circadian rhythms in transcription ultimately result in oscillations of key biological processes. Understanding how transcriptional rhythms are generated in plants provides an opportunity for fine-tuning growth, development, and responses to the environment.
4/6/2023 10:26:09 PM +00:00
Despite its widespread use, RNA-seq is still too laborious and expensive to replace RT-qPCR as the default gene expression analysis method. We present a novel approach, BRB-seq, which uses early multiplexing to produce 3′ cDNA libraries for dozens of samples, requiring just 2 hours of hands-on time.
4/6/2023 10:26:03 PM +00:00
We introduce alevin, a fast end-to-end pipeline to process droplet-based single-cell RNA sequencing data, performing cell barcode detection, read mapping, unique molecular identifier (UMI) deduplication, gene count estimation, and cell barcode whitelisting.
4/6/2023 10:25:56 PM +00:00
Tandemly repeated DNA is highly mutable and causes at least 31 diseases, but it is hard to detect pathogenic repeat expansions genome-wide. Here, we report robust detection of human repeat expansions from careful alignments of long but error-prone (PacBio and nanopore) reads to a reference genome.
4/6/2023 10:25:49 PM +00:00
Deletions that fuse two adjacent topologically associating domains (TADs) can cause severe developmental disorders. We provide a formal method to quantify deletions based on their potential disruption of the three-dimensional genome structure, denoted as the TAD fusion score.
4/6/2023 10:25:42 PM +00:00
Measurements of single-cell methylation are revolutionizing our understanding of epigenetic control of gene expression, yet the intrinsic data sparsity limits the scope for quantitative analysis of such data.
4/6/2023 10:25:35 PM +00:00
Single-cell RNA-seq quantifies biological heterogeneity across both discrete cell types and continuous cell transitions. Partition-based graph abstraction (PAGA) provides an interpretable graph-like map of the arising data manifold, based on estimating connectivity of manifold partitions (https://github.com/theislab/paga).
4/6/2023 10:25:26 PM +00:00
Droplet-based single-cell RNA sequencing protocols have dramatically increased the throughput of single-cell transcriptomics studies. A key computational challenge when processing these data is to distinguish libraries for real cells from empty droplets.
4/6/2023 10:25:19 PM +00:00
Histone acetylation plays a central role in gene regulation and is sensitive to the levels of metabolic intermediates. However, predicting the impact of metabolic alterations on acetylation in pathological conditions is a significant challenge. Here, we present a genome-scale network model that predicts the impact of nutritional environment and genetic alterations on histone acetylation.
4/6/2023 10:25:12 PM +00:00
The Hemiptera (aphids, cicadas, and true bugs) are a key insect order, with high diversity for feeding ecology and excellent experimental tractability for molecular genetics. Building upon recent sequencing of hemipteran pests such as phloem-feeding aphids and blood-feeding bed bugs, we present the genome sequence and comparative analyses centered on the milkweed bug Oncopeltus fasciatus, a seed feeder of the family Lygaeidae.
4/6/2023 10:25:03 PM +00:00
Sequencing errors are key confounding factors for detecting low-frequency genetic variants that are important for cancer molecular diagnosis, treatment, and surveillance using deep next-generation sequencing (NGS). However, there is a lack of comprehensive understanding of errors introduced at various steps of a conventional NGS workflow, such as sample handling, library preparation, PCR enrichment, and sequencing. In this study, we use current NGS technology to systematically investigate these questions.
4/6/2023 10:24:49 PM +00:00
Hi-C is currently the most widely used assay to investigate the 3D organization of the genome and to study its role in gene regulation, DNA replication, and disease. However, Hi-C experiments are costly to perform and involve multiple complex experimental steps; thus, accurate methods for measuring the quality and reproducibility of Hi-C data are essential to determine whether the output should be used further in a study.
4/6/2023 10:24:40 PM +00:00
DNase-seq and ATAC-seq are broadly used methods to assay open chromatin regions genome-wide. The single nucleotide resolution of DNase-seq has been further exploited to infer transcription factor binding sites (TFBSs) in regulatory regions through footprinting.
4/6/2023 10:24:33 PM +00:00
Predicting the effects of genetic variants on splicing is highly relevant for human genetics. We describe the framework MMSplice (modular modeling of splicing) with which we built the winning model of the CAGI5 exon skipping prediction challenge. T
4/6/2023 10:24:26 PM +00:00
Protein recoding by RNA editing is required for normal health and evolutionary adaptation. However, de novo induction of RNA editing in response to environmental factors is an uncommon phenomenon. While APOBEC3A edits many mRNAs in monocytes and macrophages in response to hypoxia and interferons, the physiological significance of such editing is unclear.
4/6/2023 10:24:19 PM +00:00
The sharp increase of plant genome and transcriptome data provide valuable resources to investigate evolutionary consequences of gene duplication in a range of taxa, and unravel common principles underlying duplicate gene retention.
4/6/2023 10:24:11 PM +00:00
Human selection has a long history of transforming crop genomes. Peach (Prunus persica) has undergone more than 5000 years of domestication that led to remarkable changes in a series of agronomically important traits, but genetic bases underlying these changes and the effects of artificial selection on genomic diversity are not well understood.
4/6/2023 10:24:04 PM +00:00
Adenosine-to-inosine (A-to-I) RNA editing is an essential post-transcriptional mechanism mediated by ADAR enzymes that have been recently associated with cancer. Here, we characterize the inosinome signature in normal brain and de novo glioblastoma (GBM) using new metrics that re-stratify GBM patients according to their editing profiles and indicate this post-transcriptional event as a possible molecular mechanism for sexual dimorphism in GBM.
4/6/2023 10:23:55 PM +00:00
The explosive growth in taxonomic metagenome profiling methods over the past years has created a need for systematic comparisons using relevant performance criteria. The Open-community Profiling Assessment tooL (OPAL) implements commonly used performance metrics, including those of the first challenge of the initiative for the Critical Assessment of Metagenome Interpretation (CAMI), together with convenient visualizations.
4/6/2023 10:23:47 PM +00:00
Measuring gene expression of tumor clones at single-cell resolution links functional consequences to somatic alterations. Without scalable methods to simultaneously assay DNA and RNA from the same single cell, parallel single-cell DNA and RNA measurements from independent cell populations must be mapped for genome-transcriptome association.
4/6/2023 10:23:40 PM +00:00
Alternative splicing is a key regulatory mechanism in eukaryotic cells and increases the effective number of functionally distinct gene products. Using bulk RNA sequencing, splicing variation has been studied across human tissues and in genetically diverse populations.
4/6/2023 10:23:34 PM +00:00
Transposase-Accessible Chromatin followed by sequencing (ATAC-seq) is a simple protocol for detection of open chromatin. Computational footprinting, the search for regions with depletion of cleavage events due to transcription factor binding, is poorly understood for ATAC-seq.
4/6/2023 10:23:24 PM +00:00
RNA biology is revolutionized by recent developments of diverse high-throughput technologies for transcriptomewide profiling of molecular RNA structures. RNA structurome profiling data can be used to identify differentially structured regions between groups of samples.
4/6/2023 10:23:13 PM +00:00
We propose a statistical boosting method, termed I-Boost, to integrate multiple types of high-dimensional genomics data with clinical data for predicting survival time. I-Boost provides substantially higher prediction accuracy than existing methods.
4/6/2023 10:23:06 PM +00:00
Recent technical improvements in single-cell RNA sequencing (scRNA-seq) have enabled massively parallel profiling of transcriptomes, thereby promoting large-scale studies encompassing a wide range of cell types of multicellular organisms.
4/6/2023 10:22:57 PM +00:00
Recent studies have revealed thousands of A-to-I RNA editing events in primates, but the origination and general functions of these events are not well addressed. Results: Here, we perform a comparative editome study in human and rhesus macaque and uncover a substantial proportion of macaque A-to-I editing sites that are genomically polymorphic in some animals or encoded as noneditable nucleotides in human.
4/6/2023 10:22:50 PM +00:00
CRISPR-Cas9 genome editing is widely used to study gene function, from basic biology to biomedical research. Structural rearrangements are a ubiquitous feature of cancer cells and their impact on the functional consequences of CRISPR-Cas9 gene-editing has not yet been assessed.
4/6/2023 10:22:43 PM +00:00
Structural variants (SVs) are known to play important roles in a variety of cancers, but their origins and functional consequences are still poorly understood. Many SVs are thought to emerge from errors in the repair processes following DNA double strand breaks (DSBs).
4/6/2023 10:22:32 PM +00:00
State-of-the-art methods assessing pathogenic non-coding variants have mostly been characterized on common disease-associated polymorphisms, yet with modest accuracy and strong positional biases. In this study, we curated 737 high-confidence pathogenic non-coding variants associated with monogenic Mendelian diseases.
4/6/2023 10:22:25 PM +00:00