Tài liệu miễn phí Sinh học
Download Tài liệu học tập miễn phí Sinh học
The accuracy of transcript quantification using RNA-seq data depends on many factors, such as the choice of alignment or mapping method and the quantification model being adopted. While the choice of quantification model has been shown to be important, considerably less attention has been given to comparing the effect of various read alignment approaches on quantification accuracy.
4/6/2023 5:38:55 AM +00:00
The central nervous system (CNS) is vulnerable to viral infection, yet few host factors in the CNS are known to defend against invasion by neurotropic viruses. Long noncoding RNAs (lncRNAs) have been revealed to play critical roles in a wide variety of biological processes and are highly abundant in the mammalian brain, but their roles in defending against invasion of pathogens into the CNS remain unclear.
4/6/2023 5:38:44 AM +00:00
Viral load is a major contributor to outcome in patients with Ebola virus disease (EVD), with high values leading to a fatal outcome. Evidence from the 2013–2016 Ebola virus (EBOV) outbreak indicated that different genotypes of the virus can have different phenotypes in patients.
4/6/2023 5:38:36 AM +00:00
It is a major challenge to integrate single-cell sequencing data across experiments, conditions, batches, time points, and other technical considerations. New computational methods are required that can integrate samples while simultaneously preserving biological information.
4/6/2023 5:38:28 AM +00:00
Fine-tuning quantitative traits for continuous subtle phenotypes is highly advantageous. We engineer the highly conserved upstream open reading frame (uORF) of FvebZIPs1.1 in strawberry (Fragaria vesca), using base editor A3A-PBE. Seven novel alleles are generated.
4/6/2023 5:38:20 AM +00:00
Compared to proteins, glycans, and lipids, much less is known about RNAs on the cell surface. We develop a series of technologies to test for any nuclearencoded RNAs that are stably attached to the cell surface and exposed to the extracellular space, hereafter called membrane-associated extracellular RNAs (maxRNAs).
4/6/2023 5:38:08 AM +00:00
Quantifying the genetic heterogeneity of a cell population is essential to understanding of biological systems. We develop a universal method to label individual DNA molecules for single-base-resolution haplotype-resolved quantitative characterization of diverse types of rare variants, with frequency as low as 4 × 10−5 , using both short- or long-read sequencing platforms.
4/6/2023 5:37:58 AM +00:00
Guanine-rich sequences are able to form complex RNA structures termed RNA G-quadruplexes in vitro. Because of their high stability, RNA Gquadruplexes are proposed to exist in vivo and are suggested to be associated with important biological relevance.
4/6/2023 5:37:50 AM +00:00
Glioblastoma (GBM) is a complex disease with extensive molecular and transcriptional heterogeneity. GBM can be subcategorized into four distinct subtypes; tumors that shift towards the mesenchymal phenotype upon recurrence are generally associated with treatment resistance, unfavorable prognosis, and the infiltration of pro-tumorigenic macrophages.
4/6/2023 5:37:43 AM +00:00
A key challenge in epigenetics is to determine the biological significance of epigenetic variation among individuals. We present Coordinate Covariation Analysis (COCOA), a computational framework that uses covariation of epigenetic signals across individuals and a database of region sets to annotate epigenetic heterogeneity.
4/6/2023 5:37:32 AM +00:00
Single-cell RNA-sequencing (scRNAseq) and the set of attached analysis methods are evolving fast, with more than 560 software tools available to the community, roughly half of which are dedicated to tasks related to data processing such as clustering, ordering, dimension reduction, or normalization.
4/6/2023 5:37:19 AM +00:00
Recent long-read assemblies often exceed the quality and completeness of available reference genomes, making validation challenging. Here we present Merqury, a novel tool for reference-free assembly evaluation based on efficient k-mer set operations.
4/6/2023 5:37:03 AM +00:00
The rapid development of single-cell RNA-sequencing (scRNA-seq) technologies has led to the emergence of many methods for removing systematic technical noises, including imputation methods, which aim to address the increased sparsity observed in single-cell data.
4/6/2023 5:36:55 AM +00:00
A key step in domestication of the grapevine was the transition from separate sexes (dioecy) in wild Vitis vinifera ssp. sylvestris (V. sylvestris) to hermaphroditism in cultivated Vitis vinifera ssp. sativa (V. vinifera).
4/6/2023 5:36:44 AM +00:00
Many computational methods have been developed recently to analyze single-cell RNA-seq (scRNA-seq) data. Several benchmark studies have compared these methods on their ability for dimensionality reduction, clustering, or differential analysis, often relying on default parameters.
4/6/2023 5:36:35 AM +00:00
Cell type heterogeneity presents a challenge to the interpretation of epigenome data, compounded by the difficulty in generating reliable single-cell DNA methylomes for large numbers of cells and samples. We present EPISCORE, a computational algorithm that performs virtual microdissection of bulk tissue DNA methylation data at single cell-type resolution for any solid tissue.
4/6/2023 5:36:28 AM +00:00
To provide a comprehensive mechanistic interpretation of how known trait-associated SNPs affect complex traits, we propose a method, Primo, for integrative analysis of GWAS summary statistics with multiple sets of omics QTL summary statistics from different cellular conditions or studies.
4/6/2023 5:36:16 AM +00:00
Short tandem repeats (STRs) contribute significantly to de novo mutagenesis, driving phenotypic diversity and genetic disease. Although highly diverse, their repetitive sequences induce DNA polymerase slippage and stalling, leading to length and sequence variation.
4/6/2023 5:36:05 AM +00:00
Contamination from present-day DNA is a fundamental issue when studying ancient DNA from historical or archaeological material, and quantifying the amount of contamination is essential for downstream analyses.
4/6/2023 5:35:58 AM +00:00
Allele expression (AE) analysis robustly measures cis-regulatory effects. Here, we present and demonstrate the utility of a vast AE resource generated from the GTEx v8 release, containing 15,253 samples spanning 54 human tissues for a total of 431 million measurements of AE at the SNP level and 153 million measurements at the haplotype level.
4/6/2023 5:35:50 AM +00:00
Germline disease-causing variants are generally more spatially clustered in protein 3-dimensional structures than benign variants. Motivated by this tendency, we develop a fast and powerful protein-structure-based scan (PSCAN) approach for evaluating gene-level associations with complex disease and detecting signal variants.
4/6/2023 5:35:43 AM +00:00
The observation that disease-associated genetic variants typically reside outside of exons has inspired widespread investigation into the genetic basis of transcriptional regulation. While associations between the mRNA abundance of a gene and its proximal SNPs (cis-eQTLs) are now readily identified, identification of high-quality distal associations (trans-eQTLs) has been limited by a heavy multiple testing burden and the proneness to false-positive signals.
4/6/2023 5:35:34 AM +00:00
Copy number aberrations (CNAs), which are pathogenic copy number variations (CNVs), play an important role in the initiation and progression of cancer. Single-cell DNA-sequencing (scDNAseq) technologies produce data that is ideal for inferring CNAs.
4/6/2023 5:35:24 AM +00:00
MAESTRO provides functions for preprocessing, alignment, quality control, expression and chromatin accessibility quantification, clustering, differential analysis, and annotation. By modeling gene regulatory potential from chromatin accessibilities at the single-cell level, MAESTRO outperforms the existing methods for integrating the cell clusters between scRNAseq and scATAC-seq.
4/6/2023 5:35:13 AM +00:00
RNA-binding proteins (RBPs) function as master regulators of gene expression. Alterations in RBP expression and function are often observed in cancer and influence critical pathways implicated in tumor initiation and growth. Identification and characterization of oncogenic RBPs and their regulatory networks provide new opportunities for targeted therapy.
4/6/2023 5:35:06 AM +00:00
DNA methylation is a key epigenetic modification in human development and disease, yet there is limited understanding of its highly coordinated regulation. Here, we identify 818 genes that affect DNA methylation patterns in blood using large-scale population genomics data.
4/6/2023 5:34:48 AM +00:00
Population structure among study subjects may confound genetic association studies, and lack of proper correction can lead to spurious findings. The Genotype-Tissue Expression (GTEx) project largely contains individuals of European ancestry, but the v8 release also includes up to 15% of individuals of non-European ancestry.
4/6/2023 5:34:41 AM +00:00
Although tuberculosis accounts for the highest mortality from a bacterial infection on a global scale, questions persist regarding its origin. One hypothesis based on modern Mycobacterium tuberculosis complex (MTBC) genomes suggests their most recent common ancestor followed human migrations out of Africa approximately 70,000 years before present.
4/6/2023 5:34:33 AM +00:00
We report a method called ContamLD for estimating autosomal ancient DNA (aDNA) contamination by measuring the breakdown of linkage disequilibrium in a sequenced individual due to the introduction of contaminant DNA.
4/6/2023 5:34:24 AM +00:00
Long-read sequencing is promising for the comprehensive discovery of structural variations (SVs). However, it is still non-trivial to achieve high yields and performance simultaneously due to the complex SV signatures implied by noisy long reads.
4/6/2023 5:34:14 AM +00:00