Advanced search×

Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation.

Nat Biotechnol 28(5):511-5 (2010) PMID 20436464 PMCID PMC3146043

High-throughput mRNA sequencing (RNA-Seq) promises simultaneous transcript discovery and abundance estimation. However, this would require algorithms that are not restricted by prior gene annotations and that account for alternative transcription and splicing. Here we introduce such algorithms in an open-source software program called Cufflinks. To test Cufflinks, we sequenced and analyzed >430 million paired 75-bp RNA-Seq reads from a mouse myoblast cell line over a differentiation time series. We detected 13,692 known transcripts and 3,724 previously unannotated ones, 62% of which are supported by independent expression data or by homologous genes in other species. Over the time series, 330 genes showed complete switches in the dominant transcription start site (TSS) or splice isoform, and we observed more subtle shifts in 1,304 other genes. These results suggest that Cufflinks can illuminate the substantial regulatory flexibility and complexity in even this well-studied model of muscle development and that it can improve transcriptome-based genome annotation.

Referenced by 1 articles

DOI: 10.1038/nbt.1621
Version: za2963e q8za9 q8zbd q8zcc q8zd4 q8zec q8zf5 q8zge

Similar articles you may find interesting…

  1. The mechanism of ORFH79 suppression with the artificial restorer fertility gene Mt-GRP162.

    New Phytol (2013) PMID 23647140

    We engineered a recombinant GRP162 containing the mitochondrial transit peptide, termed Mt-GRP162, as an artificial restorer of fertility (Rf) gene. Mt-GRP162 was confirmed to bind to CMS-associated RNA and to localize to the mitochondria. The transgenic plants showed restored fertility with partial...
  2. Genome wide proteomics of ERBB2 and EGFR and other oncogenic pathways in inflammatory breast cancer.

    J Proteome Res (2013) PMID 23647160

    We selected three breast cancer cell lines (SKBR3, SUM149 and SUM190) with different oncogene expression levels involved in ERBB2 and EGFR signaling pathways as a model system for the evaluation of selective integration of subsets of transcriptomic and proteomic data. We assessed the oncogene status...
  3. Discovery and mass spectrometric analysis of novel splice-junction peptides using RNA-Seq.

    Mol Cell Proteomics (2013) PMID 23629695

    We collected RNA-Seq and proteomics data from the same cell population (Jurkat cells) and created a bioinformatics pipeline that builds customized databases for the discovery of novel splice-junction peptides. Eighty million paired-end Illumina reads and ~500,000 tandem mass spectra were used to ide...