  • Angela Brooks
  • Francisco Pardo-Palacios
  • Fairlie Reese
  • Silvia Carbonell-Sala
  • Mark Diekhans
  • Cindy Liang
  • Dingjie Wang
  • Brian Williams
  • Matthew Adams
  • Amit Behera
  • Julien Lagarde
  • Haoran Li
  • Gabriela Balderrama-Gutierrez
  • Muhammed Hasan Çelik
  • Maite De María
  • Nancy Denslow
  • Natàlia Garcia-Reyero
  • Stefan Goetz
  • Margaret Hunter
  • Jane Loveland
  • Carlos Menor
  • David Moraga
  • Jonathan Mudge
  • Hazuki Takahashi
  • Alison Tang
  • Ingrid Youngworth
  • Piero Carninci
  • Roderic Guigó
  • Hagen U. Tilgner
  • Barbara Wold
  • Christopher Vollmers
  • Gloria Sheynkman
  • Adam Frankish
  • Kin Fai Au
  • Ana Conesa
  • Ali Mortazavi
With the increased use of long-read sequencing technologies for transcriptome analysis, there is a growing need to evaluate different methodologies, including library preparation, sequencing platform, and computational analysis tools. Here, we report the study design of a community effort called the Long-read RNA-Seq Genome Annotation Assessment Project (LRGASP) Consortium, whose goals are to characterize the strengths and remaining challenges of using long-read approaches to identify and quantify the transcriptomes of both model and non-model organisms. The LRGASP organizers have generated cDNA and direct RNA datasets from human, mouse, and manatee samples using different protocols, followed by sequencing on Illumina, Pacific Biosciences, and Oxford Nanopore Technologies platforms. Participants will use the provided data to submit predictions for three challenges: transcript isoform detection with a high-quality genome, transcript isoform quantification, and de novo transcript isoform identification. Evaluators from different institutions will determine which pipelines have the highest accuracy across a variety of metrics, using benchmarks that include spike-in synthetic transcripts, simulated data, and a set of undisclosed transcripts manually curated by GENCODE. We also describe plans for experimental validation of predictions that are platform-specific and computational tool-specific. We believe that a community effort to evaluate long-read RNA-seq methods will help move the field toward a better consensus on the best approaches to use for transcriptome analyses.
Original language: English
Journal: Nature Methods
State: Submitted - 3 Aug 2021

ID: 100354881