{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,6,10]],"date-time":"2024-06-10T07:40:03Z","timestamp":1718005203239},"reference-count":26,"publisher":"Oxford University Press (OUP)","issue":"22","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2015,11,15]]},"abstract":"Abstract<\/jats:title>Motivation: Designing an RNA-seq study depends critically on its specific goals, technology and underlying biology, which renders general guidelines inadequate. We propose a Bayesian framework to customize experiments so that goals can be attained and resources are not wasted, with a focus on alternative splicing.<\/jats:p>Results: We studied how read length, sequencing depth, library preparation and the number of replicates affects cost-effectiveness of single-sample and group comparison studies. Optimal settings varied strongly according to the target organism or tissue (potential 50\u2013500% cost cuts) and, interestingly, short reads outperformed long reads for standard analyses. Our framework learns key characteristics for study design from the data, and predicts if and how to continue experimentation. These predictions matched several follow-up experimental datasets that were used for validation. We provide default pipelines, but the framework can be combined with other data analysis methods and can help assess their relative merits.<\/jats:p>Availability and implementation: casper package at www.bioconductor.org\/packages\/release\/bioc\/html\/casper.html, Supplementary Manual by typing casperDesign() at the R prompt.<\/jats:p>Contact: rosselldavid@gmail.com<\/jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btv436","type":"journal-article","created":{"date-parts":[[2015,7,29]],"date-time":"2015-07-29T00:29:38Z","timestamp":1438129778000},"page":"3631-3637","source":"Crossref","is-referenced-by-count":7,"title":["Designing alternative splicing RNA-seq studies. Beyond generic guidelines"],"prefix":"10.1093","volume":"31","author":[{"given":"Camille","family":"Stephan-Otto Attolini","sequence":"first","affiliation":[{"name":"1 Institute for Research in Biomedicine (IRB Barcelona), Barcelona, Spain,"}]},{"given":"Victor","family":"Pe\u00f1a","sequence":"additional","affiliation":[{"name":"2 Department of Statistical Science, Duke University, Durham, North Carolina, USA and"}]},{"given":"David","family":"Rossell","sequence":"additional","affiliation":[{"name":"3 Department of Statistics, University of Warwick, Coventry, UK"}]}],"member":"286","published-online":{"date-parts":[[2015,7,27]]},"reference":[{"key":"2023020202402040100_btv436-B1","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","article-title":"Controlling the false discovery rate: a practical and powerful approach to multiple testing","volume":"57","author":"Benjamini","year":"1995","journal-title":"J. R. Stat. Soc. B."},{"key":"2023020202402040100_btv436-B2","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4757-4286-2","volume-title":"Statistical Decision Theory and Bayesian Analysis","author":"Berger","year":"1985"},{"key":"2023020202402040100_btv436-B3","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1471-2105-11-94","article-title":"Evaluation of statistical methods for normalization and differential expression in mRNA-seq experiments","volume":"11","author":"Bullard","year":"2010","journal-title":"BMC Bioinformatics"},{"key":"2023020202402040100_btv436-B4","doi-asserted-by":"crossref","first-page":"656","DOI":"10.1093\/bioinformatics\/btt015","article-title":"Scotty: a web tool for designing RNA-seq experiments to measure differential gene expression","volume":"29","author":"Busby","year":"2013","journal-title":"Bioinformatics"},{"key":"2023020202402040100_btv436-B5","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1093\/bioinformatics\/bts635","article-title":"STAR: ultrafast universal RNA-seq aligner","volume":"29","author":"Dobin","year":"2013","journal-title":"Bioinformatics"},{"key":"2023020202402040100_btv436-B6","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1038\/nature11247","article-title":"An integrated encyclopedia of DNA elements in the human genome","volume":"489","author":"ENCODE Project Consortium","year":"2012","journal-title":"Nature"},{"key":"2023020202402040100_btv436-B7","doi-asserted-by":"crossref","first-page":"1185","DOI":"10.1038\/nmeth.2722","article-title":"Systematic evaluation of spliced alignment programs for RNA-seq data","volume":"10","author":"Engstr\u00f6m","year":"2013","journal-title":"Nat. Methods"},{"key":"2023020202402040100_btv436-B8","doi-asserted-by":"crossref","first-page":"2518","DOI":"10.1093\/bioinformatics\/btr427","article-title":"Comparative analysis of RNA-Seq alignment algorithms and the RNA-Seq unified mapper (RUM)","volume":"27","author":"Grant","year":"2011","journal-title":"Bioinformatics"},{"key":"2023020202402040100_btv436-B9","doi-asserted-by":"crossref","first-page":"10073","DOI":"10.1093\/nar\/gks666","article-title":"Modelling and simulating generic RNA-Seq experiments with the flux simulator","volume":"40","author":"Griebel","year":"2012","journal-title":"Nucleic Acids Res."},{"key":"2023020202402040100_btv436-B10","doi-asserted-by":"crossref","first-page":"117","DOI":"10.1186\/s13059-015-0679-0","article-title":"quantro: a data-driven approach to guide the choice of an appropriate normalization method","volume":"16","author":"Hicks","year":"2015","journal-title":"Genome Biol."},{"key":"2023020202402040100_btv436-B11","doi-asserted-by":"crossref","first-page":"506","DOI":"10.1038\/nature12531","article-title":"Transcriptome and genome sequencing uncovers functional variation in humans","volume":"501","author":"Lappalainen","year":"2013","journal-title":"Nature"},{"key":"2023020202402040100_btv436-B12","doi-asserted-by":"crossref","first-page":"R29","DOI":"10.1186\/gb-2014-15-2-r29","article-title":"Voom: precision weights unlock linear model analysis tools for RNA-seq read counts","volume":"15","author":"Law","year":"2014","journal-title":"Genome Biol."},{"key":"2023020202402040100_btv436-B13","doi-asserted-by":"crossref","first-page":"323+","DOI":"10.1186\/1471-2105-12-323","article-title":"RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome","volume":"12","author":"Li","year":"2011","journal-title":"BMC Bioinformatics"},{"key":"2023020202402040100_btv436-B14","doi-asserted-by":"crossref","first-page":"2078","DOI":"10.1093\/bioinformatics\/btp352","article-title":"The sequence alignment\/map (SAM) format and SAMtools","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"key":"2023020202402040100_btv436-B15","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1471-2105-15-S8-S1","article-title":"Diminishing return for increased mappability with longer sequencing reads: implications of the k-mer distributions in the human genome","volume":"15","author":"Li","year":"2014","journal-title":"BMC Bioinformatics"},{"key":"2023020202402040100_btv436-B16","doi-asserted-by":"crossref","first-page":"765","DOI":"10.1093\/bioinformatics\/btp053","article-title":"Testing significance relative to a fold-change is a TREAT","volume":"25","author":"McCarthy","year":"2009","journal-title":"Bioinformatics"},{"key":"2023020202402040100_btv436-B17","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1471-2164-13-341","article-title":"A tale of three next generation sequencing platforms: comparison of Ion torrent, Pacific biosciences and Illumina Miseq sequencers","volume":"13","author":"Quail","year":"2012","journal-title":"BMC Genomics"},{"key":"2023020202402040100_btv436-B18","doi-asserted-by":"crossref","first-page":"R95+","DOI":"10.1186\/gb-2013-14-9-r95","article-title":"Comprehensive evaluation of differential gene expression analysis methods for RNA-seq data","volume":"14","author":"Rapaport","year":"2013","journal-title":"Genome Biol."},{"key":"2023020202402040100_btv436-B19","doi-asserted-by":"crossref","first-page":"1035","DOI":"10.1214\/09-AOAS244","article-title":"GaGa: a simple and flexible hierarchical model for differential expression analysis","volume":"3","author":"Rossell","year":"2009","journal-title":"Ann. Appl. Stat."},{"key":"2023020202402040100_btv436-B20","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1093\/biostatistics\/kxs026","article-title":"Sequential stopping for high-throughput experiments","volume":"14","author":"Rossell","year":"2013","journal-title":"Biostatistics"},{"key":"2023020202402040100_btv436-B21","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1214\/13-AOAS687","article-title":"Quantifying alternative splicing from paired-end RNA-seq data","volume":"8","author":"Rossell","year":"2014","journal-title":"Ann. Appl. Stat."},{"key":"2023020202402040100_btv436-B22","doi-asserted-by":"crossref","first-page":"62","DOI":"10.1214\/10-STS343","article-title":"Statistical modeling of RNA-seq data","volume":"26","author":"Salzman","year":"2011","journal-title":"Stat. Sci."},{"key":"2023020202402040100_btv436-B23","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1198\/016214505000000998","article-title":"Inverse decision theory: characterizing losses for a decision rule with applications in cervical cancer screening","volume":"101","author":"Swartz","year":"2006","journal-title":"J. Am. Stat. Assoc."},{"key":"2023020202402040100_btv436-B24","doi-asserted-by":"crossref","first-page":"562","DOI":"10.1038\/nprot.2012.016","article-title":"Differential gene and transcript expression analysis of RNA-seq experiments with tophat and cufflinks","volume":"7","author":"Trapnell","year":"2012","journal-title":"Nat. Protoc."},{"key":"2023020202402040100_btv436-B25","doi-asserted-by":"crossref","first-page":"46","DOI":"10.1038\/nbt.2450","article-title":"Differential analysis of gene regulation at transcript resolution with RNA-seq","volume":"31","author":"Trapnell","year":"2013","journal-title":"Nat. Biotechnol."},{"key":"2023020202402040100_btv436-B26","doi-asserted-by":"crossref","first-page":"1089","DOI":"10.1111\/j.1541-0420.2006.00611.x","article-title":"A unified approach for simultaneous gene clustering and differential expression identification","volume":"62","author":"Yuan","year":"2006","journal-title":"Biometrics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/31\/22\/3631\/49035809\/bioinformatics_31_22_3631.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/31\/22\/3631\/49035809\/bioinformatics_31_22_3631.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,6,10]],"date-time":"2024-06-10T07:12:33Z","timestamp":1718003553000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/31\/22\/3631\/241556"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,7,27]]},"references-count":26,"journal-issue":{"issue":"22","published-print":{"date-parts":[[2015,11,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btv436","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2015,11,15]]},"published":{"date-parts":[[2015,7,27]]}}}