{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,22]],"date-time":"2025-02-22T00:45:29Z","timestamp":1740185129954,"version":"3.37.3"},"reference-count":33,"publisher":"Oxford University Press (OUP)","issue":"20","license":[{"start":{"date-parts":[[2017,7,12]],"date-time":"2017-07-12T00:00:00Z","timestamp":1499817600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/about_us\/legal\/notices"}],"funder":[{"DOI":"10.13039\/501100006769","name":"Russian Science Foundation","doi-asserted-by":"publisher","award":["14-24-00155"],"id":[{"id":"10.13039\/501100006769","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["P30 CA006973","NCI R01CA177669"],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100002261","name":"Russian Foundation for Basic Research","doi-asserted-by":"publisher","award":["14-04-01872","14-04-00576"],"id":[{"id":"10.13039\/501100002261","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2017,10,15]]},"abstract":"Abstract<\/jats:title>\n \n Motivation<\/jats:title>\n Genomics features with similar genome-wide distributions are generally hypothesized to be functionally related, for example, colocalization of histones and transcription start sites indicate chromatin regulation of transcription factor activity. Therefore, statistical algorithms to perform spatial, genome-wide correlation among genomic features are required.<\/jats:p>\n <\/jats:sec>\n \n Results<\/jats:title>\n Here, we propose a method, StereoGene, that rapidly estimates genome-wide correlation among pairs of genomic features. These features may represent high-throughput data mapped to reference genome or sets of genomic annotations in that reference genome. StereoGene enables correlation of continuous data directly, avoiding the data binarization and subsequent data loss. Correlations are computed among neighboring genomic positions using kernel correlation. Representing the correlation as a function of the genome position, StereoGene outputs the local correlation track as part of the analysis. StereoGene also accounts for confounders such as input DNA by partial correlation. We apply our method to numerous comparisons of ChIP-Seq datasets from the Human Epigenome Atlas and FANTOM CAGE to demonstrate its wide applicability. We observe the changes in the correlation between epigenomic features across developmental trajectories of several tissue types consistent with known biology and find a novel spatial correlation of CAGE clusters with donor splice sites and with poly(A) sites. These analyses provide examples for the broad applicability of StereoGene for regulatory genomics.<\/jats:p>\n <\/jats:sec>\n \n Availability and implementation<\/jats:title>\n The StereoGene C\u2009++\u2009source code, program documentation, Galaxy integration scripts and examples are available from the project homepage http:\/\/stereogene.bioinf.fbb.msu.ru\/<\/jats:p>\n <\/jats:sec>\n \n Supplementary information<\/jats:title>\n Supplementary data are available at Bioinformatics online.<\/jats:p>\n <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btx379","type":"journal-article","created":{"date-parts":[[2017,6,13]],"date-time":"2017-06-13T03:10:25Z","timestamp":1497323425000},"page":"3158-3165","source":"Crossref","is-referenced-by-count":20,"title":["StereoGene: rapid estimation of genome-wide correlation of continuous or interval feature data"],"prefix":"10.1093","volume":"33","author":[{"given":"Elena D","family":"Stavrovskaya","sequence":"first","affiliation":[{"name":"Department of Bioengineering and Bioinformatics, Moscow State University, Moscow, Russia"},{"name":"Institute for Information Transmission Problems, RAS, Moscow, Russia"}]},{"given":"Tejasvi","family":"Niranjan","sequence":"additional","affiliation":[{"name":"Department of Oncology, Sidney Kimmel Comprehensive Cancer Center, The Johns Hopkins University School of Medicine, Baltimore, MD, USA"}]},{"given":"Elana J","family":"Fertig","sequence":"additional","affiliation":[{"name":"Department of Oncology, Sidney Kimmel Comprehensive Cancer Center, The Johns Hopkins University School of Medicine, Baltimore, MD, USA"}]},{"given":"Sarah J","family":"Wheelan","sequence":"additional","affiliation":[{"name":"Department of Oncology, Sidney Kimmel Comprehensive Cancer Center, The Johns Hopkins University School of Medicine, Baltimore, MD, USA"}]},{"given":"Alexander V","family":"Favorov","sequence":"additional","affiliation":[{"name":"Department of Oncology, Sidney Kimmel Comprehensive Cancer Center, The Johns Hopkins University School of Medicine, Baltimore, MD, USA"},{"name":"Laboratory of Systems Biology and Computational Genetics, Vavilov Institute of General Genetics, RAS, Moscow, Russia"},{"name":"Laboratory of Bioinformatics, Research Institute of Genetics and Selection of Industrial Microorganisms, Moscow, Russia"}]},{"given":"Andrey A","family":"Mironov","sequence":"additional","affiliation":[{"name":"Department of Bioengineering and Bioinformatics, Moscow State University, Moscow, Russia"},{"name":"Institute for Information Transmission Problems, RAS, Moscow, Russia"}]}],"member":"286","published-online":{"date-parts":[[2017,7,12]]},"reference":[{"key":"2023020207514029400_btx379-B1","doi-asserted-by":"crossref","DOI":"10.1093\/nar\/gkw343","article-title":"The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2016 update","author":"Afgan","year":"2016","journal-title":"Nucleic Acids Res"},{"key":"2023020207514029400_btx379-B2","doi-asserted-by":"crossref","first-page":"147","DOI":"10.1016\/j.molcel.2015.01.027","article-title":"Bursty gene expression in the intact mammalian liver","volume":"58","author":"Bahar Halpern","year":"2015","journal-title":"Mol. Cell"},{"key":"2023020207514029400_btx379-B3","doi-asserted-by":"crossref","first-page":"1045","DOI":"10.1038\/nbt1010-1045","article-title":"The NIH roadmap epigenomics mapping consortium","volume":"28","author":"Bernstein","year":"2010","journal-title":"Nat. Biotechnol"},{"key":"2023020207514029400_btx379-B4","doi-asserted-by":"crossref","first-page":"R90","DOI":"10.1093\/hmg\/dds353","article-title":"Chromatin and epigenetic regulation of pre-mRNA processing","volume":"21","author":"Brown","year":"2012","journal-title":"Hum. Mol. Genet"},{"key":"2023020207514029400_btx379-B5","doi-asserted-by":"crossref","first-page":"93","DOI":"10.1038\/nrg3607","article-title":"Chromatin modifiers and remodellers: regulators of cellular differentiation","volume":"15","author":"Chen","year":"2014","journal-title":"Nat. Rev. Genet"},{"key":"2023020207514029400_btx379-B6","doi-asserted-by":"crossref","first-page":"607","DOI":"10.1093\/bioinformatics\/bts009","article-title":"An effective statistical evaluation of ChIPseq dataset similarity","volume":"28","author":"Chikina","year":"2012","journal-title":"Bioinformatics"},{"key":"2023020207514029400_btx379-B7","doi-asserted-by":"crossref","first-page":"390","DOI":"10.1038\/nrg3454","article-title":"Exploring the three-dimensional organization of genomes: interpreting chromatin interaction data","volume":"14","author":"Dekker","year":"2013","journal-title":"Nat. Rev. Genet"},{"key":"2023020207514029400_btx379-B8","doi-asserted-by":"crossref","first-page":"364","DOI":"10.1038\/nbt.3157","article-title":"Large-scale imputation of epigenomic datasets for systematic annotation of diverse human tissues","volume":"33","author":"Ernst","year":"2015","journal-title":"Nat. Biotechnol"},{"key":"2023020207514029400_btx379-B9","doi-asserted-by":"crossref","first-page":"e1002529\u2013e1002529.","DOI":"10.1371\/journal.pcbi.1002529","article-title":"Exploring massive, genome scale datasets with the GenometriCorr package","volume":"8","author":"Favorov","year":"2012","journal-title":"PLoS Comput. Biol"},{"key":"2023020207514029400_btx379-B10","doi-asserted-by":"crossref","first-page":"445","DOI":"10.1038\/nature13424","article-title":"Comparative analysis of the transcriptome across distant species","volume":"512","author":"Gerstein","year":"2014","journal-title":"Nature"},{"key":"2023020207514029400_btx379-B11","doi-asserted-by":"crossref","first-page":"2046","DOI":"10.1093\/bioinformatics\/btt343","article-title":"GAT: a simulation framework for testing the association of genomic intervals","volume":"29","author":"Heger","year":"2013","journal-title":"Bioinformatics"},{"key":"2023020207514029400_btx379-B12","doi-asserted-by":"crossref","first-page":"44","DOI":"10.1038\/nprot.2008.211","article-title":"Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources","volume":"4","author":"Huang da","year":"2009","journal-title":"Nat. Protoc"},{"key":"2023020207514029400_btx379-B13","doi-asserted-by":"crossref","first-page":"12514\u201312514.","DOI":"10.1038\/ncomms12514","article-title":"reChIP-seq reveals widespread bivalency of H3K4me3 and H3K27me3 in CD4(+) memory T cells","volume":"7","author":"Kinkley","year":"2016","journal-title":"Nat. Commun"},{"key":"2023020207514029400_btx379-B14","doi-asserted-by":"crossref","first-page":"109","DOI":"10.1093\/dnares\/dsu044","article-title":"Genome-wide study of correlations between genomic features and their relationship with the regulation of gene expression","volume":"22","author":"Kravatsky","year":"2015","journal-title":"DNA Res"},{"key":"2023020207514029400_btx379-B15","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pcbi.1003118","article-title":"Software for computing and annotating genomic ranges","volume":"9","author":"Lawrence","year":"2013","journal-title":"PLoS Comput. Biol"},{"key":"2023020207514029400_btx379-B16","doi-asserted-by":"crossref","first-page":"15561","DOI":"10.1073\/pnas.0402962101","article-title":"A system for enhancing genome-wide coexpression dynamics study","volume":"101","author":"Li","year":"2004","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023020207514029400_btx379-B17","doi-asserted-by":"crossref","DOI":"10.1137\/1.9781611970999","volume-title":"Computational Frameworks for the Fast Fourier Transform","author":"Loan","year":"1992"},{"key":"2023020207514029400_btx379-B18","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1186\/s13040-015-0051-7","article-title":"Uncovering correlated variability in epigenomic datasets using the Karhunen-Loeve transform","volume":"8","author":"Madrigal","year":"2015","journal-title":"BioData Min"},{"key":"2023020207514029400_btx379-B19","doi-asserted-by":"crossref","first-page":"e01256.","DOI":"10.7554\/eLife.01256","article-title":"Chromatin signature of widespread monoallelic expression","volume":"31","author":"Nag","year":"2013","journal-title":"eLife"},{"key":"2023020207514029400_btx379-B20","doi-asserted-by":"crossref","first-page":"1713","DOI":"10.1534\/g3.115.018853","article-title":"Chromatin signature identifies monoallelic gene expression across mammalian cell types","volume":"5","author":"Nag","year":"2015","journal-title":"G3"},{"key":"2023020207514029400_btx379-B21","doi-asserted-by":"crossref","first-page":"1919","DOI":"10.1093\/bioinformatics\/bts277","article-title":"BEDOPS: high-performance genomic feature operations","volume":"28","author":"Neph","year":"2012","journal-title":"Bioinformatics"},{"issue":"Database issue","key":"2023020207514029400_btx379-B22","doi-asserted-by":"crossref","first-page":"D32","DOI":"10.1093\/nar\/gkn721","article-title":"NCBI reference sequences: current status, policy and new initiatives","volume":"37","author":"Pruitt","year":"2009","journal-title":"Nucleic Acids Res"},{"key":"2023020207514029400_btx379-B23","doi-asserted-by":"crossref","first-page":"841","DOI":"10.1093\/bioinformatics\/btq033","article-title":"BEDTools: a flexible suite of utilities for comparing genomic features","volume":"26","author":"Quinlan","year":"2010","journal-title":"Bioinformatics"},{"key":"2023020207514029400_btx379-B24","doi-asserted-by":"crossref","first-page":"744","DOI":"10.1016\/j.cell.2010.01.044","article-title":"An atlas of combinatorial transcriptional regulation in mouse and man","volume":"140","author":"Ravasi","year":"2010","journal-title":"Cell"},{"key":"2023020207514029400_btx379-B25","doi-asserted-by":"crossref","first-page":"135","DOI":"10.1126\/science.2990042","article-title":"An RNA processing activity that debranches RNA lariats","volume":"229","author":"Ruskin","year":"1985","journal-title":"Science"},{"key":"2023020207514029400_btx379-B26","doi-asserted-by":"crossref","first-page":"1777","DOI":"10.1016\/j.celrep.2013.04.032","article-title":"Bivalent chromatin marks developmental regulatory genes in the mouse embryonic germline in vivo","volume":"3","author":"Sachs","year":"2013","journal-title":"Cell Rep"},{"key":"2023020207514029400_btx379-B27","doi-asserted-by":"crossref","first-page":"12.","DOI":"10.1186\/gb-2010-11-12-r121","article-title":"The genomic HyperBrowser: inferential genomics at the sequence level","volume":"11","author":"Sandve","year":"2010","journal-title":"Genome Biol"},{"key":"2023020207514029400_btx379-B28","doi-asserted-by":"crossref","first-page":"461","DOI":"10.1080\/15287394.2012.674914","article-title":"Integrative analyses for omicsdata: a Bayesian mixture model to assess the concordance of ChIP-ChIP and ChIP-seq measurements","volume":"75","author":"Sch\u00e4fer","year":"2012","journal-title":"J. Toxicol. Environ. Health A"},{"key":"2023020207514029400_btx379-B29","doi-asserted-by":"crossref","first-page":"e0155378.","DOI":"10.1371\/journal.pone.0155378","article-title":"CTCF and cohesinSA-1 mark active promoters and boundaries of repressive chromatin domains in primary human erythroid cells","volume":"11","author":"Steiner","year":"2016","journal-title":"PLoS One"},{"key":"2023020207514029400_btx379-B30","doi-asserted-by":"crossref","first-page":"1030","DOI":"10.1038\/nsmb.1841","article-title":"Nuclear-localized tiny RNAs are associated with transcription initiation and splice sites in metazoans","volume":"17","author":"Taft","year":"2010","journal-title":"Nat. Struct. Mol. Biol"},{"key":"2023020207514029400_btx379-B31","doi-asserted-by":"crossref","first-page":"R137.","DOI":"10.1186\/gb-2008-9-9-r137","article-title":"Model-based analysis of ChIP-seq (MACS)","volume":"9","author":"Zhang","year":"2008","journal-title":"Genome Biol"},{"key":"2023020207514029400_btx379-B32","doi-asserted-by":"crossref","first-page":"e58.","DOI":"10.1093\/nar\/gkr053","article-title":"QDMR: a quantitative method for identification of differentially methylated regions by entropy","volume":"39","author":"Zhang","year":"2011","journal-title":"Nucleic Acids Res"},{"key":"2023020207514029400_btx379-B33","doi-asserted-by":"crossref","first-page":"e1003525.","DOI":"10.1371\/journal.pcbi.1003525","article-title":"Global quantitative modeling of chromatin factor interactions","volume":"10","author":"Zhou","year":"2014","journal-title":"PLoS Comput. Biol"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/33\/20\/3158\/49042870\/bioinformatics_33_20_3158.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/33\/20\/3158\/49042870\/bioinformatics_33_20_3158.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T07:57:40Z","timestamp":1675324660000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/33\/20\/3158\/3957591"}},"subtitle":[],"editor":[{"given":"Inanc","family":"Birol","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2017,7,12]]},"references-count":33,"journal-issue":{"issue":"20","published-print":{"date-parts":[[2017,10,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btx379","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/059584","asserted-by":"object"}]},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"type":"print","value":"1367-4803"},{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2017,10,15]]},"published":{"date-parts":[[2017,7,12]]}}}