Abstract
MEDLINE is the premier literature database in the biomedical and life sciences, containing over 17 million references to journal articles. The database can be searched through PubMed, a web interface designed to provide rapid and comprehensive retrieval of articles matching specific criteria. However, given the complexity of biological systems and of genotype-to-phenotype relations, the results retrieved from PubMed may give only a partial view of the relevant information that is available. In this paper we present a new approach that expands the terminology used in each query to enrich the set of retrieved documents. We have developed a paper prioritization methodology that, for a given list of genes, expands the search across several biological domains using a mesh of co-related terms, extracts the most relevant results from the literature, and organizes them according to domain-weighted factors.
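The pipeline summarized in the abstract (expand a gene list into related terms per domain, match documents, rank by domain weights) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the term mesh, the domain names, the weights, the toy documents, and the functions `expand_query` and `rank_documents` are all hypothetical, and scoring by weighted counts of matched terms is only one plausible reading of "domain-weighted factors".

```python
# Hypothetical mesh of co-related terms: gene -> domain -> related terms.
# Placeholder names only; a real mesh would come from curated resources.
TERM_MESH = {
    "geneA": {"pathway": ["pathway X"], "disease": ["disease Y"]},
    "geneB": {"pathway": ["pathway X"], "disease": []},
}

# Hypothetical domain weights: a direct gene mention counts most.
DOMAIN_WEIGHTS = {"gene": 1.0, "pathway": 0.5, "disease": 0.25}


def expand_query(genes, mesh):
    """Return a dict mapping each domain to the set of search terms for it."""
    expanded = {"gene": set(genes)}
    for gene in genes:
        for domain, terms in mesh.get(gene, {}).items():
            expanded.setdefault(domain, set()).update(terms)
    return expanded


def rank_documents(docs, expanded, weights):
    """Score each document by the weighted number of distinct matched terms.

    Matching is a naive case-insensitive substring test. Documents with no
    match are dropped; ties are broken by document id.
    """
    scores = {}
    for doc_id, text in docs.items():
        lowered = text.lower()
        score = 0.0
        for domain, terms in expanded.items():
            hits = sum(1 for term in terms if term.lower() in lowered)
            score += weights.get(domain, 0.0) * hits
        if score > 0:
            scores[doc_id] = score
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


# Toy abstracts standing in for retrieved articles (illustrative only).
DOCS = {
    "d1": "geneA regulates pathway X",
    "d2": "pathway X is altered in disease Y",
    "d3": "unrelated text",
    "d4": "geneB and geneA in disease Y",
}

expanded = expand_query(["geneA", "geneB"], TERM_MESH)
ranking = rank_documents(DOCS, expanded, DOMAIN_WEIGHTS)
for doc_id, score in ranking:
    print(doc_id, score)
```

In this sketch, document `d2` mentions neither gene by name but is still retrieved through the expanded pathway and disease terms, which is the kind of enrichment the abstract describes. A real system would replace the substring test with queries against PubMed and a proper term-matching strategy.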
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Arrais, J.P., Rodrigues, J.G.L.M., Oliveira, J.L. (2009). Improving Literature Searches in Gene Expression Studies. In: Corchado, J.M., De Paz, J.F., Rocha, M.P., Fernández Riverola, F. (eds) 2nd International Workshop on Practical Applications of Computational Biology and Bioinformatics (IWPACBB 2008). Advances in Soft Computing, vol 49. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85861-4_10
DOI: https://doi.org/10.1007/978-3-540-85861-4_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-85860-7
Online ISBN: 978-3-540-85861-4
eBook Packages: Engineering, Engineering (R0)