Abstract
A common task for earth scientists is the search for stratigraphic background information on a certain rock unit, e.g. its age, its properties, and its position within the hierarchy of stratigraphic units. Analogously, when geoscientists search for information in databases or on the internet, stratigraphic and geospatial constraints serve as a first orientation within a huge amount of data. However, spatio-temporal information is mostly encoded only implicitly, e.g. in the titles and abstracts held in bibliographic databases and library catalogues. Means to decode spatial information from such texts have become commonly available through gazetteers and geocoding services, but the paleotemporal information remains elusive. Agenames is a stratigraphic information harvester and text parser that offers a web service to parse geological texts and identify stratigraphic terms. The service has both a web-based GUI and a REST interface. The Agenames ontology records the stratigraphic rank of a unit (e.g. chronostratigraphic or lithostratigraphic) and the hierarchical relations between terms. Any geologic body has an associated age of origin, assigned by relation to a geochronologic unit from the geologic time scale. In a given text, Agenames identifies potential stratigraphic keywords and uses these terms to assign a geological age estimate. The web service can also be used to augment web portals or library catalogues: Agenames can generate indices that relate library catalogue entries or web pages to the approximate geological epoch covered in a text. This allows, for instance, stratigraphic age to be included in a catalogue search without the need to specify all possibly relevant terms in complex queries.
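The keyword-to-age idea described in the abstract can be illustrated with a minimal Python sketch. It is not the Agenames implementation: the `Unit` record, the tiny `UNITS` vocabulary, and the functions `find_terms` and `age_estimate` are hypothetical names introduced here, the ages are rounded approximations in Ma chosen for illustration, and taking the envelope of all matched units is only one assumed way of turning matched terms into an age estimate.

```python
import re
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Unit:
    """One vocabulary entry: a stratigraphic term with rank, parent and age range."""
    name: str
    rank: str
    parent: Optional[str]   # name of the next-higher unit in the hierarchy
    older_ma: float         # approximate base of the unit, in Ma
    younger_ma: float       # approximate top of the unit, in Ma


# Hypothetical mini-vocabulary with rounded, illustrative ages.
UNITS = {
    u.name.lower(): u
    for u in [
        Unit("Mesozoic", "Era", None, 252.0, 66.0),
        Unit("Cenozoic", "Era", None, 66.0, 0.0),
        Unit("Jurassic", "Period", "Mesozoic", 201.0, 145.0),
        Unit("Cretaceous", "Period", "Mesozoic", 145.0, 66.0),
        Unit("Quaternary", "Period", "Cenozoic", 2.58, 0.0),
        Unit("Pleistocene", "Epoch", "Quaternary", 2.58, 0.0117),
    ]
}


def find_terms(text: str) -> List[Unit]:
    """Return known stratigraphic units mentioned in the text, in order of first mention."""
    found: List[Unit] = []
    for word in re.findall(r"[A-Za-z]+", text):
        unit = UNITS.get(word.lower())
        if unit is not None and unit not in found:
            found.append(unit)
    return found


def age_estimate(units: List[Unit]) -> Optional[Tuple[float, float]]:
    """Return (oldest, youngest) age in Ma spanned by the matched units, or None."""
    if not units:
        return None
    return (max(u.older_ma for u in units), min(u.younger_ma for u in units))


if __name__ == "__main__":
    title = "Ammonites from the Jurassic and Cretaceous of the Alps"
    units = find_terms(title)
    print([u.name for u in units], age_estimate(units))
```

An index built from such age ranges would let a catalogue search for "Mesozoic" retrieve the example title even though that word never occurs in it, which is the use case the abstract describes.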
Additional information
Communicated by: H. A. Babaie
Published in the Special Issue with Guest Editors Dr. Xiaogang Ma, Dr. Peter Fox, Dr. Thomas Narock and Dr. Brian Wilson.
About this article
Cite this article
Huber, R., Klump, J. Agenames: a stratigraphic information harvester and text parser. Earth Sci Inform 8, 125–134 (2015). https://doi.org/10.1007/s12145-014-0171-5
DOI: https://doi.org/10.1007/s12145-014-0171-5