Abstract
This paper discusses the basic design of the encoding scheme described by the Text Encoding Initiative'sGuidelines for Electronic Text Encoding and Interchange (TEI document number TEI P3, hereafter simplyP3 orthe Guidelines). It first reviews the basic design goals of the TEI project and their development during the course of the project. Next, it outlines some basic notions relevant for the design of any markup language and uses those notions to describe the basic structure of the TEI encoding scheme. It also describes briefly the “core” tag set defined in chapter 6 of P3, and the “default text structure” defined in chapter 7 of that work. The final section of the paper attempts an evaluation of P3 in the light of its original design goals, and outlines areas in which further work is still needed.
Similar content being viewed by others
References
Association for Computers and the Humanities (ACH), Association for Computational Linguistics (ACL), and Association for Literary and Linguistic Computing (ALLC).Guidelines for Electronic Text Encoding and Interchange. Eds. C. M. Sperberg-McQueen and Lou Burnard. Chicago, Oxford: Text Encoding Initiative, 1994.
Guittet, C., Ed.Formex: Formalized Exchange of Electronic Publications. Luxembourg: Office for Official Publications of the European Communities, ‘New Technologies — Project Management’ Department, 1984.
International Organization for Standardization (ISO). ISO 8879-1986Information Processing — Text and Office Systems — Standard Generalized Markup Language (SGML). [Geneva]: ISO, 1986.
International Organization for Standardization (ISO). ISO/TR 9573-1988(E)Information processing — SGML support facilities — Techniques for using SGML. [Geneva]: ISO, 1988.
International Organization for Standardization (ISO).ISO DIS 10179 — Information Technology — Text Composition — Document Style Semantics and Specification Language [Geneva]: ISO, 1994.
Text Encoding Initiative. TEI ED P1 “Design Principles for Text Encoding Guidelines”. [Chicago, Oxford]: TEI, 1989.
Text Encoding Initiative. TEI ED P2 “Charges to the Working Committees”. [Chicago, Oxford]: TEI, 1989.
Text Encoding Initiative. TEI ED P3 “Theoretical Stance and Resolution of Theory Conflict”. [Chicago, Oxford]: TEI, 1989.
Tompa, Frank Wm. “What is (Tagged) Text?” InDictionaries in the Electronic Age: Fifth Annual Conference of the UW Centre for the New Oxford English Dictionary. Oxford: [n.p.], 1989.
Author information
Authors and Affiliations
Additional information
C. M. Sperberg-McQueen is a Senior Research Programmer at the academic computer center of the University of Illinois at Chicago; his interests include medieval Germanic languages and literatures and the theory of electronic text markup. Since 1988 he has been editor in chief of the ACH/ACL/ALLC Text Encoding Initiative. Lou Burnard is Director of the Oxford Text Archive at Oxford University Computing Services, with interests in electronic text and database technology. He is European Editor of the Text Encoding Initiative's Guidelines.
Rights and permissions
About this article
Cite this article
Sperberg-McQueen, C.M., Burnard, L. The design of the TEI encoding scheme. Comput Hum 29, 17–39 (1995). https://doi.org/10.1007/BF01830314
Issue Date:
DOI: https://doi.org/10.1007/BF01830314