Guidelines for Electronic Text Encoding and Interchange

Abbreviation: TEI Guidelines

Scope: Texts of the Digital Humanities

Topic: Generic Corpus Annotation, Markup Language

Standard body: TEI

Use in CLARIN: fully recommended

Description:

The TEI Guidelines have been developed and are maintained by the Text Encoding Initiative Consortium (TEI) and are designed for people working with any kind of textual resource in digital form.

The guidelines present recommendations of suitable ways of representing the features of textual resources, which need to be identified explicitly in order to facilitate processing by computer programs. In particular, they specify a set of markers (or tags) which can be inserted in the electronic representation of the text, to mark the text structure and other features of interest. Many computer programs depend on the presence of such explicit markers for their functionality. Without the markers, a digitized text appears to be nothing but a sequence of undifferentiated bits. The success of the World Wide Web, for example, is partly because of its use of such markup to indicate features, such as headings and lists on individual pages, and to indicate links between pages. The process of inserting such explicit markers for implicit textual features is often called ‘markup’, or equivalently within this work ‘encoding’; the term ‘tagging’ is also used informally. We use the term encoding scheme or markup language to denote the complete set of rules associated with the use of markup in a given context. Besides, we use the term markup vocabulary for the specific set of markers or named distinctions employed by a given encoding scheme. Thus, this work describes both the TEI encoding scheme, and documents the TEI markup vocabulary.

The TEI encoding scheme is particularly useful in facilitating a loss-free interchange of data amongst individuals and research groups using different programs, computer systems, or application software. Since they contain an inventory of the features most often deployed for computer-based text processing, the guidelines are also useful as a starting point for those designing new systems and creating new materials, even where interchange of information is not a primary objective.

Related Standard(s):

SpecTranScriptTSL

Other standards in the same topic(s):

Abbreviation: TEI Guidelines-1994 [not official, only for reference in this website]

Version Number: P3

Status: final

Release Date: 1994-05-16

Editor:

C. M. Sperberg-McQueen
Lou Burnard

Features:

metaLanguage: SGML
constraintLanguage: DTD
grammarClass: LTG
formalModel: Tree
notation: Inline
multipleHierarchies: milestones fragments feature structures

URL(s): http://www.tei-c.org/Vault/GL/P3/index.htm

Related Standard(s):

SpecCesCES/XCES
TEI P3 is the superclass of CES. CES is a modification of a part of the TEI P3's DTD.
SpecCes4-1CES
SpecSGMLSGML
TEI P3 is an application of the Standard Generalized Markup Language (SGML).
SpecXces104XCES
The XCES specification is based on the TEI P3 Standard.

Abbreviation: TEI Guidelines-2002 [not official, only for reference in this website]

Version Number: P4

Status: final

Release Date: 2002-03-01

Editor:

C. M. Sperberg-McQueen
Lou Burnard

Features:

metaLanguage: XML
constraintLanguage: DTD
grammarClass: LTG
formalModel: Tree
notation: Inline
multipleHierarchies: milestones fragments feature structures

URL(s): http://www.tei-c.org/release/doc/tei-p4-doc/html/

Related Standard(s):

SpecXMLXML
TEI P4 is an application of the Extensible Markup Language (XML). It uses the XML syntax for instances and XML DTD as document grammar formalism.

Abbreviation: TEI Guidelines-2011 [not official, only for reference in this website]

Version Number: P52.7.0

Status: final

Release Date: 2014-09-16

Editor:

Lou Burnard
Syd Bauman

Features:

metaLanguage: XML
constraintLanguage: RELAX NG
grammarClass: STG
formalModel: Tree
notation: Standoff
multipleHierarchies: milestones fragments standoff annotation feature structures

URL(s): http://www.tei-c.org/release/doc/tei-p5-doc/en/html/index.html

Related Standard(s):

SpecRNGRELAX NG
TEI P5 uses RELAX NG as its document grammar formalism.
SpecXMLXML
TEI P5 is an application of the Extensible Markup Language (XML). It uses the XML syntax for instances.

Used in CLARIN centre(s):

Relations

Legend:
	isUsedBy
	isApplicationOf
	isSuperclassOf
	isVersionOf

Home
Centres
Format Recommendations
	Data Deposition Formats
	Functional Domains
	File Extensions
	Media Types
	Statistics
		Popular Formats
		Relevant KPIs
	Sanity Check
		Keywords
Standards and Specifications
	Standard Bodies
	Topics
	Search
API
About / F.A.Q.