XML serialization of CHAT
Abbreviation: CHAT-XML
Identifiers:
Type Id
SIS ID fCHAT-XML
Media type(s):
File extension(s): .xml
Format family: XML
Schema location:  https://talkbank.org/software/talkbank.xsd
Functional domains:
  • Audiovisual Annotation
Recommendations:
Centre        Domain                    Level         Comments
CMU           Audiovisual Annotation    recommended
FIN-CLARIN    Audiovisual Annotation    discouraged   Consider using TEISpoken instead.
IDS           Audiovisual Annotation    discouraged   Consider using TEISpoken instead.
Sprakbanken   Audiovisual Annotation    discouraged   Consider using TEISpoken instead.

Audiovisual Annotation: annotations of audiovisual sources, usually including a basic rendering of the spoken content (transcription) and sometimes further annotation.
Description:

CHAT (Codes for the Human Analysis of Transcripts) is the most common format for transcribing, encoding and analysing child language data. It was developed by Brian MacWhinney within the CHILDES (Child Language Data Exchange System) Project. All of the transcripts in the CHILDES database are encoded in the CHAT format.

In the CHAT format, transcribed data can be annotated with several levels of linguistic information, such as phonological, morphological or syntactic information. Data transcribed and annotated in CHAT can be processed with analysis tools such as CLAN, which is also provided by the CHILDES Project.

Because the CHAT format is widely recognized in child language research, many transcription tools (such as ELAN, EXMARaLDA, Praat, Phon and Transcriber) support it. These tools can import CHAT-formatted data and export data held in other formats as CHAT.

The XML version is produced by the tool Chatter, and the XML Schema for that format is documented at https://talkbank.org/software/xsddoc/talkbank_xsd.html.
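
As a rough illustration of how a CHAT-XML file might be read, the following Python sketch groups the words of each utterance by speaker. The namespace URI, the element names `u` and `w`, the attribute `who`, and the inline sample document are assumptions made for this example; they should be checked against the schema documentation before being relied on.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Assumed namespace of TalkBank XML documents; verify against talkbank.xsd.
TALKBANK_NS = "http://www.talkbank.org/ns/talkbank"

# Invented, heavily simplified sample. Real files carry many more
# attributes and elements (participants, media links, dependent tiers).
SAMPLE = """<CHAT xmlns="http://www.talkbank.org/ns/talkbank">
  <u who="CHI" uID="u0"><w>more</w><w>cookie</w></u>
  <u who="MOT" uID="u1"><w>you</w><w>want</w><w>more</w></u>
  <u who="CHI" uID="u2"><w>yes</w></u>
</CHAT>
"""


def words_by_speaker(xml_text):
    """Return {speaker: [[word, ...], ...]} with one inner list per utterance.

    Assumes utterances are <u who="..."> elements containing <w> elements.
    """
    root = ET.fromstring(xml_text)
    ns = {"tb": TALKBANK_NS}
    result = defaultdict(list)
    for utterance in root.iterfind(".//tb:u", ns):
        speaker = utterance.get("who", "UNKNOWN")
        # Keep only <w> elements that have direct text content.
        words = [w.text for w in utterance.iterfind(".//tb:w", ns) if w.text]
        result[speaker].append(words)
    return dict(result)


if __name__ == "__main__":
    for speaker, utterances in words_by_speaker(SAMPLE).items():
        for words in utterances:
            print(speaker, " ".join(words))
```

To process a file on disk, the same function can be given the file's contents, for example `words_by_speaker(open("sample.xml", encoding="utf-8").read())`, where `sample.xml` is a hypothetical file name.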

Keywords: transcription of speech, spoken language, speech data, annotation
Related Standard(s):