The ILC4CLARIN Centre at the Institute for Computational Linguistics
Abbreviation: ILC4CLARIN
Registry: CLARIN:
Research infrastructure:
  • CLARIN (B-centre)
Warning: The recommendations have not been curated yet.
Data functions covered by the recommendations: ...
Format recommendations:
| Format | Domain | Level | Comments |
| --- | --- | --- | --- |
| QuickTime | Audiovisual Source Language Data | recommended | |
| MPEG-1 | Audiovisual Source Language Data | recommended | |
| WAVE | Audiovisual Source Language Data | recommended | |
| HTML | Documentation | recommended | |
| LaTeX | Documentation | recommended | |
| PDF | Documentation | acceptable | |
| TeX | Documentation | recommended | |
| XML | Documentation | recommended | |
| GIF | Image Source Language Data | recommended | |
| JPEG | Image Source Language Data | recommended | |
| TIFF | Image Source Language Data | recommended | |
| GZIP | Packaging | recommended | |
| ZIP | Packaging | recommended | |
| RDFXML | Text Annotation | recommended | |
| XML | Text Annotation | recommended | |
| ODT | Textual Source Language Data | recommended | |
| PDF | Textual Source Language Data | acceptable | |
| plainText | Textual Source Language Data | recommended | |
| CSS | Tool Support | recommended | |

Domain descriptions:
  • Audiovisual Source Language Data: Audio or video recordings providing spoken/multimodal or signed language data for research purposes.
  • Documentation: Unstructured documentation of the resource and its parts such as corpus or annotation guidelines.
  • Image Source Language Data: Digitized images of analogue sources of written language data for research purposes (e.g. facsimiles, scans of handwriting, photos of inscriptions).
  • Packaging: Packaging formats of various nature (archiving, compression, library) if no more specific domain is suitable.
  • Text Annotation: Annotations of textual sources/written text, with the original text included or as stand-off.
  • Textual Source Language Data: Written unstructured/plain text or originally structured text (e.g. HTML) without linguistic or other mark-up added for research purposes.
  • Tool Support: Tool-related formats required for specific functionality of the tool or reliable reuse of resources (e.g. tagsets, annotation schemes, vocabularies, language models, parameter files, and other specifications or settings).
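
A depositor might want to check a file against these recommendations before submitting it. The following is a minimal sketch of such a lookup, not a tool offered by ILC4CLARIN. The `RECOMMENDATIONS` dictionary transcribes the format, domain and level values listed above. The `EXTENSION_TO_FORMAT` mapping and the `levels_for` function are hypothetical names introduced for illustration, and the mapping is a partial assumption: MPEG-1, TeX/LaTeX and RDFXML are left out because their file extensions are ambiguous.

```python
from pathlib import Path

# (format, domain) -> level, transcribed from the recommendations listed above.
RECOMMENDATIONS = {
    ("QuickTime", "Audiovisual Source Language Data"): "recommended",
    ("MPEG-1", "Audiovisual Source Language Data"): "recommended",
    ("WAVE", "Audiovisual Source Language Data"): "recommended",
    ("HTML", "Documentation"): "recommended",
    ("LaTeX", "Documentation"): "recommended",
    ("PDF", "Documentation"): "acceptable",
    ("TeX", "Documentation"): "recommended",
    ("XML", "Documentation"): "recommended",
    ("GIF", "Image Source Language Data"): "recommended",
    ("JPEG", "Image Source Language Data"): "recommended",
    ("TIFF", "Image Source Language Data"): "recommended",
    ("GZIP", "Packaging"): "recommended",
    ("ZIP", "Packaging"): "recommended",
    ("RDFXML", "Text Annotation"): "recommended",
    ("XML", "Text Annotation"): "recommended",
    ("ODT", "Textual Source Language Data"): "recommended",
    ("PDF", "Textual Source Language Data"): "acceptable",
    ("plainText", "Textual Source Language Data"): "recommended",
    ("CSS", "Tool Support"): "recommended",
}

# Assumption: a partial extension-to-format mapping chosen for this sketch.
# It is not part of the centre's recommendations.
EXTENSION_TO_FORMAT = {
    ".mov": "QuickTime",
    ".wav": "WAVE",
    ".html": "HTML",
    ".pdf": "PDF",
    ".xml": "XML",
    ".gif": "GIF",
    ".jpg": "JPEG",
    ".jpeg": "JPEG",
    ".tif": "TIFF",
    ".tiff": "TIFF",
    ".gz": "GZIP",
    ".zip": "ZIP",
    ".odt": "ODT",
    ".txt": "plainText",
    ".css": "CSS",
}


def levels_for(filename):
    """Return {domain: level} for the file's assumed format, or {} if unknown."""
    fmt = EXTENSION_TO_FORMAT.get(Path(filename).suffix.lower())
    return {
        domain: level
        for (name, domain), level in RECOMMENDATIONS.items()
        if name == fmt
    }
```

A format can appear in more than one domain (PDF and XML do), so the function returns one level per domain rather than a single value. An empty result means only that the sketch does not recognise the extension, not that the centre rejects the format.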
Last update commit-id: 147cb5ea