The CLARIN Centre at the University of Copenhagen
Abbreviation: CLARIN-DK-UCPH
Link: https://centres.clarin.eu/centre/14
Research infrastructure:
  • CLARIN (B-centre, K-centre)
Warning: The recommendations have not been curated yet.
Functional domains:
  • Audiovisual Source Language Data
  • Documentation
  • Image Source Language Data
  • Language Description
  • Lexical Resource
  • Text Annotation
  • Textual Source Language Data
Format recommendations:
Format      Domain                              Level         Comments
CSV         Lexical Resource                    recommended
FLAC        Audiovisual Source Language Data    recommended
JPEG        Image Source Language Data          recommended
M2J         Audiovisual Source Language Data    recommended
MP4         Audiovisual Source Language Data    recommended
OGG         Audiovisual Source Language Data    recommended
plainText   Documentation                       acceptable
plainText   Textual Source Language Data        recommended
TEI         Documentation                       recommended
TEI         Text Annotation                     recommended
TIFF        Image Source Language Data          recommended
VRT         Language Description                recommended
XML         Documentation                       recommended
XML         Text Annotation                     recommended
Domain descriptions:
  • Audiovisual Source Language Data: Audio or video recordings providing spoken/multimodal or signed language data for research purposes.
  • Documentation: Unstructured documentation of the resource and its parts such as corpus or annotation guidelines.
  • Image Source Language Data: Digitized images of analogue sources of written language data for research purposes (e.g. facsimiles, scans of handwriting, photos of inscriptions).
  • Language Description: Structured or unstructured descriptions of linguistic varieties or phenomena, typological databases etc.
  • Lexical Resource: Structured (item-based) resources for lexical and/or conceptual information on units of language (e.g. wordlists, lexicons, WordNets etc.).
  • Text Annotation: Annotations of textual sources/written text, with the original text included or as stand-off.
  • Textual Source Language Data: Written unstructured/plain text or originally structured text (e.g. HTML) without linguistic or other mark-up added for research purposes.
Last update commit-id: 147cb5ea