This page lists the data deposition formats that various CLARIN centres are ready to accept. For each centre, a format can be "recommended", "acceptable" or "discouraged" in the context of one or more domains, which represent the functions that the deposited data can serve. The level of recommendation should always be viewed as relative to the profile of the given centre.
- "recommended" should be interpreted as meaning that the centre in question will in most cases be able to process the data without much manipulation and that it is likely that the data will be preserved long-term in that format (the specifics are up to that centre);
- "acceptable" should be interpreted as meaning that the centre may need to spend some time and resources on the up-conversion of the data, and that the data may be preserved in one of the recommended formats instead;
- "discouraged" should be understood as indicating that the centre may find it problematic to up-convert the data.
Use the drop-down menus to select a particular domain, centre, and/or level of recommendation. Columns can be sorted, and the results can be downloaded as XML.
An authorised person can use the exported XML file for a given centre to extend or modify that centre's recommendations. To aid in the process, please consult the separate lists of all available file formats and of the functional groupings of formats (functional domains).
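The selection and sorting offered by the page can also be approximated offline once the rows are available in some structured form. The following is only a minimal sketch: the `Recommendation` record, the `select` helper and the in-memory `ROWS` list are hypothetical names introduced for illustration, not part of the SIS or its XML export, and the sample rows are copied by hand from the table on this page.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional


@dataclass(frozen=True)
class Recommendation:
    """One table row (hypothetical record type, not an SIS structure)."""
    format: str
    centre: str
    domain: str
    level: str  # "recommended", "acceptable" or "discouraged"


# A handful of rows copied by hand from the table on this page.
ROWS = [
    Recommendation("MP3", "DANS", "Audiovisual Source Language Data", "acceptable"),
    Recommendation("FLAC", "DANS", "Audiovisual Source Language Data", "recommended"),
    Recommendation("KML", "DANS", "Geodata", "acceptable"),
    Recommendation("GeoJSON", "DANS", "Geodata", "recommended"),
    Recommendation("CSV", "DANS", "Other", "recommended"),
]


def select(
    rows: Iterable[Recommendation],
    centre: Optional[str] = None,
    domain: Optional[str] = None,
    level: Optional[str] = None,
) -> List[Recommendation]:
    """Keep rows matching every criterion that is not None, sorted by format name."""
    kept = [
        r
        for r in rows
        if (centre is None or r.centre == centre)
        and (domain is None or r.domain == domain)
        and (level is None or r.level == level)
    ]
    return sorted(kept, key=lambda r: r.format.lower())


if __name__ == "__main__":
    for row in select(ROWS, centre="DANS", domain="Geodata"):
        print(row.format, row.level)
```

Leaving a criterion as `None` mirrors leaving the corresponding drop-down menu unset, so any combination of domain, centre and level can be applied.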
As of mid-2022, not every centre with depositing services has submitted its information to the SIS. In some cases, the information had to be mapped, not always reliably, from lists provided on centre homepages onto the feature matrix offered by the SIS (created on the basis of the SIS functional domains and levels of recommendation). If you think you see an error, please help us get it right.
Format | Centre | Domain | Recommendation
---|---|---|---
AAC | DANS | Audiovisual Source Language Data: Audio or video recordings providing spoken/multimodal or signed language data for research purposes. | acceptable
AI | DANS | Image Source Language Data: Digitized images of analogue sources of written language data for research purposes (e.g. facsimiles, scans of handwriting, photos of inscriptions). | acceptable
AIFF | DANS | Audiovisual Source Language Data: Audio or video recordings providing spoken/multimodal or signed language data for research purposes. | acceptable
ArcGIS.gdb | DANS | Geodata: Information on geographic locations. | acceptable
ArcGIS.mxd | DANS | Geodata: Information on geographic locations. | acceptable
ASCII Grid | DANS | Geodata: Information on geographic locations. | recommended
AutoCAD DXF-R12 | DANS | Geodata: Information on geographic locations. | recommended
AVI | DANS | Audiovisual Source Language Data: Audio or video recordings providing spoken/multimodal or signed language data for research purposes. | acceptable
BWF | DANS | Audiovisual Source Language Data: Audio or video recordings providing spoken/multimodal or signed language data for research purposes. | recommended
CDR | DANS | Image Source Language Data: Digitized images of analogue sources of written language data for research purposes (e.g. facsimiles, scans of handwriting, photos of inscriptions). | acceptable
CSS | DANS | Tool Support: Tool-related formats required for specific functionality of the tool or reliable reuse of resources (e.g. tagsets, annotation schemes, vocabularies, language models, parameter files, and other specifications or settings). | recommended
CSV | DANS | Other: Any other function that cannot be included in an existing domain. The content of this domain will be periodically examined for potential patterns that may give rise to new domains. | recommended
DBASE | DANS | Other: Any other function that cannot be included in an existing domain. The content of this domain will be periodically examined for potential patterns that may give rise to new domains. | acceptable
DGN | DANS | Geodata: Information on geographic locations. | acceptable
DICOM | DANS | Image Source Language Data: Digitized images of analogue sources of written language data for research purposes (e.g. facsimiles, scans of handwriting, photos of inscriptions). | recommended
DOC | DANS | Documentation: Unstructured documentation of the resource and its parts such as corpus or annotation guidelines. | acceptable
DOC | DANS | Textual Source Language Data: Written unstructured/plain text or originally structured text (e.g. HTML) without linguistic or other mark-up added for research purposes. | acceptable
DOCX | DANS | Documentation: Unstructured documentation of the resource and its parts such as corpus or annotation guidelines. | acceptable
DOCX | DANS | Textual Source Language Data: Written unstructured/plain text or originally structured text (e.g. HTML) without linguistic or other mark-up added for research purposes. | acceptable
DWG | DANS | Geodata: Information on geographic locations. | acceptable
DXF | DANS | Geodata: Information on geographic locations. | acceptable
EPS | DANS | Image Source Language Data: Digitized images of analogue sources of written language data for research purposes (e.g. facsimiles, scans of handwriting, photos of inscriptions). | acceptable
Erdas.img | DANS | Geodata: Information on geographic locations. | acceptable
FLAC | DANS | Audiovisual Source Language Data: Audio or video recordings providing spoken/multimodal or signed language data for research purposes. | recommended
GeoJSON | DANS | Geodata: Information on geographic locations. | recommended
GeoTIFF | DANS | Geodata: Information on geographic locations. | recommended
GML | DANS | Geodata: Information on geographic locations. | recommended
HDF5 | DANS | Packaging: Packaging formats of various nature (archiving, compression, library) if no more specific domain is suitable. | acceptable
HTML | DANS | Documentation: Unstructured documentation of the resource and its parts such as corpus or annotation guidelines. | recommended
HTML | DANS | Textual Source Language Data: Written unstructured/plain text or originally structured text (e.g. HTML) without linguistic or other mark-up added for research purposes. | recommended
JP2 | DANS | Image Source Language Data: Digitized images of analogue sources of written language data for research purposes (e.g. facsimiles, scans of handwriting, photos of inscriptions). | recommended
JPEG | DANS | Image Source Language Data: Digitized images of analogue sources of written language data for research purposes (e.g. facsimiles, scans of handwriting, photos of inscriptions). | recommended
JS | DANS | Tool Support: Tool-related formats required for specific functionality of the tool or reliable reuse of resources (e.g. tagsets, annotation schemes, vocabularies, language models, parameter files, and other specifications or settings). | recommended
KML | DANS | Geodata: Information on geographic locations. | acceptable
MapInfo.mif | DANS | Geodata: Information on geographic locations. | recommended
MapInfo.tab | DANS | Geodata: Information on geographic locations. | acceptable
MapInfo.wor | DANS | Geodata: Information on geographic locations. | acceptable
Markdown | DANS | Documentation: Unstructured documentation of the resource and its parts such as corpus or annotation guidelines. | acceptable
Markdown | DANS | Textual Source Language Data: Written unstructured/plain text or originally structured text (e.g. HTML) without linguistic or other mark-up added for research purposes. | acceptable
MATLAB | DANS | Tool Support: Tool-related formats required for specific functionality of the tool or reliable reuse of resources (e.g. tagsets, annotation schemes, vocabularies, language models, parameter files, and other specifications or settings). | recommended
MKV | DANS | Audiovisual Source Language Data: Audio or video recordings providing spoken/multimodal or signed language data for research purposes. | recommended
MP3 | DANS | Audiovisual Source Language Data: Audio or video recordings providing spoken/multimodal or signed language data for research purposes. | acceptable
MP4 | DANS | Audiovisual Source Language Data: Audio or video recordings providing spoken/multimodal or signed language data for research purposes. | acceptable
MPEG-2 | DANS | Audiovisual Source Language Data: Audio or video recordings providing spoken/multimodal or signed language data for research purposes. | acceptable
MSAccess | DANS | Other: Any other function that cannot be included in an existing domain. The content of this domain will be periodically examined for potential patterns that may give rise to new domains. | acceptable
MXF | DANS | Audiovisual Source Language Data: Audio or video recordings providing spoken/multimodal or signed language data for research purposes. | recommended
NetCDF | DANS | Tool Support: Tool-related formats required for specific functionality of the tool or reliable reuse of resources (e.g. tagsets, annotation schemes, vocabularies, language models, parameter files, and other specifications or settings). | recommended
ODS | DANS | Other: Any other function that cannot be included in an existing domain. The content of this domain will be periodically examined for potential patterns that may give rise to new domains. | recommended
ODT | DANS | Documentation: Unstructured documentation of the resource and its parts such as corpus or annotation guidelines. | recommended
ODT | DANS | Textual Source Language Data: Written unstructured/plain text or originally structured text (e.g. HTML) without linguistic or other mark-up added for research purposes. | recommended