Show all info regardless research infrastructures. Switch to CLARIN environment and show only relevant info to CLARIN, e.g. format recommendations by CLARIN centres. Switch to Text+ environment and show only relevant info to Text+, e.g. format recommendations by Text+ centres. Switch to DARIAH environment and show only relevant info to DARIAH, e.g. format recommendations by DARIAH centres.
Format Recommendations

This page presents formats of data depositions that various CLARIN centres are ready to accept. Each format, for each centre, can be "recommended", "acceptable" or "discouraged" in the context of several domains that represent the functions that the deposited data can play. The level of recommendation should always be viewed as relative to the profile of the given centre.

  • "recommended" should be interpreted as meaning that the centre in question will in most cases be able to process the data without much manipulation and that it is likely that the data will be preserved long-term in that format (the specifics are up to that centre);
  • "acceptable" should be interpreted as meaning that the centre may need to spend some time and resources on the up-conversion of the data, and that the data may be preserved in one of the recommended formats instead;
  • "discouraged" should be understood as indicating that the centre may find it problematic to up-convert the data.

Use the dropboxes to select the particular domain, centre, and/or level of recommendation. Columns can be sorted, and your results can be downloaded as XML.

The exported XML files for a specified centre can be used to extend or modify the recommendations for that centre, by an authorised person. In order to aid in the process, please consult the separate lists of all available file formats and of the functional groupings of formats (functional domains).

As of mid-2022, not every centre with depositing services has submitted the information to the SIS; in some cases, the information had to be unreliably mapped from lists provided on centre homepages onto the feature matrix offered by the SIS (created on the basis of the SIS functional domains and levels of recommendation). If you think you see an error, please kindly help us get it right.

Format Centre Domain Recommendation
PDF/A SAW DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
PDF/A FIN-CLARIN DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
PDF/A DANS DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended See more info from DANS
PDF/A-1 Sprakbanken DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
PDF/A-1 ACDH-ARCHE DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
PDF/A-1 OTA DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
PDF/A-2 ORTOLANG DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. acceptable
PDF/A-2 Sprakbanken DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
PDF/A-2 ACDH-ARCHE DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
PDF/A-2 OTA DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
PDF/A-3 ORTOLANG DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. acceptable
PDF/A-3 Sprakbanken DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. acceptable
PDF/A-3 ACDH-ARCHE DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. acceptable
PDF/A-3 OTA DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. acceptable
plainText PORTULAN-CLARIN DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
plainText IDS DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
plainText Sprakbanken DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
plainText SAW DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
plainText ACDH-ARCHE DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
plainText MI DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
plainText CLARIN-DK-UCPH DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. acceptable
plainText FIN-CLARIN DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended e.g. as README.txt
plainText OTA DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
plainText BBAW DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
plainText DANS DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended Encoded as UTF-8/16/32, see more info from DANS
RTFClick to add or suggest missing format information DANS DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. acceptable See more info from DANS
RTFClick to add or suggest missing format information CLARIN-CH DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. discouraged
SGMLClick to add or suggest missing format information DANS DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. acceptable See more info from DANS
TEI CLARIN.SI DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
TEI COCOON DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
TEI Sprakbanken DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
TEI ACDH-ARCHE DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
TEI CLARIN-DK-UCPH DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
TEI EKUT DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
TEI FIN-CLARIN DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
TEI OTA DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
TEI ZIM DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
TEI BBAW DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
TeXClick to add or suggest missing format information ZIM DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
TeXClick to add or suggest missing format information UdS DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
TeXClick to add or suggest missing format information ILC4CLARIN DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
XML CLARIN.SI DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
XML Sprakbanken DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
XML MPI-PL DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
XML ACDH-ARCHE DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
XML MI DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
XML CLARIN-DK-UCPH DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
XML EKUT DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
XML OTA DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
XML ZIM DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
XML BBAW DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended
<< 2/3 >>