Format Recommendations

This page presents the data deposit formats that various CLARIN centres are ready to accept. Each format, for each centre, can be "recommended", "acceptable" or "discouraged" in the context of several domains that represent the functions that the deposited data can serve. The level of recommendation should always be viewed as relative to the profile of the given centre.

  • "recommended" should be interpreted as meaning that the centre in question will in most cases be able to process the data without much manipulation and that it is likely that the data will be preserved long-term in that format (the specifics are up to that centre);
  • "acceptable" should be interpreted as meaning that the centre may need to spend some time and resources on the up-conversion of the data, and that the data may be preserved in one of the recommended formats instead;
  • "discouraged" should be understood as indicating that the centre may find it problematic to up-convert the data.

Use the drop-down lists to select a particular domain, centre, and/or level of recommendation. Columns can be sorted, and your results can be downloaded as XML.
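The selection and sorting described above can be sketched in a few lines of Python. This is only an illustration, not the SIS implementation: the `Recommendation` class, the `select` function and the `LEVEL_ORDER` mapping are hypothetical names, and the sample rows are copied from the table on this page.

```python
from dataclasses import dataclass

# Hypothetical ordering of the three levels, from most to least preferred.
LEVEL_ORDER = {"recommended": 0, "acceptable": 1, "discouraged": 2}

DOMAIN = "Audiovisual Source Language Data"


@dataclass(frozen=True)
class Recommendation:
    """One table row: a format, as judged by one centre, in one domain."""
    format_name: str
    centre: str
    domain: str
    level: str
    comment: str = ""


# A few rows copied from the table on this page.
ROWS = [
    Recommendation("WAVE", "FIN-CLARIN", DOMAIN, "recommended", "PCM-WAV, 48 kHz, 16 bit"),
    Recommendation("WAVE", "FIN-CLARIN", DOMAIN, "acceptable", "PCM-WAV above 22 kHz/16 bit"),
    Recommendation("MP3", "FIN-CLARIN", DOMAIN, "discouraged",
                   "lossy formats should be avoided if possible"),
    Recommendation("FLAC", "DANS", DOMAIN, "recommended", "See more info from DANS"),
    Recommendation("MP3", "DANS", DOMAIN, "acceptable", "See more info from DANS"),
]


def select(rows, centre=None, domain=None, level=None):
    """Keep rows matching every filter that is not None.

    The result is sorted by level (recommended first), then by format name.
    """
    kept = [
        r for r in rows
        if (centre is None or r.centre == centre)
        and (domain is None or r.domain == domain)
        and (level is None or r.level == level)
    ]
    return sorted(kept, key=lambda r: (LEVEL_ORDER[r.level], r.format_name))


if __name__ == "__main__":
    for row in select(ROWS, centre="FIN-CLARIN"):
        print(row.format_name, row.level, row.comment)
```

Note that one format can appear more than once for the same centre (WAVE at FIN-CLARIN above), because the level may depend on the encoding parameters given in the comment.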

An authorised person can use the XML file exported for a given centre to extend or modify that centre's recommendations. To aid in the process, please consult the separate lists of all available file formats and of the functional groupings of formats (functional domains).

As of mid-2022, not every centre with depositing services has submitted its information to the SIS. In some cases, the information had to be mapped, with limited reliability, from lists provided on centre homepages onto the feature matrix offered by the SIS (created on the basis of the SIS functional domains and levels of recommendation). If you think you see an error, please help us get it right.

Domain "Audiovisual Source Language Data": audio or video recordings providing spoken/multimodal or signed language data for research purposes.

| Format | Centre | Domain | Recommendation | Comments |
|---|---|---|---|---|
| AVI | MI | Audiovisual Source Language Data | recommended | |
| BWF | MI | Audiovisual Source Language Data | recommended | |
| FLAC | MI | Audiovisual Source Language Data | recommended | |
| QuickTime | MI | Audiovisual Source Language Data | recommended | |
| MP4 | MI | Audiovisual Source Language Data | recommended | |
| MPEG-2 | MI | Audiovisual Source Language Data | recommended | |
| WAVE | MI | Audiovisual Source Language Data | recommended | |
| MPEG-4 AVC | HZSK | Audiovisual Source Language Data | recommended | bit rate up to 5000 kbps, scan type progressive, audio AAC, 48 kHz, 384 kbps, stereo |
| WAVE | HZSK | Audiovisual Source Language Data | recommended | (L)PCM-WAV, 48 kHz, 16 bit |
| FLAC | CLARIN-DK-UCPH | Audiovisual Source Language Data | recommended | |
| M2J | CLARIN-DK-UCPH | Audiovisual Source Language Data | recommended | |
| MP4 | CLARIN-DK-UCPH | Audiovisual Source Language Data | recommended | |
| OGG | CLARIN-DK-UCPH | Audiovisual Source Language Data | recommended | |
| MP4 | EKUT | Audiovisual Source Language Data | recommended | |
| WAVE | EKUT | Audiovisual Source Language Data | recommended | |
| QuickTime | LAC | Audiovisual Source Language Data | recommended | Video codec H.264 (preferred profile: main, level: 4.0, 1080p, 30 fps), audio encoding LPCM (preferred sampling rate 48 kHz and bit depth 16 bit) |
| MPEG-4 AVC | LAC | Audiovisual Source Language Data | recommended | Video codec H.264 (preferred profile: main, level: 4.0, 1080p, 30 fps), audio encoding AAC (LC) (preferred sampling rate 48 kHz and bit rate 128–384 kbps) |
| WAVE | LAC | Audiovisual Source Language Data | recommended | LPCM audio (preferred sampling rate 48 kHz and bit depth 16 bit) |
| WAVE | FIN-CLARIN | Audiovisual Source Language Data | recommended | PCM-WAV, 48 kHz, 16 bit |
| WAVE | FIN-CLARIN | Audiovisual Source Language Data | acceptable | PCM-WAV above 22 kHz/16 bit |
| MP4 | FIN-CLARIN | Audiovisual Source Language Data | acceptable | |
| FLAC | FIN-CLARIN | Audiovisual Source Language Data | acceptable | |
| MP3 | FIN-CLARIN | Audiovisual Source Language Data | discouraged | lossy formats should be avoided if possible |
| MPEG-4 AVC | FIN-CLARIN | Audiovisual Source Language Data | recommended | 25 fps, 1920×1080, constant bit rate |
| AIFF | OTA | Audiovisual Source Language Data | acceptable | |
| FLAC | OTA | Audiovisual Source Language Data | acceptable | |
| M2J | OTA | Audiovisual Source Language Data | acceptable | |
| MP3 | OTA | Audiovisual Source Language Data | discouraged | lossy formats should be avoided if possible |
| MP4 | OTA | Audiovisual Source Language Data | acceptable | |
| MPEG-4 AVC | OTA | Audiovisual Source Language Data | recommended | 25 fps, 1920×1080, constant bit rate |
| MPEG-1 | OTA | Audiovisual Source Language Data | acceptable | |
| MPEG-2 | OTA | Audiovisual Source Language Data | acceptable | |
| WAVE | OTA | Audiovisual Source Language Data | recommended | PCM-WAV, 48 kHz, 16 bit |
| WAVE | OTA | Audiovisual Source Language Data | acceptable | PCM-WAV with non-recommended parameters (not 48 kHz, 16 bit) |
| M4A | ZIM | Audiovisual Source Language Data | recommended | |
| QuickTime | ZIM | Audiovisual Source Language Data | recommended | |
| MP3 | ZIM | Audiovisual Source Language Data | recommended | |
| MP4 | ZIM | Audiovisual Source Language Data | recommended | |
| WAVE | ZIM | Audiovisual Source Language Data | recommended | |
| AAC | DANS | Audiovisual Source Language Data | acceptable | See more info from DANS |
| AIFF | DANS | Audiovisual Source Language Data | acceptable | See more info from DANS |
| AVI | DANS | Audiovisual Source Language Data | acceptable | See more info from DANS |
| BWF | DANS | Audiovisual Source Language Data | recommended | See more info from DANS |
| FLAC | DANS | Audiovisual Source Language Data | recommended | See more info from DANS |
| MKV | DANS | Audiovisual Source Language Data | recommended | See more info from DANS |
| MP3 | DANS | Audiovisual Source Language Data | acceptable | See more info from DANS |
| MP4 | DANS | Audiovisual Source Language Data | acceptable | See more info from DANS |
| MPEG-2 | DANS | Audiovisual Source Language Data | acceptable | See more info from DANS |
| MXF | DANS | Audiovisual Source Language Data | recommended | See more info from DANS |
| OGG | DANS | Audiovisual Source Language Data | acceptable | See more info from DANS |
| OPUS | DANS | Audiovisual Source Language Data | recommended | See more info from DANS |