Show all info regardless research infrastructures. Switch to CLARIN environment and show only relevant info to CLARIN, e.g. format recommendations by CLARIN centres. Switch to Text+ environment and show only relevant info to Text+, e.g. format recommendations by Text+ centres. Switch to DARIAH environment and show only relevant info to DARIAH, e.g. format recommendations by DARIAH centres.
Data Archiving and Networked Services
Suggest a fix or extension
Abbreviation: DANS
Link: https://centres.clarin.eu/centre/20
Research infrastructure:
  • CLARIN (C-centre)
Warning: The recommendations have not been curated yet.
Description:

Preferred formats are file formats of which DANS is confident that they will offer the best long-term guarantees in terms of usability, accessibility and sustainability. Deposits of research data in preferred formats will always be accepted by DANS.

Non-preferred formats are file formats that are widely used in addition to the preferred formats, and which will be moderately to reasonably usable, accessible and robust in the long term.

As a general guideline, DANS believes that the file formats best suited for long-term sustainability and accessibility:

  • Are frequently used
  • Have open specifications
  • Are independent of specific software, developers or vendors

In practice, it is not always possible to use formats which satisfy all of these criteria.

It may be desirable to make certain original data available in ‘Non-preferred format(s)’ because these can be characterized as current usage formats. Examples include Esri Shapefiles, Microsoft Access databases, SPSS .sav files. DANS then asks you to deposit your data in these original formats as well as in Preferred formats aimed at long-term sustainability.

If your data are stored in other formats than those mentioned in the recommendations, please contact DANS.

Functional domains:
  • Audiovisual Annotation
  • Audiovisual Source Language Data
  • Catalogue Metadata
  • Contextual Information
  • Documentation
  • Geodata
  • Image Annotation
  • Image Source Language Data
  • Language Description
  • Lexical Resource
  • Metadata
  • Other
  • Statistical Data
  • Text Annotation
  • Textual Source Language Data
  • Tool Support
Format recommendations:
Format Domain Level Comments
AACClick to add or suggest missing format information Audiovisual Source Language DataAudio or video recordings providing spoken/multimodal or signed language data for research purposes. acceptable See more info from DANS
AIClick to add or suggest missing format information Image Source Language DataDigitized images of analogue sources of written language data for research purposes (e.g. facsimiles, scans of handwriting, photos of inscriptions). acceptable See more info from DANS
AIFF Audiovisual Source Language DataAudio or video recordings providing spoken/multimodal or signed language data for research purposes. acceptable See more info from DANS
ArcGIS.gdb GeodataInformation on geographic locations. acceptable See more info from DANS
ArcGIS.mxd GeodataInformation on geographic locations. acceptable See more info from DANS
ASCII Grid GeodataInformation on geographic locations. recommended See more info from DANS
AutoCAD DXF-R12 GeodataInformation on geographic locations. recommended See more info from DANS
AVI Audiovisual Source Language DataAudio or video recordings providing spoken/multimodal or signed language data for research purposes. acceptable See more info from DANS
BWFClick to add or suggest missing format information Audiovisual Source Language DataAudio or video recordings providing spoken/multimodal or signed language data for research purposes. recommended See more info from DANS
CDRClick to add or suggest missing format information Image Source Language DataDigitized images of analogue sources of written language data for research purposes (e.g. facsimiles, scans of handwriting, photos of inscriptions). acceptable See more info from DANS
CSSClick to add or suggest missing format information Tool SupportTool-related formats required for specific functionality of the tool or reliable reuse of resources (e.g. tagsets, annotation schemes, vocabularies, language models, parameter files, and other specifications or settings) recommended See more info from DANS
CSV OtherAny other function that cannot be included in an existing domain. The content of this domain will be periodically examined for potential patterns that may give rise to new domains. recommended See more info from DANS
DBASEClick to add or suggest missing format information OtherAny other function that cannot be included in an existing domain. The content of this domain will be periodically examined for potential patterns that may give rise to new domains. acceptable See more info from DANS
DGNClick to add or suggest missing format information GeodataInformation on geographic locations. acceptable See more info from DANS
DICOM Image Source Language DataDigitized images of analogue sources of written language data for research purposes (e.g. facsimiles, scans of handwriting, photos of inscriptions). recommended See more info from DANS
DOCClick to add or suggest missing format information DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. acceptable applicable specifically to Word .doc; see more info from DANS
DOCClick to add or suggest missing format information Textual Source Language DataWritten unstructured/plain text or originally structured text (e.g. HTML) without linguistic or other mark-up added for research purposes. acceptable applicable specifically to Word .doc; see more info from DANS
DOCX DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. acceptable See more info from DANS
DOCX Textual Source Language DataWritten unstructured/plain text or originally structured text (e.g. HTML) without linguistic or other mark-up added for research purposes. acceptable See more info from DANS
DWGClick to add or suggest missing format information GeodataInformation on geographic locations. acceptable See more info from DANS
DXF GeodataInformation on geographic locations. acceptable See more info from DANS
EPSClick to add or suggest missing format information Image Source Language DataDigitized images of analogue sources of written language data for research purposes (e.g. facsimiles, scans of handwriting, photos of inscriptions). acceptable See more info from DANS
Erdas.img GeodataInformation on geographic locations. acceptable See more info from DANS
FLAC Audiovisual Source Language DataAudio or video recordings providing spoken/multimodal or signed language data for research purposes. recommended See more info from DANS
GeoJSON GeodataInformation on geographic locations. recommended
GeoTIFF GeodataInformation on geographic locations. recommended See more info from DANS
GML GeodataInformation on geographic locations. recommended See more info from DANS
HDF5 OtherAny other function that cannot be included in an existing domain. The content of this domain will be periodically examined for potential patterns that may give rise to new domains. acceptable See more info from DANS
HTML DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended See more info from DANS
HTML Textual Source Language DataWritten unstructured/plain text or originally structured text (e.g. HTML) without linguistic or other mark-up added for research purposes. recommended See more info from DANS
JP2 Image Source Language DataDigitized images of analogue sources of written language data for research purposes (e.g. facsimiles, scans of handwriting, photos of inscriptions). recommended See more info from DANS
JPEG Image Source Language DataDigitized images of analogue sources of written language data for research purposes (e.g. facsimiles, scans of handwriting, photos of inscriptions). recommended See more info from DANS
JS Tool SupportTool-related formats required for specific functionality of the tool or reliable reuse of resources (e.g. tagsets, annotation schemes, vocabularies, language models, parameter files, and other specifications or settings) recommended See more info from DANS
KML GeodataInformation on geographic locations. acceptable See more info from DANS
MapInfo.mif GeodataInformation on geographic locations. recommended See more info from DANS
MapInfo.tab GeodataInformation on geographic locations. acceptable See more info from DANS
MapInfo.wor GeodataInformation on geographic locations. acceptable See more info from DANS
Markdown DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. acceptable See more info from DANS
Markdown Textual Source Language DataWritten unstructured/plain text or originally structured text (e.g. HTML) without linguistic or other mark-up added for research purposes. acceptable See more info from DANS
MATLABClick to add or suggest missing format information Tool SupportTool-related formats required for specific functionality of the tool or reliable reuse of resources (e.g. tagsets, annotation schemes, vocabularies, language models, parameter files, and other specifications or settings) recommended See more info from DANS
MKVClick to add or suggest missing format information Audiovisual Source Language DataAudio or video recordings providing spoken/multimodal or signed language data for research purposes. recommended See more info from DANS
MP3 Audiovisual Source Language DataAudio or video recordings providing spoken/multimodal or signed language data for research purposes. acceptable See more info from DANS
MP4 Audiovisual Source Language DataAudio or video recordings providing spoken/multimodal or signed language data for research purposes. acceptable See more info from DANS
MPEG-2 Audiovisual Source Language DataAudio or video recordings providing spoken/multimodal or signed language data for research purposes. acceptable See more info from DANS
MSAccessClick to add or suggest missing format information OtherAny other function that cannot be included in an existing domain. The content of this domain will be periodically examined for potential patterns that may give rise to new domains. acceptable See more info from DANS
MXFClick to add or suggest missing format information Audiovisual Source Language DataAudio or video recordings providing spoken/multimodal or signed language data for research purposes. recommended See more info from DANS
NetCDFClick to add or suggest missing format information Tool SupportTool-related formats required for specific functionality of the tool or reliable reuse of resources (e.g. tagsets, annotation schemes, vocabularies, language models, parameter files, and other specifications or settings) recommended See more info from DANS
ODSClick to add or suggest missing format information OtherAny other function that cannot be included in an existing domain. The content of this domain will be periodically examined for potential patterns that may give rise to new domains. recommended See more info from DANS
ODTClick to add or suggest missing format information DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended See more info from DANS
ODTClick to add or suggest missing format information Textual Source Language DataWritten unstructured/plain text or originally structured text (e.g. HTML) without linguistic or other mark-up added for research purposes. recommended See more info from DANS
OGGClick to add or suggest missing format information Audiovisual Source Language DataAudio or video recordings providing spoken/multimodal or signed language data for research purposes. acceptable See more info from DANS
OPUSClick to add or suggest missing format information Audiovisual Source Language DataAudio or video recordings providing spoken/multimodal or signed language data for research purposes. recommended See more info from DANS
PDF DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. acceptable See more info from DANS
PDF Textual Source Language DataWritten unstructured/plain text or originally structured text (e.g. HTML) without linguistic or other mark-up added for research purposes. acceptable See more info from DANS
PDF/A DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended See more info from DANS
PDF/A OtherAny other function that cannot be included in an existing domain. The content of this domain will be periodically examined for potential patterns that may give rise to new domains. acceptable See more info from DANS
PDF/A Textual Source Language DataWritten unstructured/plain text or originally structured text (e.g. HTML) without linguistic or other mark-up added for research purposes. recommended See more info from DANS
plainText DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended Encoded as UTF-8/16/32, see more info from DANS
plainText Textual Source Language DataWritten unstructured/plain text or originally structured text (e.g. HTML) without linguistic or other mark-up added for research purposes. recommended Encoded as UTF-8/16/32, see more info from DANS
PNG Image Source Language DataDigitized images of analogue sources of written language data for research purposes (e.g. facsimiles, scans of handwriting, photos of inscriptions). recommended See more info from DANS
QGIS.qgs GeodataInformation on geographic locations. acceptable See more info from DANS
QTClick to add or suggest missing format information Audiovisual Source Language DataAudio or video recordings providing spoken/multimodal or signed language data for research purposes. acceptable See more info from DANS
RClick to add or suggest missing format information Statistical DataData from surveys and tests in numeric formats. recommended See more info from DANS
RTFClick to add or suggest missing format information DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. acceptable See more info from DANS
RTFClick to add or suggest missing format information Textual Source Language DataWritten unstructured/plain text or originally structured text (e.g. HTML) without linguistic or other mark-up added for research purposes. acceptable See more info from DANS
SAS.sd2 Statistical DataData from surveys and tests in numeric formats. acceptable See more info from DANS
SGMLClick to add or suggest missing format information DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. acceptable See more info from DANS
SGMLClick to add or suggest missing format information Textual Source Language DataWritten unstructured/plain text or originally structured text (e.g. HTML) without linguistic or other mark-up added for research purposes. acceptable See more info from DANS
Shapefile GeodataInformation on geographic locations. acceptable See more info from DANS
SIARDClick to add or suggest missing format information OtherAny other function that cannot be included in an existing domain. The content of this domain will be periodically examined for potential patterns that may give rise to new domains. recommended See more info from DANS
SPSS.data+setup Statistical DataData from surveys and tests in numeric formats. recommended For general info, see the DANS page for statistical data. SPSS is recommended as "data and setup" (.dat/.sps) format as the most sustainable option.
SPSS.por Statistical DataData from surveys and tests in numeric formats. acceptable For general info, see the DANS page for statistical data. The Portable version of the SPSS format is not recommended.
SPSS.sav Statistical DataData from surveys and tests in numeric formats. acceptable For general info, see the DANS page for statistical data. The .sav version of the SPSS format is not recommended.
SQLClick to add or suggest missing format information OtherAny other function that cannot be included in an existing domain. The content of this domain will be periodically examined for potential patterns that may give rise to new domains. recommended See more info from DANS
STATA.data+setup Statistical DataData from surveys and tests in numeric formats. recommended For general info, see the DANS page for statistical data. STATA is recommended as "data and setup" (.dat/.DO) format as the most sustainable option.
STATA.dta Statistical DataData from surveys and tests in numeric formats. acceptable For general info, see the DANS page for statistical data. The native format of STATA is not recommended.
SVG Image Source Language DataDigitized images of analogue sources of written language data for research purposes (e.g. facsimiles, scans of handwriting, photos of inscriptions). recommended See more info from DANS
SVG OtherAny other function that cannot be included in an existing domain. The content of this domain will be periodically examined for potential patterns that may give rise to new domains. recommended See more info from DANS
TextFabricClick to add or suggest missing format information Tool SupportTool-related formats required for specific functionality of the tool or reliable reuse of resources (e.g. tagsets, annotation schemes, vocabularies, language models, parameter files, and other specifications or settings) recommended See more info from DANS
TIFF Image Source Language DataDigitized images of analogue sources of written language data for research purposes (e.g. facsimiles, scans of handwriting, photos of inscriptions). recommended See more info from DANS
WAVE Audiovisual Source Language DataAudio or video recordings providing spoken/multimodal or signed language data for research purposes. acceptable See more info from DANS
WMFClick to add or suggest missing format information Image Source Language DataDigitized images of analogue sources of written language data for research purposes (e.g. facsimiles, scans of handwriting, photos of inscriptions). acceptable See more info from DANS
Worldfile.jpgw GeodataInformation on geographic locations. acceptable See more info from DANS
Worldfile.tifw GeodataInformation on geographic locations. acceptable See more info from DANS
XLSClick to add or suggest missing format information OtherAny other function that cannot be included in an existing domain. The content of this domain will be periodically examined for potential patterns that may give rise to new domains. acceptable See more info from DANS
XLSX OtherAny other function that cannot be included in an existing domain. The content of this domain will be periodically examined for potential patterns that may give rise to new domains. acceptable See more info from DANS
XML Audiovisual AnnotationAnnotations of audiovisual sources, usually including a basic rendering of the spoken content (transcription) and sometimes further annotation. recommended See more info from DANS
XML Catalogue MetadataBasic structured information for discoverability and general description, to be openly provided for harvesting. recommended See more info from DANS
XML Contextual InformationStructured information on the communicative event or text and its creators (i.e. participants or authors) relevant for analysis. recommended See more info from DANS
XML DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. recommended See more info from DANS
XML GeodataInformation on geographic locations. recommended See more info from DANS
XML Image AnnotationAnnotations of image sources. recommended See more info from DANS
XML Language DescriptionStructured or unstructured descriptions of linguistic varieties or phenomena, typological databases etc. recommended See more info from DANS
XML Lexical ResourceStructured (item-based) resources for lexical and/or conceptual information on units of language (e.g. wordlists, lexicons, WordNets etc.) recommended See more info from DANS
XML MetadataComprehensive structured information including descriptive, structural and administrative metadata. See the for further hints. recommended See more info from DANS
XML Text AnnotationAnnotations of textual sources/written text, with the original text included or as stand-off. recommended See more info from DANS
XML Textual Source Language DataWritten unstructured/plain text or originally structured text (e.g. HTML) without linguistic or other mark-up added for research purposes. recommended See more info from DANS
XML Tool SupportTool-related formats required for specific functionality of the tool or reliable reuse of resources (e.g. tagsets, annotation schemes, vocabularies, language models, parameter files, and other specifications or settings) recommended See more info from DANS
XSLTClick to add or suggest missing format information Tool SupportTool-related formats required for specific functionality of the tool or reliable reuse of resources (e.g. tagsets, annotation schemes, vocabularies, language models, parameter files, and other specifications or settings) recommended See more info from DANS
Last update commit-id: 147cb5ea