Show all info regardless research infrastructures. Switch to CLARIN environment and show only relevant info to CLARIN, e.g. format recommendations by CLARIN centres. Switch to Text+ environment and show only relevant info to Text+, e.g. format recommendations by Text+ centres. Switch to DARIAH environment and show only relevant info to DARIAH, e.g. format recommendations by DARIAH centres.
About SIS

This service (more precisely, its "Standards and Specifications" section) was created in 2012 and has been maintained since then at IDS Mannheim as part of the Institute's activity as a CLARIN B-Centre. Around 2020, the system was refreshed and extended with, among others, the section "Format Recommendations". Much of its history can be seen in the versioning system hosted at GitHub.

The SIS is listed as a knowledge base at Fairsharing: https://fairsharing.org/4705

Frequently Asked Questions

  1. The data about my centre do not seem entirely correct. Where did you take these recommendations from?

    We bootstrapped the system with data coming from those centres that made that information available on their homepages, as early as 2020 (and thereabouts). Some of these data had been supplied by members of the Standards Committee connected with those centres, and, in selected cases that offered interesting testing grounds (e.g. negative recommendations or inter-format relationships), the data was added by hand, on the basis of information provided publicly by the centre (back then).

    Note that the information had to be interpreted by us, at least with respect to the functions of data and often with respect to the level of recommendation (recommended vs. acceptable vs. discouraged). That initial state of information is only meant as 'seed' for further work by the centre itself. The need for that work had been re-iterated in the committee presentations at the yearly CLARIN conferences, at NCF (National Coordinators' Forum) meetings, and often directly in conversations with members of centres, to a moderate degree of success, until the Technical Centres committee took the issue up as a cyclic task, in early 2024.

  2. Why put work into the SIS if we can have our own page, structured and styled as we please?

    The "as we please" bit would mean resigning from all the goodies that can be extracted out of data aggregation across CLARIN (and, in the future, across comparable research infrastructures). The idea of the SIS is not only to make it possible to fulfil the CTS requirements in a relatively painless manner, but also to visualize the aggregated data, and also to return the centre's data back to it, thanks to the API point that we make available.

    For example, for the IDS, you would use, e.g., curl 'https://clarin.ids-mannheim.de/standards/rest/views/recommended-formats-with-search.xq?centre=IDS&export=yes' -- have a look at the API documentation to see what parameters are possible, etc.

    You can see an example way of querying the data with jQuery at https://github.com/IDS-Mannheim/IDS-Mannheim.github.io, and the corresponding simple webpage is available for viewing at https://ids-mannheim.github.io/standards/ (many browsers will allow you \ to view the source by doing Ctrl+U). If you would like to contribute a CSS (or XSL) stylesheet to render the info in a nicer way, please feel welcome to contact us and we will set up a directory for such contributions.

  3. Where can one learn more?

    We will gladly deliver a zoom presentation to your centre -- please contact us about that.

    The Centres Committee has an internal checklist document for the purpose of tracking centres' progress -- if you're a centre representative, please do contact your representative on the Centres' committee (or contact us).

    In the meantime, we have tried to make the system as frustration-free when it comes to updating or entering recommendations by the individual centres, as possible. Please have a look at the documentation and contact us (preferably via GitHub issues, so that you can see how we react and so that others can see that the issue is being worked on).

Contact

If you have any question or feedback about the content of this website, please consider using GitHub issues, so that you can see how the matter is processed and what decisions are made (and often why). Link to pre-filled templates of GitHub issues are also placed at various crucial spots in the SIS.

If issues do not suffice for some reason, contact us directly at the addresses below, but please be aware that that often extends the path (because then, most often, we need to re-post your issue ourselves, and that might take a while, and until then, the process is non-transparent), and consequently the time to handle your feedback.

Eliza Margaretha (main developer)

Piotr Banski (conceptual work and contact to the CLARIN Standards and Interoperability Committee)

Institut für Deutsche Sprache
R5, 6-13
68-161 Mannheim
Germany

Further reading

The following publications focus on the Standards Information System at various stages of its development.

The SIS offers some of its documentation through the GitHub wiki.

Technicalities

The SIS is an open-source project hosted at GitHub, built using mainly XQuery (1-3.1) and XML. It is running atop eXist-db 6.2.0.