Data Category Registry

Abbreviation: DCR

Scope: Data Categories for Language Resources

Standard body: ISO

Keywords: ISOcat, linguistic categories, tag sets, linguistic annotation, linguistic terminology

Description:

The Data Category Registry (DCR) was developed within ISO TC 37. Its purpose is to provide a list of linguistic concepts and data categories covering a wide range of linguistic domains, so that it can be used in various applications, for instance linguistic annotation, dictionary design, metadata description, and text markup. Because tools and resources refer to the same data categories, the DCR enables interoperability between DCR-based tools and resources.

The DCR makes it possible to reuse existing data categories, and it specifies principles for extending existing data categories or creating new ones. The specification is composed of three main parts: an administrative, a descriptive, and a linguistic part. These parts are intended to ensure that data categories are interoperable and suitable for developing new applications or improving existing ones.
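The three-part structure described above can be pictured as a small record type. The following Python sketch is illustrative only: the class names, field names, status value, identifier, and example values are assumptions made for this example and are not taken from ISO 12620 or from any DCR implementation.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class AdministrativePart:
    # Hypothetical fields: how the category is identified and managed.
    identifier: str
    status: str = "candidate"


@dataclass
class DescriptivePart:
    # Hypothetical fields: what the category means, plus names per language.
    definition: str
    names: Dict[str, str] = field(default_factory=dict)


@dataclass
class LinguisticPart:
    # Hypothetical field: permitted values per object language.
    value_domains: Dict[str, List[str]] = field(default_factory=dict)


@dataclass
class DataCategory:
    administrative: AdministrativePart
    descriptive: DescriptivePart
    linguistic: LinguisticPart

    def is_complete(self) -> bool:
        """True if the category has both an identifier and a definition."""
        return bool(self.administrative.identifier and self.descriptive.definition)


# Example record; the identifier and values are placeholders, not registry data.
part_of_speech = DataCategory(
    administrative=AdministrativePart(identifier="example:partOfSpeech"),
    descriptive=DescriptivePart(
        definition="Category assigned to a word based on its grammatical properties.",
        names={"en": "part of speech"},
    ),
    linguistic=LinguisticPart(value_domains={"en": ["noun", "verb"]}),
)
```

Two resources that both point to the same `identifier` can then be compared directly, which is the kind of interoperability the description refers to.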

Since 2009, the Max Planck Institute for Psycholinguistics in Nijmegen has been developing a web-based, open-source reference implementation of the ISO 12620 standard, called ISOcat (“Data Category Registry for ISO TC 37”). ISOcat describes the data model and procedures for the DCR. It is mainly useful for creating data category specifications and for managing them.

Related Standard(s):

CQLF-2011 (SpecCQLF-1-2011)
CMDI (SpecCmdi)
DictionaryEntry-RePresentation-2010 (SpecDict-IS)
LMF-2012 (SpecLMF-IS)
LAF (SpecLaf)
	The LAF architecture includes the data elements that are defined in ISO 12620 (Data Category Registry).
MLIF-2012 (SpecMlif-IS)
OLiA (SpecOLiA)
SemRoleML-2013 (SpecSemRoleML-IS)
SynAF (SpecSynAF)

Other standards in the same topic(s):

Version Title: Terminology and other language and content resources — Specification of data categories and management of a Data Category Registry for language resources

Abbreviation: DCR-2009 [not official, only for reference in this website]

Version Number: ISO 12620:2009

Status: final

Release Date: 2009-12-10

Publisher:

ISO/TC 37/SC 3/WG 1

URL(s): http://www.iso.org

Related Standard(s):

MAF-2012 (SpecMaf-IS)
	The morpho-syntactic tag sets of the Morpho-syntactic Annotation Framework (ISO/DIS 24611) are based on data categories from the DCR.
TBX-2012 (SpecTBX-IS)
TMF-2009 (SpecTMF-IS)
	Some of the data categories from ISO 16642:2003 are used in this standard version.
WordSeg-1-2010 (SpecWordSegBasic)

Used in CLARIN centre(s):
