Collections de corpus oraux numeriques
Abbreviation: COCOON
Research infrastructure:
- CLARIN (C-centre)
Warning: The recommendations have not been curated yet.
Functional domains:
- Audiovisual Annotation
- Audiovisual Source Language Data
- Documentation
- Text Annotation
- Textual Source Language Data
Format recommendations:
Format | Domain | Level | Comments |
---|---|---|---|
CHAT | Audiovisual AnnotationAnnotations of audiovisual sources, usually including a basic rendering of the spoken content (transcription) and sometimes further annotation. | recommended | |
EAF | Textual Source Language DataWritten unstructured/plain text or originally structured text (e.g. HTML) without linguistic or other mark-up added for research purposes. | recommended | |
FLAC | Audiovisual Source Language DataAudio or video recordings providing spoken/multimodal or signed language data for research purposes. | recommended | |
MKVClick to add or suggest missing format information | Audiovisual Source Language DataAudio or video recordings providing spoken/multimodal or signed language data for research purposes. | recommended | |
MP4 | Audiovisual Source Language DataAudio or video recordings providing spoken/multimodal or signed language data for research purposes. | recommended | |
PANGClick to add or suggest missing format information | Audiovisual AnnotationAnnotations of audiovisual sources, usually including a basic rendering of the spoken content (transcription) and sometimes further annotation. | recommended | |
Praat | Textual Source Language DataWritten unstructured/plain text or originally structured text (e.g. HTML) without linguistic or other mark-up added for research purposes. | recommended | |
TEI | DocumentationUnstructured documentation of the resource and its parts such as corpus or annotation guidelines. | recommended | |
TEI | Text AnnotationAnnotations of textual sources/written text, with the original text included or as stand-off. | recommended | |
TRS | Audiovisual AnnotationAnnotations of audiovisual sources, usually including a basic rendering of the spoken content (transcription) and sometimes further annotation. | recommended | |
WAVE | Audiovisual Source Language DataAudio or video recordings providing spoken/multimodal or signed language data for research purposes. | recommended |
Last update commit-id: 147cb5ea