Preparation of a tool chain for multimodal data in CLARIN-D (WG 6)

Project content

Working group 6 (WG 6), “Speech and other modalities”, aims to establish and strengthen multimodal language resources in CLARIN-D. The first WG 6 curation project advanced the editing and integration of multimodal resources; in this second step, a tool chain for multimodal data will be prepared. A tool chain in this context is a linkage of the various tools needed for the individual working steps involved in building multimodal corpora.
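The idea of a tool chain as linked working steps can be illustrated with a minimal sketch. Everything in it is a hypothetical placeholder chosen for illustration (the `Recording` exchange object, the two tool functions, and the file name); it does not describe the interfaces or formats the project actually specifies.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Recording:
    """Hypothetical exchange object handed from one tool to the next."""
    media_file: str
    annotations: List[str] = field(default_factory=list)


# A tool is any function that takes a Recording and returns a Recording.
Tool = Callable[[Recording], Recording]


def segment_speech(rec: Recording) -> Recording:
    # Placeholder for a real speech segmentation tool.
    rec.annotations.append("speech-segments")
    return rec


def annotate_gesture(rec: Recording) -> Recording:
    # Placeholder for a real gesture annotation tool.
    rec.annotations.append("gesture-tier")
    return rec


def run_chain(rec: Recording, tools: List[Tool]) -> Recording:
    """Apply each tool in order, passing the result on to the next one."""
    for tool in tools:
        rec = tool(rec)
    return rec


result = run_chain(Recording("session01.wav"), [segment_speech, annotate_gesture])
print(result.annotations)
```

The point of the sketch is that tools can only be linked in this way if they agree on a shared exchange format, which is why the specification of interface formats is part of the project.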

Linking working steps in this way is already established practice for linguistic data, and corresponding service architectures such as WebLicht are used and valued by the respective research communities. Multimodal data, however, differ from linguistic and textual data in many respects (e.g., data volume, data and representation formats, and usage restrictions arising from the personal rights of the participants). To facilitate the development of corpora, this curation project will develop and explore concepts for tool chains for multimodal data. In doing so, it needs to be clarified whether the concepts of existing service architectures are also applicable to multimodal data. Further steps are cataloguing the existing tools and data formats for multimodal data, and specifying and implementing suitable data formats for interfaces. To cover most of the tools used by the research community, close cooperation with the various members and centres of WG 6 is planned. The project will also benefit from the experience gained with the three corpora and the tools used in the first curation project.

WG 6 “Speech and other modalities” is responsible for the content of this project; the work is carried out by research assistants at Bielefeld University. The project can draw on technical advice and assistance, mainly from the CLARIN-D centre Bavarian Archive for Speech Signals, as well as from two other CLARIN-D centres, the University of Tübingen and the Max Planck Institute for Psycholinguistics (MPI).


  • 01.11.2013 - 30.04.2014


  • Working group 6 “Speech and other modalities”, represented by Prof. Dr.-Ing. Stefan Kopp, research group “Sociable Agents”, CITEC, Faculty of Technology, Bielefeld University

Responsible Institution

  • Research group “Sociable Agents”, CITEC, Faculty of Technology, Bielefeld University

Executive Staff

  • Farina Freigang (Bielefeld University)
  • Thomas Kronenberg (Bielefeld University)
  • Sören Klett (Bielefeld University)