A viable data infrastructure is the backbone of Germany as a research location, emphasized the President of KIT, Professor Holger Hanselka. "To master the big challenges of energy, mobility, and information, we have to be capable of turning Big Data rapidly into smart data. At KIT, the research university in the Helmholtz Association, we pool the competencies necessary for this purpose."
The Helmholtz Centres are prepared to preserve research data in suitable data infrastructures in the long term and to make them as open as possible for later use by science and society, said Professor Otmar D. Wiestler, President of the Helmholtz Association.
Germany's leading data centres are joining the Helmholtz Data Federation (HDF) in order to store the flows of research data from various scientific disciplines in an ordered manner, to interconnect them, and to make them available for joint use, pointed out Professor Achim Streit of KIT, coordinator of the HDF. The HDF might serve as a blueprint for data-intensive research in Germany and Europe, an open harbour for accessing and exchanging research data.
The HDF is a central element of the recently adopted position paper of the Helmholtz Association on the handling of research data, which is entitled "Die Ressource Information besser nutzbar machen" (Improving the usability of information resources). Thanks to its secure federation structure and the set-up of multi-thematic data centres, the HDF will enable data-intensive science communities to make their scientific data visible, to share their data while retaining data sovereignty, to use them across disciplines, and to archive them reliably.
The federation is based on three key elements: innovative software for research data management, excellent user support, and the latest storage and analysis hardware. The partners plan medium-term investments in storage systems of double-digit petabyte capacity and in tens of thousands of processor cores for data analysis and management. By 2021, a total of 49.5 million euros is to be provided from the strategic development funds of the Helmholtz Association.
The HDF partners in the first phase are six centres covering five research fields of the Helmholtz Association: Alfred Wegener Institute Helmholtz Centre for Polar and Marine Research (Earth and Environment); Deutsches Elektronen-Synchrotron DESY and GSI Helmholtz Centre for Heavy Ion Research (both Matter); German Cancer Research Centre (Health); and Forschungszentrum Jülich and Karlsruhe Institute of Technology (both Energy, Key Technologies, Matter, and Earth and Environment). The HDF represents the nucleus of a national research data infrastructure across science organisations, which is open to users in the whole German science community. International connections will make it compatible with the future European Open Science Cloud (EOSC).
KIT already operates several infrastructures for Big Data. The Smart Data Innovation Lab (SDIL) provides a Germany-wide research platform with the latest analysis functions for companies. The Smart Data Solution Center Baden-Württemberg (SDSC) supports small and medium-sized enterprises of the region in accessing smart data technologies. The GridKa data centre is part of the worldwide distributed computing network for the European particle accelerator centre CERN. With the Large-Scale Data Facility (LSDF) for science in Baden-Württemberg and the Large-Scale Data Management and Analysis Initiative (LSDMA) of the Helmholtz Association, KIT has already established the basis for coordinating the HDF. In addition, KIT's informatics institutes study analysis methods, evaluation algorithms, and data security.