Research and Advances
Computing Applications Europe Region special section: Hot topics

A Federated Infrastructure for European Data Spaces


The European strategy for data calls for “common data spaces” as a foundation for the data economy in the European Union.1 President Ursula von der Leyen expressed her view that Gaia-X should play a key role in response to these calls by providing a federated data infrastructure in the cloud.a

One data space example is the Mobility Data Spaceb, which was launched in 2020 as a multistakeholder project in Germany in close alignment with the German Federal Government. It aims to enable data-driven services in the mobility sector while ensuring data sovereignty for data holders and trust among all participants. Examples of smart services are intermodal end-to-end services for individual travelers, traffic management services for smart cities, and services to increase road safety for individual drivers.

Figure 1 shows the basic functionality of the Mobility Data Space, which follows the International Data Spaces (IDS) Reference Architecture Model (RAM).4 It does not pool data in a central data store but connects data providers and data users, each of whom uses the IDS Connector component. A catalog allows data providers to present and describe their data resources, together with the conditions under which the data can be used. Data users search the catalog for the data they need for their smart services. If data demand and supply match, the data itself is exchanged directly between the participants, with no involvement of the Mobility Data Space operator. Only metadata on data exchange transactions are monitored and logged to ensure that the conditions for the use of the data are correctly exchanged and followed.

Figure 1. Mobility data space overview.
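
The flow described above (catalog lookup, direct exchange between participants, metadata-only logging) can be illustrated with a minimal sketch. It is not the IDS Connector API or the Mobility Data Space implementation; every class, function, and field name here (`Offer`, `Connector`, `DataSpace`, `exchange`, `allowed_purposes`) is a hypothetical stand-in chosen for illustration, and usage conditions are reduced to a simple set of permitted purposes.

```python
from dataclasses import dataclass, field


@dataclass
class Offer:
    """Catalog entry: describes a data resource and its usage conditions."""
    provider: str
    resource: str
    allowed_purposes: set[str]


@dataclass
class Connector:
    """Hypothetical stand-in for a participant's connector; data stays here."""
    owner: str
    store: dict[str, list] = field(default_factory=dict)

    def serve(self, resource: str) -> list:
        return self.store[resource]


@dataclass
class DataSpace:
    """Operator side: holds only the catalog and a transaction metadata log."""
    catalog: list[Offer] = field(default_factory=list)
    log: list[dict] = field(default_factory=list)

    def publish(self, offer: Offer) -> None:
        self.catalog.append(offer)

    def search(self, resource: str, purpose: str) -> list[Offer]:
        # Match demand against supply, including the usage conditions.
        return [o for o in self.catalog
                if o.resource == resource and purpose in o.allowed_purposes]

    def record(self, offer: Offer, user: str, purpose: str) -> None:
        # Only metadata about the transaction is logged, never the payload.
        self.log.append({"provider": offer.provider, "user": user,
                         "resource": offer.resource, "purpose": purpose})


def exchange(space, connectors, user, resource, purpose):
    """Find a matching offer, then fetch the data from the provider directly."""
    matches = space.search(resource, purpose)
    if not matches:
        return None
    offer = matches[0]
    payload = connectors[offer.provider].serve(resource)  # bypasses the operator
    space.record(offer, user, purpose)
    return payload


space = DataSpace()
provider = Connector("city-traffic", {"road-sensors": [42, 57, 61]})
connectors = {"city-traffic": provider}
space.publish(Offer("city-traffic", "road-sensors", {"traffic-management"}))

data = exchange(space, connectors, "routing-app",
                "road-sensors", "traffic-management")
denied = exchange(space, connectors, "ad-network",
                  "road-sensors", "advertising")
```

In this sketch the operator object never stores the sensor readings; it sees only who exchanged which resource for which purpose, and a request whose purpose is not covered by the offer finds no match.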

The Mobility Data Space balances the interest of the individual data provider regarding data sovereignty and the interest of the community to increase the data availability and its use. Data has a value when used7 and, thus, should be used as widely as possible to seize its innovation opportunities. However, data use must always take into consideration the interests of the data holder. Providing data, particularly high-quality data, comes at a cost.6 In this context, data sovereignty is defined as the capability of a data holder to be self-determined regarding the use of their data.5

Gaia-X as a Federated Data Infrastructure

Data spaces represent a data integration concept that follows Linked Data design principles.2 First, data spaces do not require physical data integration but leave the data at the source and only make it accessible when needed. Second, data is not forced into one common schema but linked, that is, integrated on a semantic level. Third, the distributed architecture of data spaces allows data redundancies, that is, multiple data objects may exist in a data space describing the same real-world object. Finally, data spaces can be overlapping and nested so that data holders and data users, respectively, can be participants in multiple data spaces.
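
As a rough illustration of the first three principles, the sketch below keeps two sources in their own schemas, maps their fields to shared terms only when a record is read, and tolerates two records describing the same real-world object. The identifiers, the `ex:` vocabulary terms, and the helper names (`describe`, `linked_view`) are assumptions made for this example; a real data space would rely on an agreed vocabulary rather than hand-written dictionaries.

```python
# Two sources keep their own schemas; the records stay where they are.
SOURCE_A = {"a:stop/17": {"name": "Central Station", "lat": 51.5, "lon": 7.4}}
SOURCE_B = {"b:halt/9": {"title": "Main Station", "platforms": 16}}
SOURCES = {"A": SOURCE_A, "B": SOURCE_B}

# Semantic layer: local field names mapped to shared (hypothetical) terms.
TERM_MAP = {
    "A": {"name": "ex:label", "lat": "ex:latitude", "lon": "ex:longitude"},
    "B": {"title": "ex:label", "platforms": "ex:platformCount"},
}

# Identity links: both records describe the same real-world object.
SAME_AS = [("a:stop/17", "b:halt/9")]


def describe(source_id, record_id):
    """Translate one record into shared terms on demand; nothing is copied."""
    record = SOURCES[source_id][record_id]
    return {TERM_MAP[source_id][key]: value for key, value in record.items()}


def linked_view(record_id):
    """Collect all descriptions of the object that record_id refers to."""
    ids = {record_id}
    for left, right in SAME_AS:
        if record_id in (left, right):
            ids.update((left, right))
    view = {}
    for rid in sorted(ids):
        source_id = next(s for s, recs in SOURCES.items() if rid in recs)
        for term, value in describe(source_id, rid).items():
            # Redundant values are kept side by side, not forced to agree.
            view.setdefault(term, []).append(value)
    return view


station = linked_view("a:stop/17")
```

Here `station["ex:label"]` holds both labels, one from each source, which reflects the point that redundancy is allowed and no common schema is imposed.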

To implement the European Strategy for Data and create common data spaces, interoperability of data and collaboration of participants are required across the boundaries of individual data spaces. Gaia-X aims to achieve this by providing so-called federation services that function within and across different data spaces.

The Gaia-X initiative is organized as a not-for-profit association headquartered in Brussels.c With more than 300 members, the association aims at a federated data architecture that comprises not only data and smart services, but also cloud infrastructure services (see Figure 2). Gaia-X specifies four so-called federation services, namely “identity and trust,” “sovereign data exchange,” “federated catalog,” and “compliance.”3 They form a blueprint for data spaces and allow for “federations of ecosystems,” and hence support interoperability and coordination across data spaces. Such coordination requires the federation of the identities of data space participants, of catalog entries, and of data transaction logs.

Figure 2. Gaia-X ecosystem and federation services.
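
To make the idea of federating catalog entries across data spaces more concrete, the following sketch queries several independent catalogs in place and merges the results while recording which data space each entry came from. It is a simplified illustration and not the Gaia-X federated catalog specification; `Entry`, `SpaceCatalog`, `federated_search`, and the sample data are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    """Catalog entry identified by its provider and resource name."""
    provider: str
    resource: str


class SpaceCatalog:
    """Catalog owned and operated by a single data space."""

    def __init__(self, name, entries):
        self.name = name
        self.entries = list(entries)

    def find(self, keyword):
        return [e for e in self.entries if keyword in e.resource]


def federated_search(catalogs, keyword):
    """Query every catalog where it lives and merge results by entry."""
    results = {}
    for catalog in catalogs:
        for entry in catalog.find(keyword):
            # A provider may take part in several overlapping data spaces.
            results.setdefault(entry, []).append(catalog.name)
    return results


mobility = SpaceCatalog("mobility", [
    Entry("city-traffic", "road-sensor-feed"),
    Entry("rail-co", "timetable"),
])
energy = SpaceCatalog("energy", [
    Entry("city-traffic", "road-sensor-feed"),
    Entry("grid-op", "charging-station-load"),
])

hits = federated_search([mobility, energy], "road")
```

Each catalog stays under the control of its own data space; the federation layer only combines search results, so a provider that participates in both data spaces appears once, with both origins listed.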

Data Infrastructure Patterns

Gaia-X supports the goals of interoperability of services, data portability, and data sovereignty as articulated in the European data strategy, and it addresses a requirement gap in existing infrastructure models. These models were, and still are, dominated by hyper-scaling platform providers on the one hand and state-controlled infrastructures on the other. Neither meets the European requirements mentioned here. Consequently, Gaia-X represents an alternative design pattern for data infrastructures.

In contrast to these alternative patterns, Gaia-X follows a cooperative approach (see the approximate juxtaposition in the accompanying table). The federated infrastructure is open for use and owned by the community itself, and thus avoids concentration of control over data and services.

Table. Data infrastructure patterns.
