The Finnish EOSC Node, coordinated by CSC – IT Center for Science, provides national FAIR datasets, services, and research data management (RDM) competence development resources, integrated into the EOSC Federation.

The Node connects with the LUMI AI Factory to provide large and unique datasets, pilots secure processing environments for sensitive data, and promotes interoperability with the EuroHPC Federation. The Node will serve Finnish researchers via the Haka federation and EOSC AAI, and will provide training, FAIR datasets, and multi-node workflow pilots.


SCIENTIFIC IMPACT

FAIR data

The Finnish EOSC Node provides FAIR data via fairdata.fi (Metax, Etsin), research.fi (CRIS data), national repositories (FinCLARIN, geodata services), and CSC competence centre training resources.

Scientific use cases

Federating CERN’s REANA pipelines

The REANA science case focuses on enabling near-data computation—sending computational workflows to where large scientific datasets are stored, rather than transferring massive volumes of data to the researcher.

The use case demonstrates this concept through particle physics, a field that generates enormous data volumes, but it applies to many other domains, including astronomy and life sciences. The project aims to show how researchers can execute their analyses directly at the data source, using REANA (CERN’s reproducible research data analysis platform) to manage containerized workflows across federated computing resources.
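The near-data idea can be illustrated with a toy placement calculation. The sketch below is not REANA's API. The `Dataset`, `Workflow`, `transfer_cost_gb`, and `choose_site` names, the site labels, and the sizes are all hypothetical, and the cost model assumes that only network transfer matters.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class Dataset:
    name: str
    site: str        # site where the data is stored
    size_gb: float


@dataclass(frozen=True)
class Workflow:
    name: str
    image_gb: float  # size of the container image and workflow definition


def transfer_cost_gb(workflow: Workflow, dataset: Dataset, run_site: str) -> float:
    """Gigabytes moved over the network if the workflow runs at run_site.

    Assumption: the workflow image is always shipped to the run site, and the
    dataset is shipped only when the run site differs from the storage site.
    """
    data_moved = 0.0 if run_site == dataset.site else dataset.size_gb
    return workflow.image_gb + data_moved


def choose_site(workflow: Workflow, dataset: Dataset, sites: Sequence[str]) -> str:
    """Pick the candidate site that minimises network transfer."""
    return min(sites, key=lambda site: transfer_cost_gb(workflow, dataset, site))


# Hypothetical example values, not taken from the project.
collisions = Dataset(name="collision-events", site="site-a", size_gb=50_000.0)
analysis = Workflow(name="example-analysis", image_gb=2.0)
candidate_sites = ["site-a", "site-b"]

best = choose_site(analysis, collisions, candidate_sites)
print(best, transfer_cost_gb(analysis, collisions, best))
```

Under these assumptions the workflow is placed at the site that already holds the data, because shipping a small container image costs far less than shipping the dataset. A real scheduler would also weigh compute capacity, access policy, and queue times.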

Imaging data workflows on Galaxy

This science case demonstrates how a federated, open-access computational platform can transform the way imaging data are processed, shared, and reused across diverse scientific domains.

The use case leverages the Galaxy platform to integrate data from a wide range of imaging-based research fields—such as life sciences (microscopy), astrophysics (telescope data), climate science (satellite data), and marine science (underwater imagery)—into a unified analysis environment. The goal is to demonstrate that Galaxy can be integrated into the EOSC Federation’s common infrastructure to serve many disciplines simultaneously, allowing researchers to share workflows, reuse methods, and access powerful computational tools without needing specialised technical expertise.

Federated analysis of pathogen genomes

The federated analysis of pathogen genomes science case outlines a federated, cross‑border capability for timely analysis of pathogen genomes that brings computation to the data instead of copying sensitive datasets across institutions.

The objective is to shorten time-to-insight for outbreak detection, source attribution, and antimicrobial-resistance (AMR) surveillance while preserving data sovereignty and meeting European legal and ethical requirements. Experience from COVID-19 showed that sequencing at scale can transform public-health decision-making. Operationally, the effort starts with two neighbouring nodes of the EOSC Federation: the Slovakian national node, providing workflows, datasets, computational infrastructure and domain expertise, and the Polish national node (via Poland’s National Science Centre (NCN) and a scientific repository service), supplying key technical support and its own datasets. Their geographical proximity makes the two EOSC Nodes an ideal pair for a cross-border pilot. The approach demonstrates how to establish a federation of trusted sites, run harmonized workflows locally, and share only the minimum results needed for action.
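The pattern of running a harmonized workflow locally and sharing only minimal results can be sketched as follows. This is an illustrative toy, not the pilot's actual workflow: the record fields, the site data, the function names, and the small-count suppression threshold are all assumptions made for the example.

```python
from collections import Counter
from typing import Dict, Iterable, List

Record = Dict[str, str]
Summary = Dict[str, int]

# Hypothetical per-site records; in a real federation these never leave the site.
SITE_A_RECORDS: List[Record] = [
    {"sample_id": "a1", "lineage": "lineage-1"},
    {"sample_id": "a2", "lineage": "lineage-1"},
    {"sample_id": "a3", "lineage": "lineage-2"},
]
SITE_B_RECORDS: List[Record] = [
    {"sample_id": "b1", "lineage": "lineage-1"},
    {"sample_id": "b2", "lineage": "lineage-3"},
    {"sample_id": "b3", "lineage": "lineage-3"},
]


def run_local_workflow(records: Iterable[Record], min_count: int = 2) -> Summary:
    """Runs inside a site: count samples per lineage and drop small counts.

    Only this aggregate summary is returned; sample-level records stay local.
    The min_count threshold is an assumed disclosure-control rule.
    """
    counts = Counter(record["lineage"] for record in records)
    return {lineage: n for lineage, n in counts.items() if n >= min_count}


def combine_summaries(summaries: Iterable[Summary]) -> Summary:
    """Runs at the coordinating site: add up the per-site aggregate counts."""
    total: Counter = Counter()
    for summary in summaries:
        total.update(summary)
    return dict(total)


# The same workflow runs at each site; only summaries cross the border.
site_summaries = [
    run_local_workflow(SITE_A_RECORDS),
    run_local_workflow(SITE_B_RECORDS),
]
combined = combine_summaries(site_summaries)
print(combined)
```

In this sketch the suppression step trades accuracy for disclosure protection: a lineage seen only once at a site is not reported at all, so the combined counts can undercount the true totals.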

Other use cases

  1. Dataset-as-a-Service via LUMI AI Factory (large, restricted datasets).
  2. Facilitating collaboration across institutional and national boundaries by federating sync-and-share services.
  3. Onboarding Finnish resources (FAIR data, services, training) into the EOSC Federation.
  4. Integration with the EU Node (AAI and federated catalogues).
  5. Sustaining RDM Competence Centre.