PHASE IV AI | Privacy compliant health data as a service for AI development

Summary
Artificial intelligence (AI) enables data-driven innovations in health care. AI systems, which process vast amounts of data quickly and in detail, show promise both as a tool for preventive health care and clinical decision-making. However, the distributed storage and limited access to health data form a barrier to innovation, as developing trustworthy AI systems requires large datasets for training and validation. Furthermore, the availability of anonymous datasets would increase the adoption of AI-powered tools by supporting health technology assessments and education. Secure, privacy compliant data utilization is key for unlocking the full potential of AI and data analytics.

In this proposal, we will advance the current state-of-the-art data synthesis methods towards a more generalized approach of synthetic data generation. We will also develop metrics for testing and validation, as well as protocols that enable synthetic data generation without access to real-world data (through multi-party computation).

We aim to provide: 1) Improved methods and technical pipelines for privacy-preserving data synthesis including different data formats such as EHRs and medical images, 2) Easy to use and configurable data services to enable AI developers’ access to larger pools of decentralized de-identified data through multi-party computing, 3) Provide anonymous data on demand or from a (temporary) repository, 4) Establish a Data Market – facilitating data sharing and monetization incl. incentives-based provision of data to the services, 5) Integrate the data market and the data service ecosystem as a X-European health data hub in the European Health Data Space, and 6) Validate the results with real-world use-cases focusing on high impact diseases, cancer types in particular.
Unfold all
/
Fold all
More information & hyperlinks
Web resources: https://cordis.europa.eu/project/id/101095384
Start date: 01-10-2023
End date: 30-09-2026
Total budget - Public funding: 6 640 205,75 Euro - 6 640 205,00 Euro
Cordis data

Original description

Artificial intelligence (AI) enables data-driven innovations in health care. AI systems, which process vast amounts of data quickly and in detail, show promise both as a tool for preventive health care and clinical decision-making. However, the distributed storage and limited access to health data form a barrier to innovation, as developing trustworthy AI systems requires large datasets for training and validation. Furthermore, the availability of anonymous datasets would increase the adoption of AI-powered tools by supporting health technology assessments and education. Secure, privacy compliant data utilization is key for unlocking the full potential of AI and data analytics.

In this proposal, we will advance the current state-of-the-art data synthesis methods towards a more generalized approach of synthetic data generation. We will also develop metrics for testing and validation, as well as protocols that enable synthetic data generation without access to real-world data (through multi-party computation).

We aim to provide: 1) Improved methods and technical pipelines for privacy-preserving data synthesis including different data formats such as EHRs and medical images, 2) Easy to use and configurable data services to enable AI developers’ access to larger pools of decentralized de-identified data through multi-party computing, 3) Provide anonymous data on demand or from a (temporary) repository, 4) Establish a Data Market – facilitating data sharing and monetization incl. incentives-based provision of data to the services, 5) Integrate the data market and the data service ecosystem as a X-European health data hub in the European Health Data Space, and 6) Validate the results with real-world use-cases focusing on high impact diseases, cancer types in particular.

Status

SIGNED

Call topic

HORIZON-HLTH-2022-IND-13-02

Update Date

12-03-2024
Geographical location(s)
Structured mapping
Unfold all
/
Fold all
EU-Programme-Call
Horizon Europe
HORIZON.2 Global Challenges and European Industrial Competitiveness
HORIZON.2.1 Health
HORIZON.2.1.0 Cross-cutting call topics
HORIZON-HLTH-2022-IND-13
HORIZON-HLTH-2022-IND-13-02 Scaling up multi-party computation, data anonymisation techniques, and synthetic data generation
HORIZON.2.1.5 Tools, Technologies and Digital Solutions for Health and Care, including personalised medicine
HORIZON-HLTH-2022-IND-13
HORIZON-HLTH-2022-IND-13-02 Scaling up multi-party computation, data anonymisation techniques, and synthetic data generation