MoDynStruct | The design and evaluation of modern fully dynamic data structures

Summary
Many real-world data sets change continuously, but their enormous size prohibits frequent re-processing of the whole data. Thus, there is an urgent need for efficient, fully dynamic data structures that maintain properties of the data set while supporting fast insertions and deletions. This is especially important for problems in data mining and network analysis, where a data structure often needs to fulfill new additional constraints that are not supported by classic data structures: (1) It should only use sublinear space, even if this leads to some small error in the answers. (2) As data sets frequently contain private information which needs to be protected, it should reveal nothing about individual data points, which is often modeled through differential privacy. Our ambitious goal is to design such groundbreaking new fully dynamic data structures for central problems on graphs and point sets.

Specifically, we will focus on problems with large practical relevance such as subgraph detection, k-core decomposition, and balanced graph partitioning as well as various clustering variants in general metric spaces. For these problems no fully dynamic data structures with small asymptotic running time are known and they have not even been studied in the small-space or differentially-private regime. However, using recent advanced in algorithms research it is now the right time to develop novel techniques to solve these challenging questions.

Thus, the goal of this project is to design algorithms for highly-relevant problems as well as advancing the field of data structures in general by moving it from a narrow focus on asymptotic complexity to a broader set of modern requirements with the goal of bridging the gap that currently exists between theory and practice. As data structures are used by every computer program the impact of this work will be far-reaching. With over 30 years of experience in algorithms research the PI is in the unique position to do so.
Unfold all
/
Fold all
More information & hyperlinks
Web resources: https://cordis.europa.eu/project/id/101019564
Start date: 01-01-2022
End date: 31-12-2026
Total budget - Public funding: 2 477 371,00 Euro - 2 477 371,00 Euro
Cordis data

Original description

Many real-world data sets change continuously, but their enormous size prohibits frequent re-processing of the whole data. Thus, there is an urgent need for efficient, fully dynamic data structures that maintain properties of the data set while supporting fast insertions and deletions. This is especially important for problems in data mining and network analysis, where a data structure often needs to fulfill new additional constraints that are not supported by classic data structures: (1) It should only use sublinear space, even if this leads to some small error in the answers. (2) As data sets frequently contain private information which needs to be protected, it should reveal nothing about individual data points, which is often modeled through differential privacy. Our ambitious goal is to design such groundbreaking new fully dynamic data structures for central problems on graphs and point sets.

Specifically, we will focus on problems with large practical relevance such as subgraph detection, k-core decomposition, and balanced graph partitioning as well as various clustering variants in general metric spaces. For these problems no fully dynamic data structures with small asymptotic running time are known and they have not even been studied in the small-space or differentially-private regime. However, using recent advanced in algorithms research it is now the right time to develop novel techniques to solve these challenging questions.

Thus, the goal of this project is to design algorithms for highly-relevant problems as well as advancing the field of data structures in general by moving it from a narrow focus on asymptotic complexity to a broader set of modern requirements with the goal of bridging the gap that currently exists between theory and practice. As data structures are used by every computer program the impact of this work will be far-reaching. With over 30 years of experience in algorithms research the PI is in the unique position to do so.

Status

SIGNED

Call topic

ERC-2020-ADG

Update Date

27-04-2024
Geographical location(s)
Structured mapping
Unfold all
/
Fold all
EU-Programme-Call
Horizon 2020
H2020-EU.1. EXCELLENT SCIENCE
H2020-EU.1.1. EXCELLENT SCIENCE - European Research Council (ERC)
ERC-2020
ERC-2020-ADG ERC ADVANCED GRANT