Data selection, integration, and simulation services

Summary
This deliverable will provide the first versions of: (a) automated dataset selection strategies and feature augmentation methods that recommend the datasets that are fit for purpose for the current analytics task; (b) analysis-aware data integration and quality assurance services that support the interactive application of cleaning, interlinking, and enrichment methods to data gathered from various dispersed sources; and (c) data augmentation and simulation techniques that detect which types and ranges of data are missing from the datasets, so that new data entries can be generated to balance the datasets using various augmentation techniques, and/or simulation models can be deployed to produce new data. It will include prototype libraries and/or tools and will document their implementation and usage.
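As an illustration of item (c) only, the following minimal sketch shows one simple way to detect under-represented categories in a dataset and balance them by random oversampling. It is not the deliverable's implementation: the function names (find_underrepresented, oversample), the tolerance parameter, and the choice of random duplication as the augmentation technique are all assumptions made for this example.

```python
import random
from collections import Counter


def find_underrepresented(labels, tolerance=0.5):
    """Map each sparse label to the number of entries it lacks.

    A label counts as sparse when it has fewer than `tolerance` times
    the entries of the most frequent label (hypothetical criterion).
    """
    counts = Counter(labels)
    largest = max(counts.values())
    return {
        label: largest - n
        for label, n in counts.items()
        if n < tolerance * largest
    }


def oversample(rows, label_of, seed=0):
    """Duplicate randomly chosen rows until every label is equally frequent.

    Random duplication stands in here for whatever augmentation or
    simulation technique would actually generate the new entries.
    """
    rng = random.Random(seed)
    by_label = {}
    for row in rows:
        by_label.setdefault(label_of(row), []).append(row)
    largest = max(len(group) for group in by_label.values())
    balanced = []
    for group in by_label.values():
        balanced.extend(group)
        balanced.extend(rng.choice(group) for _ in range(largest - len(group)))
    return balanced


# Toy dataset: 8 rows labelled "x" and 2 rows labelled "y".
rows = [("x", i) for i in range(8)] + [("y", i) for i in range(2)]
missing = find_underrepresented([label for label, _ in rows])
balanced = oversample(rows, label_of=lambda row: row[0])
```

In this toy run, missing reports that label "y" lacks 6 entries, and balanced holds 8 rows for each label. A real service would replace the duplication step with a domain-appropriate augmentation or simulation model.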