"Manuscripts for (a) machine learning-based extracted disease phenotypes and sub-phenotypes and new outcomes which have been technically validated; and (b) methodology for ""big data""-driven disease definitions"