Wed May 04 2022

Cohort Augmentation

It is often said that a Data Scientist's job is 20% modeling and 80% data cleaning. Every dataset you work with presents its own challenges, most of which come down to whether the dataset itself is feasible to use. One of the sub-questions we ask as part of this is: are there enough usable examples for our predictive models to generalize across wider populations?