Robust training of machine learning interatomic potentials with dimensionality reduction and stratified sampling - npj Computational Materials