Harnessing Machine Learning for Scalable Dataset Expansion