Achieving Data Excellence in ML Research