Comparison of datasets in machine learning