This article discusses the importance of high-quality datasets in machine learning-assisted high-throughput screening for MOFs with biogas upgrading properties. The authors use a curated dataset to develop a highly accurate ML model and gain insight into the features of high-performing MOFs.