Beyond Accuracy: Grounding Evaluation Metrics for Human-Machine Learning Systems

NeurIPS 2020