Hugging Face's community-driven open-source library for evaluation
Evaluate is a library that makes evaluating and comparing models and reporting their performance easier and more standardized.
It currently contains:
metrics: with a single command such as accuracy = load("accuracy"), any of these metrics is ready to use for evaluating an ML model in any framework (NumPy/Pandas/PyTorch/TensorFlow/JAX).
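As a minimal sketch of what loading and using a metric can look like, the snippet below wraps the `load("accuracy")` call in a function and pairs it with a plain-Python accuracy computation for comparison. The function names `library_accuracy` and `manual_accuracy` and the sample labels are illustrative assumptions, not part of the library. The library path assumes the `evaluate` package is installed and that the metric can be fetched on first use.

```python
def manual_accuracy(predictions, references):
    """Plain-Python accuracy: fraction of predictions equal to their reference."""
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    matches = sum(p == r for p, r in zip(predictions, references))
    return matches / len(references)


def library_accuracy(predictions, references):
    """Same computation through Evaluate (hypothetical wrapper for illustration)."""
    # Imported here so the rest of the sketch works without the package installed.
    import evaluate

    accuracy = evaluate.load("accuracy")
    result = accuracy.compute(predictions=predictions, references=references)
    return result["accuracy"]


# Illustrative labels (made up for this sketch).
sample_predictions = [1, 0, 0, 1]
sample_references = [0, 1, 0, 1]
manual_score = manual_accuracy(sample_predictions, sample_references)
```

Calling `library_accuracy(sample_predictions, sample_references)` should give the same score as `manual_accuracy` for these inputs, since both count the share of matching labels.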