GithubHelp home page GithubHelp logo

vaibhavpatil123 / evaluate Goto Github PK

View Code? Open in Web Editor NEW

This project forked from huggingface/evaluate

0.0 0.0 0.0 2.17 MB

๐Ÿค— Evaluate: A library for easily evaluating machine learning models and datasets.

Home Page: https://huggingface.co/docs/evaluate

License: Apache License 2.0

Python 99.94% Makefile 0.06%

evaluate's Introduction



Build GitHub Documentation GitHub release Contributor Covenant

๐Ÿค— Evaluate is a library that makes evaluating and comparing models and reporting their performance easier and more standardized.

It currently contains:

  • implementations of dozens of popular metrics: the existing metrics cover a variety of tasks spanning from NLP to Computer Vision, and include dataset-specific metrics for datasets. With a simple command like accuracy = load("accuracy"), get any of these metrics ready to use for evaluating a ML model in any framework (Numpy/Pandas/PyTorch/TensorFlow/JAX).
  • comparisons and measurements: comparisons are used to measure the difference between models and measurements are tools to evaluate datasets.
  • an easy way of adding new evaluation modules to the ๐Ÿค— Hub: you can create new evaluation modules and push them to a dedicated Space in the ๐Ÿค— Hub with evaluate-cli create [metric name], which allows you to see easily compare different metrics and their outputs for the same sets of references and predictions.

๐ŸŽ“ Documentation

๐Ÿ”Ž Find a metric, comparison, measurement on the Hub

๐ŸŒŸ Add a new evaluation module

๐Ÿค— Evaluate also has lots of useful features like:

  • Type checking: the input types are checked to make sure that you are using the right input formats for each metric
  • Metric cards: each metrics comes with a card that describes the values, limitations and their ranges, as well as providing examples of their usage and usefulness.
  • Community metrics: Metrics live on the Hugging Face Hub and you can easily add your own metrics for your project or to collaborate with others.

Installation

With pip

๐Ÿค— Evaluate can be installed from PyPi and has to be installed in a virtual environment (venv or conda for instance)

pip install evaluate

Usage

๐Ÿค— Evaluate's main methods are:

  • evaluate.list_evaluation_modules() to list the available metrics, comparisons and measurements
  • evaluate.load(module_name, **kwargs) to instantiate an evaluation module
  • results = module.compute(*kwargs) to compute the result of an evaluation module

Adding a new evaluation module

First install the necessary dependencies to create a new metric with the following command:

pip install evaluate[template]

Then you can get started with the following command which will create a new folder for your metric and display the necessary steps:

evaluate-cli create "Awesome Metric"

See this step-by-step guide in the documentation for detailed instructions.

Credits

Thanks to @marella for letting us use the evaluate namespace on PyPi previously used by his library.

evaluate's People

Contributors

lhoestq avatar lvwerra avatar albertvillanova avatar mariosasko avatar thomwolf avatar mathemakitten avatar lewtun avatar patrickvonplaten avatar fxmarty avatar mishig25 avatar stevhliu avatar sbrandeis avatar emibaylor avatar meg-huggingface avatar severo avatar philschmid avatar douwekiela avatar yjernite avatar sgugger avatar jplu avatar nimaboscarino avatar ricardorei avatar bryant1410 avatar lysandrejik avatar idoh avatar sanderland avatar richarddwang avatar osanseviero avatar ola13 avatar manueldeprada avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.