phurwicz / hover

:speedboat: Label data at scale. Fun and precision included.

Home Page: https://phurwicz.github.io/hover

License: MIT License

Python 85.44% Jupyter Notebook 14.56%
visualization machine-learning bokeh data-labeling supervised-learning bulk-labeling text-classification image-classification audio-classification annotation-tool

Hover

Explore and label on a map of your data.

Get enough to feed your model in no time.

hover is a tool for mass-labeling data points that can be represented by vectors.

  • Labeling is as easy as coloring a scatter plot.
  • Hover your mouse and lasso-select to inspect any cluster.
  • Use a variety of widgets to narrow down further.
  • Enter a suitable label and hit "Apply"!
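The select-then-label workflow above can be sketched without hover itself. This is a minimal illustration, assuming every data point already has 2D coordinates. `point_in_polygon` and `apply_label` are hypothetical helpers, not part of hover's API, and the ray-casting test only stands in for whatever a lasso tool does internally.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]


def point_in_polygon(pt: Point, polygon: Sequence[Point]) -> bool:
    """Ray-casting test; points exactly on the boundary are not treated specially."""
    x, y = pt
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Does this edge straddle the horizontal line through pt?
        if (y1 > y) != (y2 > y):
            # x-coordinate where the edge crosses that line
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def apply_label(
    points: Sequence[Point],
    labels: Sequence[str],
    polygon: Sequence[Point],
    new_label: str,
) -> List[str]:
    """Return a copy of labels with new_label on every point inside polygon."""
    return [
        new_label if point_in_polygon(p, polygon) else old
        for p, old in zip(points, labels)
    ]


# Hypothetical data: two clusters and a square "lasso" around the first one.
points = [(0.1, 0.2), (0.3, 0.1), (2.0, 2.1), (2.2, 1.9)]
labels = ["UNLABELED"] * len(points)
lasso = [(-1.0, -1.0), (1.0, -1.0), (1.0, 1.0), (-1.0, 1.0)]

labels = apply_label(points, labels, lasso, "positive")
print(labels)
```

One selection labels every point inside it at once, which is where the bulk-labeling speedup comes from.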

🚀 Live Demos

  • Edit and run code in your browser to get a labeling interface, with guides along the way.
  • Go directly to an example labeling interface hosted on Binder.

✨ Features

It's fast because it labels data in bulk.

🔭 A semantic scatter plot of your data for labeling, equipped with

  • Tooltip for each point on mouse hover
  • Table view for inspecting selected points
  • Toggle buttons that clearly distinguish data subsets
  • Search widgets for ad-hoc data highlight

It's accurate because multiple components work together.

🔬 Supplementary views to use in conjunction with the annotator, including

  • `Finder`: filter data by search criteria
  • `SoftLabel`: active learning by in-the-loop model prediction score
  • `Snorkel`: custom functions for labeling and filtering
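The three ideas behind these views can be illustrated in plain Python. This is a rough sketch, not hover's API: the record layout and the names `find`, `in_score_range`, `lf_mentions_died`, and `apply_labeling_function` are all hypothetical, and the scores are made-up stand-ins for model predictions.

```python
import re
from typing import Callable, Dict, List, Optional

Record = Dict[str, object]

# Hypothetical records: text, a model prediction score, and a label slot.
records: List[Record] = [
    {"text": "great battery life", "score": 0.92, "label": None},
    {"text": "battery died in a day", "score": 0.35, "label": None},
    {"text": "screen is okay", "score": 0.55, "label": None},
]


def find(records: List[Record], pattern: str) -> List[Record]:
    """Search-style filter: keep records whose text matches a regex."""
    regex = re.compile(pattern, flags=re.IGNORECASE)
    return [r for r in records if regex.search(str(r["text"]))]


def in_score_range(records: List[Record], low: float, high: float) -> List[Record]:
    """Score-style filter: keep records whose model score falls in [low, high]."""
    return [r for r in records if low <= float(r["score"]) <= high]


def lf_mentions_died(record: Record) -> Optional[str]:
    """A custom labeling function: vote 'negative' or abstain (None)."""
    return "negative" if "died" in str(record["text"]) else None


def apply_labeling_function(
    records: List[Record], lf: Callable[[Record], Optional[str]]
) -> int:
    """Label records in place wherever lf does not abstain; return the count."""
    count = 0
    for r in records:
        label = lf(r)
        if label is not None:
            r["label"] = label
            count += 1
    return count


print([r["text"] for r in find(records, "battery")])
print([r["text"] for r in in_score_range(records, 0.4, 0.6)])
print(apply_labeling_function(records, lf_mentions_died))
```

Filtering to a mid-range score band is one common way to surface the points a model is least sure about, which is the usual motivation for putting a model in the labeling loop.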

It's flexible (and fun!) because the process never gets old.

🧰 Additional tools and options that allow you to

  • Go to higher dimensions (3D? 4D?) and choose your xy-axes
  • Consecutively select across areas, dimensions, and views
  • Kick outliers and fix mistakes

📦 Install

Python: 3.8+

OS: Linux & Mac & Windows

PyPI: `pip install hover`

Conda: `conda install -c conda-forge hover`
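
After installing, a quick sanity check can confirm the interpreter meets the 3.8+ requirement and that the package is visible. This is a small sketch using only the standard library; `installed_version` is a hypothetical helper, and the distribution name `hover` is taken from the install commands above.

```python
import sys
from importlib import metadata
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


python_ok = sys.version_info >= (3, 8)
hover_version = installed_version("hover")

print(f"Python 3.8+: {python_ok}")
print(f"hover version: {hover_version or 'not installed'}")
```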

📖 Resources

๐ŸŽ Announcements

  • Jan 21, 2023: version 0.8.0 is now available. Check out the changelog for details 🥳.

🔔 Remarks

Shoutouts

  • Thanks to Bokeh because hover would not exist without linked plots and callbacks, or be nearly as good without embeddable server apps.
  • Thanks to Philip Vollet for sharing hover with the community even when it was really green.

Contributing

  • All feedback is welcome, especially about what you find lacking and want fixed!
  • ./requirements-dev.txt lists required packages for development.
  • Pull requests are advised to use a superset of the pre-commit hooks listed in .pre-commit-config.yaml.

Citation

If you have found hover useful to your work, please let us know 🤗

@misc{hover,
  title={{hover}: label data at scale},
  url={https://github.com/phurwicz/hover},
  note={Open software from https://github.com/phurwicz/hover},
  author={
    Pavel Hurwicz and
    Haochuan Wei},
  year={2021},
}
