GithubHelp home page GithubHelp logo

geocene / trainset Goto Github PK

View Code? Open in Web Editor NEW
156.0 156.0 34.0 21.09 MB

A lightweight web application for brushing labels onto time series data; useful for building training sets.

Home Page: https://trainset.geocene.com/

License: MIT License

HTML 0.85% Vue 38.38% JavaScript 60.76%
brushing labeling labeling-tool machine-learning painting time-series-classification training-data

trainset's People

Contributors

ajaypillarisetti avatar broekhof avatar clagms avatar daterdots avatar dependabot[bot] avatar jeremyrcoyle avatar rafnuss avatar rushk014 avatar zekeshearer avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

trainset's Issues

Make it easier to click an individual point

It's pretty hard to click and individual point - sometimes it seems like I am hovering over the right spot, but then I click and it doesn't work. Just because my curser changes from the crosshairs to the pointer, it doesn't mean that when I click I will hit the point under my pointer.

Add shortcut key legend below chart

SELECTION
CLICK: toggle label for a point
CLICK & DRAG: apply TRUE label to selection
+SHIFT: apply FALSE label
+OPTION: centered selection
+CONTROL: full Y selection

NAVIGATON
or : pan
SHIFT + or : fast pan
or : zoom
CLICK bottom context bar: unfocus
CLICK & DRAG bottom context bar: adjust focus region

Does the parser allow more than one filename?

What happens when someone uploads a CSV where there are multiple values in the filename column. Do we allow this? Is there a rational use case for the CSV to be named something other than what's in the filename column? Do we want to support some case where someone uploads a CSV called all_my_files.csv that has like 20 different filenames (I guess with non-overlapping timestamps?) in it?

Danny's option: the CSV's filename has the match the filename column, and all values in the filename column have to be the same.

Title the chart with the filename

I should be able to easily see which file I am labeling while labeling - if I send a screenshot, the person I send it to should be able to know which file this is from.

Enable full Y brushing

Basically, if I hold CONTROL, make it work the way the context bar works. SHIFT+CONTROL should deselect everything in the x region I brushed

Import of partially labeled file fails

I partially labeled a 32k datapoint file, downloaded it, and tried to re-upload to continue labeling. Resulted in "Upload Failed | Make sure data is in the TRAINSET format. See help" error.

Rename brushes

TRAINSET will hopefully be used by a lot of non-cookstove people. Name the brushes:

invert
label true
label false

Change export-to-upload workflow

Right now I press export->upload->home page upload->finder->label, but I shouldn't have to go back to the homepage. When I press this upload button, I should be able to upload the new file:

image

Add cursor location feature

Have a box in the top right area that shows me the timestamp and value of the dot that my cursor is hovering over

Don't wrap banner

Let's remove this wrapping feature, or at least only make it show up when the window gets very small:
image

Add some padding on y axis to make brushing extremes easier

Right now it looks like the y axis goes exactly from data_min to data_max. This can make it really hard to click or brush points near the extremes. Let's draw the y-axis this way: y_max = data_max + (data_max - data_min) * 0.05 and y_min = data_min - (data_max - data_min) * 0.05

image

Set focus width to ~2k data points

On bigger files, like the 32k files, we can set the focus width (the selection in the context) to the entire width. This slows things down. Can we limit the width to a certain number of points?

Where should we host this for easy use by folks?

Options:
something like this: crossfilter.github.io/crossfilter/ (geocene.github.io/trainset?)
or: trainset.availabledomain
or: trainset.geocene.com
or: trainset.sumsarizer.com
or: deploy it yourself, bum! (Danny doesn't like this option)

Upload failed on 3E5 data points

Is one of the format rules that the dataset needs to have fewer rows than a certain threshold, or is this 3E5 dummy_input dataset improperly formatted?

image

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.