geocene / trainset Goto Github PK
View Code? Open in Web Editor NEWA lightweight web application for brushing labels onto time series data; useful for building training sets.
Home Page: https://trainset.geocene.com/
License: MIT License
A lightweight web application for brushing labels onto time series data; useful for building training sets.
Home Page: https://trainset.geocene.com/
License: MIT License
It's pretty hard to click and individual point - sometimes it seems like I am hovering over the right spot, but then I click and it doesn't work. Just because my curser changes from the crosshairs to the pointer, it doesn't mean that when I click I will hit the point under my pointer.
SELECTION
CLICK
: toggle label for a point
CLICK & DRAG
: apply TRUE label to selection
+SHIFT
: apply FALSE label
+OPTION
: centered selection
+CONTROL
: full Y selection
NAVIGATON
→
or ←
: pan
SHIFT
+ →
or ←
: fast pan
↑
or ↓
: zoom
CLICK
bottom context bar: unfocus
CLICK & DRAG
bottom context bar: adjust focus region
What happens when someone uploads a CSV where there are multiple values in the filename
column. Do we allow this? Is there a rational use case for the CSV to be named something other than what's in the filename
column? Do we want to support some case where someone uploads a CSV called all_my_files.csv
that has like 20 different filenames (I guess with non-overlapping timestamps?) in it?
Danny's option: the CSV's filename has the match the filename column, and all values in the filename column have to be the same.
Likely because line 84 in index.vue
doesn't take into account renaming of files after they have been labeled. Commenting out this line as a temporary fix works.
space
moves focus window to the right 90% of the window width
shift
+space
moves the focus window to the left 90% of the window width
Right now the chart resizing seems like more of a liability than a feature. If it takes less than 1 hour, let's fix it. If it take more than 1 hour, let's scrap it.
Make this available for download immediately below the "Upload" button and linked in the help docs.
sample_trainset.csv.zip
I should be able to easily see which file I am labeling while labeling - if I send a screenshot, the person I send it to should be able to know which file this is from.
Basically, if I hold CONTROL
, make it work the way the context bar works. SHIFT
+CONTROL
should deselect everything in the x region I brushed
The style is a little wonky, could use some better branding, spinners need help, and generally it just needs a little TLC. Geocene's style guide is here.
I partially labeled a 32k datapoint file, downloaded it, and tried to re-upload to continue labeling. Resulted in "Upload Failed | Make sure data is in the TRAINSET format. See help" error.
Default to TRUE brush, shift key held makes brush FALSE, get rid of INVERT brush
TRAINSET will hopefully be used by a lot of non-cookstove people. Name the brushes:
invert
label true
label false
if length(data) < 1000, then window size is min(length(data), 100)
if 1000<=length(data)<=10000, then window is 10% of the data points
if length(data)>10000, window size is 1000 data points
Make the color that purple instead of the red, and give it the same opacity as the black points
Have a box in the top right area that shows me the timestamp
and value
of the dot that my cursor is hovering over
Right move focus box right by 20%
Left move focus box left by 20%
Up zoom in 20%
Down zoom out 20%
Figure out a way to make it easy to make updates after it's deployed
If values in exported files are Null, import fails. Here's a sample file. How do we want to deal with this?
I cannot get any files to upload on fa3d6d0
All date strings are exported as UTC standard
Suffix is currently -lablr
. Change it to -labeled
.
On bigger files, like the 32k files, we can set the focus width (the selection in the context) to the entire width. This slows things down. Can we limit the width to a certain number of points?
Options:
something like this: crossfilter.github.io/crossfilter/ (geocene.github.io/trainset?)
or: trainset.availabledomain
or: trainset.geocene.com
or: trainset.sumsarizer.com
or: deploy it yourself, bum! (Danny doesn't like this option)
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.