Comments (7)
from hover.
Work in progress: first implement a vector-based search. Then try to upgrade to structural similarity.
If structural similarity works well, we can do the same for audio in its MFCC format.
from hover.
Update: consider an abstract vector search engine, or an abstract similarity-based search engine.
from hover.
Hi, I'm using hover to annote a dataset of image, and I encounter a small issue regarding the visualization of the image.
If the resolution is higher than the row width or row height define for the table we don't visualize very well.
To fix this you can just change in hover/core/local_config.py Line 60:
template="""<img src=<%= value %>>""",
To:
template="""<img src=<%= value % width="200" height="200">>""",
It will automatically resize the image in a 200x200 and so you can see image with for instance a resolution of 720x720
I can do a PR for that if you want but the modifications is very small
from hover.
Hi, @FlorianBertonBrightClue thank you for using hover and bringing this up!
In some use cases 200x200 can be difficult to see clearly. Actually we can make it configurable on the user side.
The code below should work for the upcoming 0.8.0 version, which is likely within a week.
hover.config["visual"]["table_image_width"] = 200
hover.config["visual"]["table_image_height"] = 200
Does this look good?
from hover.
And so at line 57/58 in local.config.py, you will put this ?
feature_col_kwargs["formatter"] = HTMLTemplateFormatter(
template=f'<img src=<%= value %> width="{hover.config["visual"]["table_image_width"]}" height="{hover.config["visual"]["table_image_height"]} >',
)
If yes, it should work and the user could configure it as ha wants by setting the hover.config.
I also have two questions for you :
-
It seems for now that we can't set the parameters for the DimensionalityReducer, would it be possible later ? Like for instance in umap you can choose the number of neighbor or the minimum distance.
-
Can we change the label for data that are already in train ? In my case I did some prelabelling and sometimes I want to change the label because two clusters are close one to each other and finally and want to merge them together
from hover.
And so at line 57/58 in local.config.py, you will put this ? feature_col_kwargs["formatter"] = HTMLTemplateFormatter( template=f'<img src=<%= value %> width="{hover.config["visual"]["table_image_width"]}" height="{hover.config["visual"]["table_image_height"]} >', )
Basically yes. This line reads the config only once though, so be sure to configure immediately after import hover
.
Customize DimensionalityReducer
- It seems for now that we can't set the parameters for the DimensionalityReducer, would it be possible later ? Like for instance in umap you can choose the number of neighbor or the minimum distance.
Technically you can. With dataset.compute_nd_embedding()
you can pass in keyword arguments that umap accepts. Hover attempts to “translate” crucial kwargs for compatibility (like “dimension” to different equivalents in umap and ivis) but will forward the rest.
This could be much better documented though.
Edit committed labels
- Can we change the label for data that are already in train ? In my case I did some prelabelling and sometimes I want to change the label because two clusters are close one to each other and finally and want to merge them together
You can do this a few ways depending on which one is convenient:
- in the selection table (the one where large images don’t show well right now), make edits in the label column and save the edits.
- access the underlying dataframe with
dataset.dfs["train"]
. - export to file, edit and load back.
You cannot change train labels directly the same way you label raw data in the scatter plot. “Commit” locks in the subset and label unless you take the “backdoors” above. This is to prevent mis-relabeling labeled data that happen to be (often for good reasons) mixed into in a selection.
from hover.
Related Issues (20)
- Add support for higher dimensionality than 2 HOT 4
- Should vectorizer be limited to returning 1-D arrays only? HOT 1
- Rework rule-based labeling function mechanism HOT 1
- Getting GCC compile error for installation on MacOS HOT 10
- Checklist: 0.6.0 release
- Phase out soft label / denoising components HOT 2
- Checklist: 0.7.0 release HOT 1
- Scalable Recipe Tests
- Feedback on Quickstart HOT 3
- QuickStart via Docker Container HOT 19
- bulk labeling selection should automatically update "view selection" HOT 7
- multi labeling HOT 2
- Setting custom colors HOT 5
- hover configuration for constants HOT 7
- Session not accessible on browser refresh HOT 2
- Enable tooltips in selection table HOT 3
- Accepting Huggingface transformers HOT 2
- Add an FAQ page to the docs
- Associating non_feature data with feature_key HOT 3
- Use of semi-supervised fit step for Umap and ivis HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from hover.