lvdmaaten / lvdmaaten.github.io Goto Github PK
View Code? Open in Web Editor NEWWebsite of Laurens van der Maaten
License: MIT License
Website of Laurens van der Maaten
License: MIT License
Dear Dr. van der Maaten:
Could you help me enhance my understanding of how the perplexity parameter works. There are two questions.
Looking at the implementation, do I get it right that a reasonable upper bound on perplexity is equal to 1/3 of the minimal expected cluster size (for simplicity, assume we know what cluster sizes to expect).
On your home page, there is a question (“I get a strange ‘ball’ with uniformly distributed points”) and your suggestion is to reduce perplexity. Do you think the same “ball” effect can be see when perplexity is too low? If yes, how do you suggest we define a lower bound for perplexity?
Regarding 2), I have this digit images data set with 40,000 points that is supposed to contain 10 clusters of about the same size. When I subsample 2000 points and run default Rtsne (its implementation is very similar to yours) the embedding looks nice. However, it is far worse on the full data set. I figured it was because the default perplexity of 30 was too low compared to the typical cluster size, 4000, so I reset it to 30*20 = 600 and obtained a very nice embedding.
When the expected result is unknown, I guess one could try to use a similar subsampling approach to figure out how to increase perplexity. I was wondering if you know of a more analytical method or a rule of thumb.
Regards,
Nik Tuzov, PhD
Hi,
i have discovered an issue in your website https://lvdmaaten.github.io .
When used HTTPS than the CSS files won´t get loaded because they are included via HTTP. I think this could be fixed when using https instead of http in your url-attribute in _config.yml.
regards,
David
EDIT: I'm sorry, I misunderstood the line, your code is correct. You can ignore/adapt this issue as you like.
Hello,
In the python-version of the t-SNE implementation (lvdmaaten.github.io/tsne/code/tsne_python.zip), there is a small bug.
On line 148 it says (gains + 0.2)
instead of (gains * 0.2)
.
Sincerely,
Simon
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.