Topix is a simple website for doing even simpler Bayesian style text analysis on topics.
To start Topix just use Leiningen.
lein run
Then navigate to http://localhost:8001
All settings are made through environment variables, you may need to restart Topix for some of these to take effect. Here are the settings with their defaults.
TOPIX_PORT=8001
TOPIX_MONGO_DB=topix
To load Topix with training data use the topix-trainer project.
https://github.com/rodnaph/topix-trainer
Topix is very early in development, and I'm just starting to play with tweaking the matching, but early results still show favour over the correct categories.
Hopefully with more tweaking like multi-word matching, root stemming, and more training results will improve further.
Distributed under the Eclipse Public License, the same as Clojure.