The Apache Mahout™ project's goal is to build an environment for quickly creating scalable performant machine learning applications.
For additional information about Mahout, visit the Mahout Home Page
####Setting up your Environment
Whether you are using Mahout's Shell, running command line jobs or using it as a library to build your own apps you'll need to setup several environment variables. Edit your environment in ~/.bash_profile
for Mac or ~/.bashrc
for many linux distributions. Add the following
export MAHOUT_HOME=/path/to/mahout
export MAHOUT_LOCAL=true # for running standalone on your dev machine,
# unset MAHOUT_LOCAL for running on a cluster
You will need a $JAVA_HOME
, and if you are running on Spark, you will also need $SPARK_HOME
Note when running the spark-shell job it can help to set some JVM options so you don't run out of memory:
$MAHOUT_OPTS="-Xmx6g -XX:MaxPermSize=512m" mahout spark-shell
####Using Mahout as a Library Running any application that uses Mahout will require installing a binary or source version and setting the environment. To compile from source:
mvn -DskipTests clean install
- To run tests do
mvn test
- To set up your IDE, do
mvn eclipse:eclipse
ormvn idea:idea
To use maven, add the appropriate setting to your pom.xml or build.sbt following the template below.
To use the Samsara environment you'll need to include both the engine neutral math-scala dependency:
<dependency>
<groupId>org.apache.mahout</groupId>
<artifactId>mahout-math-scala_2.10</artifactId>
<version>${mahout.version}</version>
</dependency>
and a dependency for back end engine translation, e.g:
<dependency>
<groupId>org.apache.mahout</groupId>
<artifactId>mahout-spark_2.10</artifactId>
<version>${mahout.version}</version>
</dependency>
<dependency>
<groupId>org.apache.mahout</groupId>
<artifactId>mahout-flink_2.10</artifactId>
<version>${mahout.version}</version>
</dependency>
Note that due to an intermittent out-of-memory bug in a Flink test we have disabled it from the binary releases. To use Flink please uncomment the line in the root pom.xml in the <modules>
block so it reads <module>flink</module>
.
####Examples
For examples of how to use Mahout, see the examples directory located in examples/bin
For information on how to contribute, visit the How to Contribute Page
####Legal
Please see the NOTICE.txt
included in this directory for more information.
mahout's People
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.