apache mahout. prerequisites for building mahout java jdk 1.6 maven 3.0 or higher ( ). subversion...

13
Apache Mahout

Upload: brenda-kelley

Post on 13-Dec-2015

229 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Apache Mahout. Prerequisites for Building MAHOUT Java JDK 1.6 Maven 3.0 or higher (  ). Subversion (optional)

Apache Mahout

Page 2: Apache Mahout. Prerequisites for Building MAHOUT Java JDK 1.6 Maven 3.0 or higher (  ). Subversion (optional)

Prerequisites for Building MAHOUT

• Java JDK 1.6• Maven 3.0 or higher (

http://maven.apache.org/ ).• Subversion (optional)• Hadoop (used by Mahout)• Mahout (0.9)

Page 3: Apache Mahout. Prerequisites for Building MAHOUT Java JDK 1.6 Maven 3.0 or higher (  ). Subversion (optional)

Install maven 3

1. Open terminal ( Ctrl + Alt + T ).2. Install maven 3.

sudo apt-get install maven 3.

3. Verify that the installation is successful.mvn -v

Page 4: Apache Mahout. Prerequisites for Building MAHOUT Java JDK 1.6 Maven 3.0 or higher (  ). Subversion (optional)

ssh-key

• Setup ssh server– $sudo apt-get install openssh-server

• Setup Public/private key for accessing serverwithout passwd

– $ssh-keygen –t rsa– $cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys– $ssh localhost– $exit

Page 5: Apache Mahout. Prerequisites for Building MAHOUT Java JDK 1.6 Maven 3.0 or higher (  ). Subversion (optional)

Hadoop

• To install Hadoop on your local box, go to http://www.apache.org/dyn/closer.cgi/hadoop/common/ and download hadoop

• Uncompress the archive:tar xvfz hadoop-1.1.2-bin.tar.gz

• Edit the file conf/hadoop-env.sh and add the following line:

export JAVA_HOME=<JDK DIRECTORY>

Page 6: Apache Mahout. Prerequisites for Building MAHOUT Java JDK 1.6 Maven 3.0 or higher (  ). Subversion (optional)

Hadoop

To configure HDFS, edit the file conf/core-site.xml and add the following property in configuration:

<configuration> <property> <name>fs.default.name</name> <value>hdfs://localhost:9000</value> </property></configuration>

Page 7: Apache Mahout. Prerequisites for Building MAHOUT Java JDK 1.6 Maven 3.0 or higher (  ). Subversion (optional)

Hadoop

Then format the HDFS filesystem:hadoop namenode -format

Run hadoopstart-all.sh

Page 8: Apache Mahout. Prerequisites for Building MAHOUT Java JDK 1.6 Maven 3.0 or higher (  ). Subversion (optional)

Mahout

• Change the directory where mahout is located cd mahout

• Build mahout mvn install

Page 9: Apache Mahout. Prerequisites for Building MAHOUT Java JDK 1.6 Maven 3.0 or higher (  ). Subversion (optional)

Mahout

Page 10: Apache Mahout. Prerequisites for Building MAHOUT Java JDK 1.6 Maven 3.0 or higher (  ). Subversion (optional)

Test

• copy the file u.data to HDFS:

• To run the mahout recommender, type:

Page 11: Apache Mahout. Prerequisites for Building MAHOUT Java JDK 1.6 Maven 3.0 or higher (  ). Subversion (optional)

Test

Page 12: Apache Mahout. Prerequisites for Building MAHOUT Java JDK 1.6 Maven 3.0 or higher (  ). Subversion (optional)

Output

Page 13: Apache Mahout. Prerequisites for Building MAHOUT Java JDK 1.6 Maven 3.0 or higher (  ). Subversion (optional)

Output