You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 31 Next »

Getting Started with Hama on Mesos

Requirements

In order to run Hama on Mesos it is required that Mesos already be installed on the cluster. Instructions to set up Mesos may be found at the project website: http://mesos.apache.org/.

Building

Be sure to build Hama with your version of Mesos specified:

mvn clean install -Phadoop1 -Dhadoop.version=1.x.x -Dmesos.version=0.18.2

The groom servers will be set up by Mesos during execution of a job. In order for Mesos to do this, you will need to upload a built version of Hama to a place where the Mesos slaves can find it, such as the HDFS:

hadoop fs -put hama-7.0.0.tar.gz /hama-7.0.0.tar.gz

Configuration

There are several properties that must be set in hama-site.xml:

<property>
  <name>bsp.master.TaskWorkerManager.class</name>
  <value>org.apache.hama.bsp.MesosScheduler</value>
  <description>
    This tells the scheduler to use Mesos 
    in order to execute tasks of each job
  </description>
</property>
<property>
  <name>hama.mesos.executor.uri</name>
  <value>hdfs://hdfs.name.node:port/hama.tar.gz</value>
  <description> 
     This is the URI of the Hama distribution. Upload this yourself. 
  </description>
</property>
<property>
  <name>bsp.tasks.maximum.total</name>
  <value>10</value>
  <description>
    This is an override for the total maximum tasks that may be run.
    The default behavior is to determine a value based on the available groom
    servers. However, if using Mesos, the groom servers are not yet allocated. 
    So, a value indicating the maximum number of slots available 
    in the cluster is needed. 
  </description>
</property>
<property>
  <name>hama.mesos.master</name>
  <value>local</value>
  <description>
    This is the address of the Mesos master instance. If you're using
    Zookeeper for master election, use the Zookeeper address here (i.e.,
    zk://zk.apache.org:2181/hadoop/mesos).
  </description>
</property>

Hama requires one cpu and memory defined by bsp.child.java.opts for each slot. This means that a cluster with bsp.tasks.maximum.total set to 2 and bsp.child.jova.opts set to -Xmx1024m will need at least 2 cpus and and 2048m of memory.

Manually distributing the configuration is not necessary. Hama and Mesos will distribute the configuration and provide overrides where necessary.

Starting The BSPMaster

With Hama on Mesos you only need to set up and start the BSPMaster. After setting the configuration the bsp master may be started:

% $HAMA_HOME/bin/hama-daemon.sh start bspmaster

Testing

In order to test the setup, run an example from Hama:

$HAMA_HOME/bin/hama jar hama-examples-x.x.x.jar gen fastgen 100 10 randomgraph 2

Then verify that Mesos tasks are being created run. Do this by navigating to the Mesos status page and clicking on the id of the Hama framework. There should be tasks listed in either the active or completed tasks section.

Please email any questions to user@hama.apache.org

  • No labels