This document assumes that you have already installed a Hama cluster and tested it with some of the examples.

PageRank

  • Uses the PageRank algorithm described in the Google Pregel paper
  • Introduces partitioning and collective communication
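As a rough illustration of the algorithm named above, the following is a minimal single-process sketch in plain Python. It does not use the Hama API. The function name `pagerank`, the dictionary-based graph, and the two-phase loop are assumptions made for illustration. The damping factor of 0.85 and the 30 supersteps follow the PageRank example in the Pregel paper.

```python
def pagerank(out_edges, supersteps=30, damping=0.85):
    """Pregel-style PageRank over a dict {vertex: [neighbour, ...]}.

    Assumes every neighbour also appears as a key of out_edges.
    """
    n = len(out_edges)
    # Every vertex starts with the same rank, 1 / number of vertices.
    rank = {v: 1.0 / n for v in out_edges}
    for _ in range(supersteps):
        # "Send" phase: each vertex splits its rank evenly over its out-edges.
        inbox = {v: 0.0 for v in out_edges}
        for v, neighbours in out_edges.items():
            if neighbours:  # vertices without out-edges send nothing
                share = rank[v] / len(neighbours)
                for u in neighbours:
                    inbox[u] += share
        # "Compute" phase: combine the contributions received by each vertex.
        rank = {v: (1.0 - damping) / n + damping * inbox[v] for v in out_edges}
    return rank


# A tiny symmetric graph (a triangle): every edge appears in both directions.
graph = {0: [1, 2], 1: [0, 2], 2: [0, 1]}
ranks = pagerank(graph)
```

In a real BSP job the two phases would be separated by a barrier synchronisation, and the messages would travel between tasks rather than through a shared dictionary.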

Run PageRank on a Hama Cluster

First, generate a symmetric adjacency matrix using the gen command.

  % bin/hama jar hama-examples-0.x.0.jar gen fastgen -v 100 -e 10 -o randomgraph -t 2

This creates a graph with 100 nodes and 1,000 edges and stores it on HDFS as sequence files in 2 partitions. You can adjust the number of partitions and tasks to fit your cluster. Then run PageRank using:

  % bin/hama jar hama-examples-0.x.0.jar pagerank -i randomgraph -o pagerankresult -t 4
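To make the numbers from the generation step concrete, here is a small plain-Python sketch. It assumes that `-v` is the vertex count, `-e` the number of edges per vertex, and `-t` the number of partitions, which is what the figures in the text suggest (100 × 10 = 1,000 edges). The helpers `generate_graph` and `partition` are hypothetical and are not how `fastgen` is implemented. The modulo partitioning is only one possible scheme, and the sketch does not enforce symmetry.

```python
import random


def generate_graph(num_vertices, edges_per_vertex, seed=0):
    """Build a random adjacency list with a fixed number of out-edges per vertex."""
    rng = random.Random(seed)
    graph = {}
    for v in range(num_vertices):
        # Pick distinct neighbours, excluding the vertex itself.
        others = [u for u in range(num_vertices) if u != v]
        graph[v] = rng.sample(others, edges_per_vertex)
    return graph


def partition(graph, num_partitions):
    """Assign each vertex (with its edge list) to a partition by vertex id."""
    parts = [{} for _ in range(num_partitions)]
    for v, neighbours in graph.items():
        parts[v % num_partitions][v] = neighbours
    return parts


graph = generate_graph(100, 10)   # mirrors -v 100 -e 10
parts = partition(graph, 2)       # mirrors -t 2
total_edges = sum(len(neighbours) for neighbours in graph.values())
```

Each partition here holds 50 vertices together with their edge lists, so a task can process its own vertices locally and only needs to communicate for edges that cross partitions.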