
This document assumes that you have already installed a Hama cluster and tested it with some of the examples.

PageRank

  • Uses the PageRank algorithm described in the Google Pregel paper

  • Introduces partitioning and collective communication
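To make the algorithm concrete, here is a minimal single-process sketch of PageRank written in the superstep style of the Pregel paper. It is not Hama's implementation: the function name `pagerank`, the dictionary-based graph, and the handling of vertices without outgoing edges are all assumptions made for illustration. The damping factor 0.85 and the 30 supersteps follow the example in the Pregel paper.

```python
def pagerank(adjacency, damping=0.85, supersteps=30):
    """Illustrative PageRank; `adjacency` maps each vertex to its out-neighbours."""
    n = len(adjacency)
    # Every vertex starts with an equal share of the total rank.
    ranks = {v: 1.0 / n for v in adjacency}

    for _ in range(supersteps):
        # "Messages" each vertex receives during this superstep.
        inbox = {v: 0.0 for v in adjacency}
        dangling = 0.0
        for v, neighbours in adjacency.items():
            if neighbours:
                # A vertex splits its current rank evenly over its out-edges.
                share = ranks[v] / len(neighbours)
                for u in neighbours:
                    inbox[u] += share
            else:
                # Assumption: rank of a vertex with no out-edges is spread
                # uniformly over all vertices so the total stays at 1.
                dangling += ranks[v]

        # Each vertex combines its incoming messages into a new rank.
        ranks = {
            v: (1.0 - damping) / n + damping * (inbox[v] + dangling / n)
            for v in adjacency
        }
    return ranks


if __name__ == "__main__":
    graph = {"a": ["b", "c"], "b": ["c"], "c": ["a"]}
    for vertex, rank in sorted(pagerank(graph).items()):
        print(vertex, round(rank, 4))
```

In a distributed run the inner loop over vertices is split across tasks, and the `inbox` updates become messages exchanged between tasks at the end of each superstep.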

Run PageRank on Hama Cluster

First of all, generate a symmetric adjacency matrix using the gen command.

  % bin/hama jar hama-examples-0.x.0.jar gen fastgen -v 100 -e 10 -o randomgraph -t 2

This creates a graph with 100 nodes and 1K edges and stores it on HDFS as 2 partitions in sequence file format. You can adjust the number of partitions and tasks to fit your cluster. Then, run PageRank using:

  % bin/hama jar hama-examples-0.x.0.jar pagerank -i randomgraph -o pagerankresult -t 4
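The numbers used in the gen step can be checked with a small sketch. It assumes that `-e 10` means 10 edges per vertex, which is the reading consistent with 100 nodes giving 1K edges, and it assumes a simple modulo partitioner. Hama's actual partitioner may assign vertices differently; the helper names below are hypothetical.

```python
def fastgen_edge_count(vertices, edges_per_vertex):
    """Total edges, assuming every vertex gets the same number of edges."""
    return vertices * edges_per_vertex


def partition_of(vertex_id, num_partitions):
    """Assumed partitioning rule: vertex id modulo the partition count."""
    return vertex_id % num_partitions


def partition_sizes(vertices, num_partitions):
    """Count how many vertices land in each partition under the rule above."""
    sizes = [0] * num_partitions
    for vertex_id in range(vertices):
        sizes[partition_of(vertex_id, num_partitions)] += 1
    return sizes


if __name__ == "__main__":
    print(fastgen_edge_count(100, 10))  # edges for -v 100 -e 10
    print(partition_sizes(100, 2))      # vertices per partition for -t 2
    print(partition_sizes(100, 4))      # vertices per task for -t 4
```

Under these assumptions the vertices are spread evenly, so raising the task count mainly reduces the number of vertices each task has to process per superstep.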

PageRank (last edited 2015-06-14 23:26:11 by edwardyoon)