You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 569 Next »

TableOfContents(4)

This effort is still a "work in progress". Please feel free to add comments. BRBut please make the content less visible by using smaller fonts. – Edward J. Yoon


Overview

This is intended to explain and illustrate the concept of Hama. There are two main parts:

  • How to store the matrices?
  • How to perform matrix operations using MapReduce?

Building Block

[http://wiki.apache.org/hama-data/attachments/Architecture/attachments/block.png]

Store Dense/Sparse Matrices

To store the matrices, Hama use a [http://hadoop.apache.org/hbase/ Hbase] – Matrices are basically tables. They are ways of storing numbers and other things. Typical matrix has rows and columns. Actually called a 2-way matrix because it has two dimensions. For example, you might have respondents-by-attitudes. Of course, you might collect the same data on the same people at 5 points in time. In that case, you either have 5 different 2-way matrices, or you could think of it as a 3-way matrix, that is respondent-by-attitude-by-time.

Just a thought, considering the depleted activity in HBase should we not explore ways to avoid HBase ? --Prasen

  • No labels