[http://incubator.apache.org/hama Hama] (which means "hippopotamus" in Korean) is a distributed scientific package on Hadoop for massive matrix and graph data. It is currently in incubation with Apache. The main goal of Hama is to provide computational tools for data-intensive scientific and industrial areas. It consists of two packages: the matrix package and the graph package.

The matrix package is a library of matrix operations on a Map/Reduce framework for large-scale numerical analysis and data mining tasks that need computation-intensive matrix operations such as matrix inversion (e.g., linear regression, PCA, and SVM). It will be useful for many scientific applications, e.g., physics computations, linear algebra, computational fluid dynamics, statistics, graphic rendering, and many more.
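
To make the idea of a matrix operation on a Map/Reduce framework concrete, here is a minimal in-memory sketch of matrix multiplication split into a map phase and a reduce phase. It is not Hama's API: the function names (`map_phase`, `reduce_phase`, `matmul_mapreduce`) are hypothetical, and the whole computation runs in a single Python process rather than on Hadoop.

```python
from collections import defaultdict


def map_phase(a, b):
    """Emit ((i, j), partial_product) for every term of C = A x B."""
    for i, row in enumerate(a):
        for k, a_ik in enumerate(row):
            for j, b_kj in enumerate(b[k]):
                # Each pair contributes one term to output cell (i, j).
                yield (i, j), a_ik * b_kj


def reduce_phase(pairs):
    """Sum all partial products that share the same output cell."""
    sums = defaultdict(int)
    for key, value in pairs:
        sums[key] += value
    return dict(sums)


def matmul_mapreduce(a, b):
    """Multiply two dense matrices given as lists of rows."""
    cells = reduce_phase(map_phase(a, b))
    return [[cells[(i, j)] for j in range(len(b[0]))] for i in range(len(a))]
```

In a real cluster the map output would be shuffled by key to the reducers; here a dictionary plays that role.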

[http://incubator.apache.org/GraphPackage The graph package], called [http://incubator.apache.org/GraphPackage Angrapa], is a large-scale graph data management framework for analytical processing. It is still an ongoing project. It will employ massive parallelism on Hadoop and aims to scale to terabytes or petabytes of graph data. Angrapa will be used in a variety of scientific and industrial areas that need to process large-scale graph data, such as data mining, machine learning, information retrieval, bioinformatics, and social networks.

  • Scientific simulation and modeling
    • Matrix-vector/[:MatrixMultiply:matrix-matrix multiply]
    • Solving linear systems
    • [:GraphAndMatrices: Scientific graphs]
    • Scientific Business Intelligence
  • Information retrieval
    • Sorting
    • Finding [:EigenValuesAndEigenVectors: eigenvalues and eigenvectors]
  • Computer graphics and computational geometry
    • Matrix multiply
    • Computing the matrix determinant

General Information

  • [http://incubator.apache.org/hama Hama Homepage]
  • Hama [:Architecture: Architecture] and 0.1 Plans -- Work in progress
  • [:DSL: Hama DSL] (Domain Specific Language) in Groovy -- Work in progress
  • [:Shell: Hama Shell] -- Work in progress
  • [:Presentations: Presentations] and [:Articles: Articles] about Hama
  • [:GettingStarted: Getting Started with Hama]
  • [:MailingLists: Hama Mailing Lists]
  • [:IRC: Hama IRC Channel]
  • Hama [:PerformanceEvaluation: Performance Evaluation]
  • [:PoweredBy: PoweredBy], a list of sites and applications powered by Hama

User Documentation

  • Examples
    • [:MatMult: Matrix Multiplication]
    • Graph representation using Hama
      • [:BFS: Breadth-First Search] using Hama

Developer Documentation

Hama is a distributed computing framework based on BSP (Bulk Synchronous Parallel) computing techniques for massive scientific computations (e.g., matrix, graph, and network algorithms).
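
As a rough illustration of the BSP model, the sketch below simulates two supersteps in a single process: each peer computes locally and sends a message, a barrier delivers all messages, and one peer then combines them. The function name `bsp_sum` and the choice of peer 0 as the collector are assumptions made for this example and do not come from Hama's API.

```python
def bsp_sum(partitions):
    """Sum numbers spread across peers using two simulated BSP supersteps."""
    if not partitions:
        return 0

    num_peers = len(partitions)
    inboxes = [[] for _ in range(num_peers)]

    # Superstep 1: every peer computes a local sum and queues a message
    # addressed to peer 0. Nothing is delivered yet.
    outgoing = []
    for peer_id, data in enumerate(partitions):
        outgoing.append((0, sum(data)))

    # Barrier synchronization: messages become visible only after all
    # peers have finished the superstep.
    for destination, payload in outgoing:
        inboxes[destination].append(payload)

    # Superstep 2: peer 0 combines the partial sums it received.
    return sum(inboxes[0])
```

The point of the barrier is that no peer reads a message from the current superstep before every peer has finished sending, which keeps each superstep deterministic.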

Machine Learning

New Sub-project Proposals

  • Guidelines for Sub-project Proposal
  • [:HowToContribute: How to contribute]
  • [:Jira: Jira] usage guidelines
  • [:BecomingDeveloper: Becoming a developer]
  • [:HowToCommit: How to commit]