Hadoop Streaming Alternatives
The following libraries and toolkits let you run streaming programs in your preferred language. Some are a layer on top of HadoopStreaming; others take a completely different approach to running programs written in that language on Hadoop.
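For context, here is a minimal sketch of the kind of plain streaming job that these libraries wrap or replace. It assumes the usual streaming convention of reading lines from stdin and writing tab-separated key/value records to stdout, with the reducer receiving its input sorted by key. The function names, the word-count task, and the "map"/"reduce" command-line argument are illustrative choices, not part of any library listed here.

```python
import sys
from itertools import groupby


def map_lines(lines):
    """Mapper: emit one 'word<TAB>1' record per word in the input lines."""
    for line in lines:
        for word in line.split():
            yield f"{word}\t1"


def reduce_lines(lines):
    """Reducer: sum the counts per word; input must already be sorted by key."""
    pairs = (line.rstrip("\n").split("\t", 1) for line in lines)
    for word, group in groupby(pairs, key=lambda kv: kv[0]):
        yield f"{word}\t{sum(int(count) for _, count in group)}"


# Hypothetical convention: the first argument selects the step to run.
if __name__ == "__main__" and sys.argv[1:] in (["map"], ["reduce"]):
    step = map_lines if sys.argv[1] == "map" else reduce_lines
    for record in step(sys.stdin):
        print(record)
```

Saved as, say, wc.py (a hypothetical file name), the job can be tried locally without Hadoop by piping the map step through sort into the reduce step: cat input.txt | python wc.py map | sort | python wc.py reduce. The toolkits below mostly exist to hide this record parsing and grouping behind a higher-level API.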
Python
- Dumbo https://github.com/klbostee/dumbo/wiki
- Pydoop http://sourceforge.net/apps/mediawiki/pydoop/index.php?title=Main_Page
- Happy http://code.google.com/p/happy/
- mrjob http://packages.python.org/mrjob/
Ruby
- Wukong http://mrflip.github.com/wukong/
- MRToolKit http://code.google.com/p/mrtoolkit/
Further reading on these options: http://johanharjono.com/archives/735