Cloud9

A MapReduce Library for Hadoop

Welcome to the homepage for Cloud9, a MapReduce library for Hadoop designed to serve as both a teaching tool and a repository for code that may be broadly useful for a variety research problems in human language technology (information retrieval, natural language processing, etc.). Development of this code base began in late October 2007, so there isn't much here yet... However, it is available via anonymous Subversion checkout. Like Hadoop itself, Cloud9 is distributed under the Apache License.

The University of Maryland is one of six universities that's part of the IBM/Google cloud computing initiative. Ongoing efforts at Maryland include a cloud computing course in Spring 2008 and application of this technology to various research problems.

Quick Links

Content Pages

Related Projects

  • Cascading: pipe assembly patterns for Hadoop
  • Mahout: machine learning library on Hadoop
  • Pig: data analysis platform on top of Hadoop
  • HBase: an open source implementation of Google's BigTable
  • Hypertable: another open source implementation of Google's BigTable

Subversion Access

Explanation why the library is split across two separate repositories.

This page, first created: 27 Oct 2007; last updated: Creative Commons: Attribution-Noncommercial-Share Alike 3.0 United States Valid XHTML 1.0! Valid CSS!