Hadoop 1.0.3 installed, including Hive 0.7 and the latest Mahout and HBase
The Stern Center for Research Computing has recently upgraded Hadoop and all of its related tools to the latest releases.
These include Hadoop, Hive, Pig, HBase, Mahout, and Pegasus.
These tools form the basis of the Stern “big data” infrastructure, which can be used to handle very large or complex data sets that are not amenable to more traditional data storage and analysis techniques. Several pilot projects are already underway, and we are looking for more. We expect to have a SAS-Hive link soon, so that SAS users can store their data in Hive, manipulate it with HQL, and then bring it into SAS for processing.
See a member of the research computing staff for more information.