Announcements
-
Hadoop 1.0.3 installed, including hive 0.7 and the latest mahout and hbase
The Stern Center for Research Computing has recently upgraded hadoop and all of it’s related tools to the latest releases. These include hadoop, hive, pig, hbase, mahout and pegasus. These tools form the basis for the Stern “big data” infrastructure, which can be used to handle very large or complex data sets that are not [...]
-
Links for SAS users
UCLA SAS materials https://www.ats.ucla.edu/stat/SAS/sk/default.htm Is a complete intro and overview to SAS. It has examples, videos, data sets etc. A good way to both get started with SAS, as well as find advanced techniques.
-
Running hadoop, hive and mahout at the Stern Center for Research Computing
First, you must have your Stern userid enabled for hadoop, and a hadoop user created for your userid. To do that, please send an email to research@stern.nyu.edu, or call the help desk at 212-998-0180 and create a ticket for research computing. You should be able to run hadoop map-reduce jobs, hive and mahout. To start, [...]
-
Hadoop at Stern
Research Computing is now piloting a small hadoop cluster. If successful, we will try to get funds to enlarge it. Currently there are 6 processing nodes with about 16 cores and about 1TB of disk. In addition to hadoop and map-reduce, hive and mahout (0.5) are also installed. To use hadoop, please look at these [...]