Archive | Hadoop RSS feed for this archive

Hadoop – some basic setup

For those of you coming along on this journey, I want to take a quick step back.  Rather than assuming what you do and don’t know a little background to how to make this installation go. This post is not really “interesting findings” but rather a how-to for the install process. As I explained in […]

Leave a Comment Continue Reading →

Hadoop/Hive a few lessons learned

It has been a few days since the last set of posts, and quite honestly did not want to leave it hanging even this long, so I wanted to give a brief update to (at least help) cut-off some of the frustrations I have faced for those of you attempting the same path through the […]

Leave a Comment Continue Reading →

Hadoop +1 (add a node that is…)

This one gets a little finicky depending on you configuration, and how much horsepower you have available to you.  If you started of with my first post, and built a VM … ideally … you made a clone of the host once you had Hadoop running, which will make this “easier”.  You are quickly getting […]

2 Comments Continue Reading →

Single Cluster Hadoop – from Zero to Hadoop

Before I get into the installation… what is Hadoop anyway?  Normally I don’t like the “cut-past” approach of blogging, but in this case I make an exception from the Apache Docs: “The Hadoop Distributed File System (HDFS) is a distributed file system designed to run on commodity hardware. It has many similarities with existing distributed […]

4 Comments Continue Reading →