Tag Archives: HDFS

Hadoop – some basic setup

For those of you coming along on this journey, I want to take a quick step back. Rather than assuming what you do and don’t know, here is a little background on how to make this installation go. This post is not really “interesting findings” but rather a how-to for the install process. As I explained in […]


Hadoop +1 (add a node that is…)

This one gets a little finicky depending on your configuration and how much horsepower you have available. If you started off with my first post and built a VM … ideally … you made a clone of the host once you had Hadoop running, which will make this “easier”. You are quickly getting […]


Single Cluster Hadoop – from Zero to Hadoop

Before I get into the installation… what is Hadoop anyway? Normally I don’t like the “cut-paste” approach to blogging, but in this case I’ll make an exception and quote the Apache docs: “The Hadoop Distributed File System (HDFS) is a distributed file system designed to run on commodity hardware. It has many similarities with existing distributed […]
