Author Archives | stefbauer

Hadoop – some basic setup

For those of you coming along on this journey, I want to take a quick step back. Rather than assuming what you do and don’t know, here is a little background on how to make this installation go. This post is not really “interesting findings” but rather a how-to for the install process. As I explained in […]


Hadoop/Hive a few lessons learned

It has been a few days since the last set of posts, and quite honestly I did not want to leave it hanging even this long, so I wanted to give a brief update to (at least help) cut off some of the frustrations I have faced for those of you attempting the same path through the […]


Hadoop +1 (add a node that is…)

This one gets a little finicky depending on your configuration, and how much horsepower you have available to you. If you started off with my first post, and built a VM … ideally … you made a clone of the host once you had Hadoop running, which will make this “easier”. You are quickly getting […]


From Hadoop to Hive

This is the next step after you have completed the initial setup to get Hadoop running. It will walk you through the steps to get Hive running; everything in the prior post is a prerequisite to this setup. Just as with my prior post, this is nothing “groundbreaking”, but hopefully will provide a consolidated place […]


Single Cluster Hadoop – from Zero to Hadoop

Before I get into the installation… what is Hadoop anyway? Normally I don’t like the “cut-and-paste” approach to blogging, but in this case I will make an exception and quote the Apache docs: “The Hadoop Distributed File System (HDFS) is a distributed file system designed to run on commodity hardware. It has many similarities with existing distributed […]


Amazon – Give it a try

A little more than a week ago Amazon made some announcements on a topic near and dear to my heart: warehousing. Only this time, as you would expect from Amazon, it is moving to the cloud… with the announcement of their “Redshift” product. I have been intrigued by some of the advantages that cloud computing provides, however, from […]


You need a Zetta what?

Data, data, everywhere! I have been doing some reading on data scale, and quite honestly it reminds me a lot of the kinds of feelings I had while taking an astronomy class in college… the scale is simply too large to get your head around. Before we get too far into this, try to imagine […]


Change is in the air

Fall has come (and almost gone) … turkeys have been eaten, and we are heading headlong into the holiday season, so part of this post is an early “resolution” of sorts. For a variety of reasons (that I am going to skip the exact specifics on for now but will become clear over the coming […]


Denali Dependency Services

Introduction: I have been on the “What’s New In CTP1 for Denali” speaking “tour” (if you count 2 as a tour) for the past week. What I can say from my interactions, and the feedback from the presentation, is that there is some real excitement for the new release (although dauntingly far away for some people). In […]


PASS – Part 2

Introduction: I have to say that this has not turned into the “series” that I projected so optimistically as I departed for the PASS summit. It may still be a series, who knows, but for now, as I sit in the airport waiting for my flight back home, I figured I would get a few thoughts […]
