Distributed Computing on GENI: Hadoop in a Slice

Description

GENI is an excellent tool for experimenting with distributed computing. Hadoop is a popular framework for the distributed storage and processing of large datasets. In this video, Hadoop is used to demonstrate how to deploy scalable distributed applications across the GENI infrastructure.

The video walks participants through creating a slice composed of three virtual machines that together form a Hadoop cluster. The tutorial covers creating the slice, observing its properties, and running a Hadoop example that sorts a large dataset. Upon completing the exercise, participants will be able to experiment with scaling the Hadoop sorting application and should be able to apply their new skills to deploying other distributed applications.

Recorded August 26, 2016