[DEPRECATED] CentOS 6: Install Single-node Hadoop from Cloudera CDH

Overview

Guide for setting up a single-node Hadoop on CentOS using the Cloudera CDH repository.

Versions

  • CentOS 6.4
  • Oracle Java JDK 1.6
  • CDH 4
  • Hadoop 0.2

Prerequisties

Install

1. Download the yum repo file:

2. Install

Configure

1. Format the name node

Output:

2. Start namenode/datanode services

3. Optional: Start services on boot

4. Create directories

5. Create map/reduce directories

6. Start map/reduce services

7. Optional: Start services on boot

8. Optional: Create a home directory on the hdfs for the current user

9. Edit /etc/profile.d/hadoop.sh

10. Load into session

Test

1. Get a directory listing from hadoop hdfs

Output:

Note: results will vary based on user directories created

2. Navigate browser to http://<hostname>:50070
Hadoop NameNodeĀ localhost:8020 - Google Chrome_024

4. Navigate browser to http://<hostname>:50030
localhost Hadoop Map-Reduce Administration - Google Chrome_023

3. Run one of the examples

Output:

Sources

Leave a Reply

Your email address will not be published. Required fields are marked *

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code class="" title="" data-url=""> <del datetime=""> <em> <i> <q cite=""> <strike> <strong> <pre class="" title="" data-url=""> <span class="" title="" data-url="">