[Hadoop] Installing Hadoop for Single Node Cluster

1 분 소요

SSH Setup:

  • Install SSH and rsync
    • (NOTE: Secure Shell (SSH) is a protocol for cryptographic network for operating network services securely over an unsecured network.)
    • (NOTE: rsync (Remote Sync) is a remote and local file synchronization tool. It uses an algorithm that minimizes the amount of data copied by only moving the portions of files that have changed.)
    • (NOTE: A passphrase is a sequence of words or other text used to control access to a computer system, program or data. A passphrase is similar to a password in usage, but in generally required for added security.)

      $ sudo apt-get install ssh
      $ sudo apt-get install rsync

    • SSH without Passphrase setup:

      $ ssh-keygen -t rsa

      Press ‘Enter’ three times without entering anything.

      $ cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys
      $ ssh localhost`

How to Download Hadoop?

How to install Hadoop?

  • Now open up the terminal (Press ctrl + option + T) and create a directory called ‘hadoop’ at the home directory

    $ mkdir hadoop
    $ sudo edit /etc/profile

  • Edit the /etc/profile and add these lines:

    HADOOP_INSTALL=/home/eliotjang/hadoop
    PATH=$PATH:$HADOOP_INSTALL/bin
    export PATH $ . /etc/profile
    $ reboot

  • Copy the downloaded Hadoop (hadoop-2.4.1.tar.gz) into the hadoop directory, and go to this directory using terminal, then un-tar the downloaded tar.gz file.
  • (NOTE: you should changed name eliotjang to your own name. check $ cd ./home and $ ls. you can know your own folder name.)
  • Follow the lines to go to the hadoop directory from home directory.

    $ tar xzf hadoop $ mv hadoop-2.4.1 hadoop $ rm hadoop-2.4.1.tar.gz

댓글남기기