These setup instructions are for Ubuntu Linux.
- Java 1.7 must be installed
- SSH must be installed and sshd must be running in order to use the Hadoop scripts that manage remote Hadoop daemons
# To install ssh and rsync
$ sudo apt-get install ssh
$ sudo apt-get install rsync
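A quick way to confirm the prerequisites are in place is to check the Java version and the ssh service (the exact output depends on your installation):
$ java -version
$ service ssh status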
To get a Hadoop distribution, download a recent stable release from one of the Apache Hadoop mirrors. This example uses Hadoop 2.7.1 and HBase 1.1.1. To get the HBase distribution, download from an Apache HBase mirror. Extract the .tar.gz files:
$ tar -xzf hadoop-2.7.1.tar.gz
$ sudo mv hadoop-2.7.1 /usr/local/hadoop
$ tar -xzf hbase-1.1.1-bin.tar.gz
$ sudo mv hbase-1.1.1 /usr/local/hbase
- Create a user group and user
- Set up the Hadoop and HBase home directories
# create user group and create user
$ sudo addgroup hadoop
$ sudo adduser hduser --ingroup hadoop
# Provide sudo access to hduser
$ sudo adduser hduser sudo
$ su hduser
# change the owner of hadoop and hbase location
$ sudo chown -R hduser:hadoop /usr/local/hadoop
$ sudo chown -R hduser:hadoop /usr/local/hbase
# Set HADOOP_HOME and PATH, open the ~/.bashrc file
$ nano ~/.bashrc
# Add the following lines at the end of the file
#HADOOP VARIABLES START
export JAVA_HOME=/opt/Oracle_Java/jdk1.7.0_75/
export PATH=$PATH:$JAVA_HOME/bin
export HADOOP_HOME=/usr/local/hadoop
export HBASE_HOME=/usr/local/hbase
export HADOOP_MAPRED_HOME=$HADOOP_HOME
export HADOOP_COMMON_HOME=$HADOOP_HOME
export HADOOP_HDFS_HOME=$HADOOP_HOME
export YARN_HOME=$HADOOP_HOME
export HADOOP_COMMON_LIB_NATIVE_DIR=$HADOOP_HOME/lib/native
export PATH=$PATH:$HADOOP_HOME/sbin:$HADOOP_HOME/bin:$HBASE_HOME/bin
export HADOOP_INSTALL=$HADOOP_HOME
#HADOOP VARIABLES END
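After saving the file, reload it so the variables take effect in the current shell:
$ source ~/.bashrc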
- Make changes in the hadoop-env.sh and hbase-env.sh files
# setup java home path in hadoop-env.sh
$ nano /usr/local/hadoop/etc/hadoop/hadoop-env.sh
# change the java home
export JAVA_HOME=/opt/Oracle_Java/jdk1.7.0_75/
# setup java home path in hbase-env.sh
$ nano /usr/local/hbase/conf/hbase-env.sh
# change the java home
export JAVA_HOME=/opt/Oracle_Java/jdk1.7.0_75/
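With the environment variables and JAVA_HOME in place, a simple sanity check is to print the versions; both commands should report the releases extracted above:
$ hadoop version
$ hbase version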
- Create namenode and datanode storage locations
- Make changes in the hdfs-site.xml file
$ mkdir -p /home/hduser/hadoop_store/hdfs/namenode
$ mkdir -p /home/hduser/hadoop_store/hdfs/datanode
#Changes in hdfs-site.xml
$ nano /usr/local/hadoop/etc/hadoop/hdfs-site.xml
# Changes in configuration section
<configuration>
  <property>
    <name>dfs.replication</name>
    <value>1</value>
  </property>
  <property>
    <name>dfs.namenode.name.dir</name>
    <value>file:///home/hduser/hadoop_store/hdfs/namenode</value>
  </property>
  <property>
    <name>dfs.datanode.data.dir</name>
    <value>file:///home/hduser/hadoop_store/hdfs/datanode</value>
  </property>
</configuration>
- Changes in hbase-site.xml
$ nano /usr/local/hbase/conf/hbase-site.xml
<configuration>
  <property>
    <name>hbase.rootdir</name>
    <value>file:///home/hduser/hbase</value>
  </property>
  <property>
    <name>hbase.zookeeper.property.dataDir</name>
    <value>/home/hduser/zookeeper</value>
  </property>
</configuration>
- Start hdfs and hbase
# To Start hdfs
$ start-dfs.sh
# To Stop hdfs
$ stop-dfs.sh
# To start hbase
$ start-hbase.sh
# To stop hbase
$ stop-hbase.sh
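One way to confirm the daemons are running is the jps tool that ships with the JDK; the exact list depends on what you started (roughly NameNode, DataNode, and SecondaryNameNode after start-dfs.sh, plus HMaster after start-hbase.sh):
$ jps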
- Run the hbase shell
$ hbase shell
hbase(main):001:0>
- Create a table
hbase(main):001:0> create 'test_table','column_family'
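As a smoke test, put a cell into the new table and scan it back ('row1' and 'value1' are arbitrary example values):
hbase(main):002:0> put 'test_table', 'row1', 'column_family:qualifier', 'value1'
hbase(main):003:0> scan 'test_table'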
- Changes in etc/hadoop/core-site.xml (this and the following steps configure Hadoop for pseudo-distributed operation)
<configuration>
  <property>
    <name>fs.defaultFS</name>
    <value>hdfs://localhost:9000</value>
  </property>
</configuration>
- Changes in etc/hadoop/hdfs-site.xml
<configuration>
  <property>
    <name>dfs.replication</name>
    <value>1</value>
  </property>
</configuration>
- Set up passphraseless ssh
If you cannot ssh to localhost without a passphrase, execute the following commands:
$ ssh-keygen -t dsa -P '' -f ~/.ssh/id_dsa
$ cat ~/.ssh/id_dsa.pub >> ~/.ssh/authorized_keys
$ export HADOOP_PREFIX=/usr/local/hadoop
$ ssh localhost
- Stop HBase and configure it for pseudo-distributed mode. Changes in hbase-site.xml:
<property>
  <name>hbase.cluster.distributed</name>
  <value>true</value>
</property>
<property>
  <name>hbase.rootdir</name>
  <value>hdfs://localhost:9000/hbase</value>
</property>
- Format the file system
$ hdfs namenode -format
- Start name node and data node
$ start-dfs.sh
The Hadoop daemon log output is written to the $HADOOP_LOG_DIR directory (defaults to $HADOOP_HOME/logs).
- Browse the web interface for the NameNode; by default it is available at http://localhost:50070/
- Create the hbase directory in HDFS
$ hdfs dfs -mkdir /hbase
$ hdfs dfs -chown hduser:hadoop /hbase
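You can verify the directory and its owner before starting HBase:
$ hdfs dfs -ls /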
- Run the hbase shell
$ hbase shell
hbase(main):001:0>
- Create a table
hbase(main):001:0> create 'test_table','column_family'
"Thrift is a software framework for scalable cross-language services development. It combines a software stack with a code generation engine to build services that work efficiently and seamlessly between C++, Java, Python, PHP, Ruby, Erlang, Perl, Haskell, C#, Cocoa, Smalltalk, and OCaml."
- Download Thrift from the Apache Thrift download page
- Extract the tar file and install
$ tar -xzf thrift-0.9.2.tar.gz
$ mv thrift-0.9.2 /tmp/thrift
$ sudo apt-get install libboost-dev libboost-test-dev libboost-program-options-dev libboost-system-dev libboost-filesystem-dev libevent-dev automake libtool flex bison pkg-config g++ libssl-dev
$ cd /tmp/thrift/
$ ./configure
$ make
$ sudo make install
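If the build and install succeeded, the Thrift compiler should be on the path and report its version:
$ thrift -version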
- Run the HBase Thrift2 server
$ hbase thrift2 start
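By default the Thrift2 server listens on port 9090; assuming you have not changed that in hbase-site.xml, one way to check that it is up is:
$ netstat -tln | grep 9090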