How to install Hortonworks Sandbox with Data Platform in Microsoft Azure? – Useful Guide – Part 2


Hello! In the previous tutorial we created a Hortonworks Sandbox virtual machine in Azure (Install Hortonworks Sandbox with Data Platform). In this tutorial I will show you how to connect to this VM and how to use the Hortonworks stack.

Introduction

Hortonworks

Hortonworks is a company that offers a variety of solutions related to Apache Hadoop, an open source big data technology, as well as its own products and services for storing, processing, and analyzing large data sets. These offerings include the Hortonworks Data Platform (HDP), a distribution of Hadoop and related tools, and the Hortonworks Sandbox, a pre-configured environment for learning, developing, and testing Hadoop applications. In addition, the company provides professional services, training, and support to assist organizations in adopting and utilizing Hadoop and related technologies. Hadoop is commonly used for tasks such as data mining, machine learning, and log processing.

How to install Hortonworks Sandbox with Data Platform in Microsoft Azure?

Add additional inbound port rules

First of all, we must add additional inbound port rules to the VM. This will allow us to connect to Ambari or Zeppelin, for instance. Open the previously created VM and click the “Networking” tab. On the right side you will find “Add inbound port rule”.

We will create two inbound port rules:

  1. Port 8080 – for Ambari
  2. Port 8888 – for Hortonworks Dashboard

Once the rules are created, the Azure notifications should show the task as completed.


Now, in the “Networking” tab, your new inbound rules should also be visible in the list.
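If you prefer the command line over the portal, the same two rules can probably be created with the Azure CLI. The sketch below is only an illustration: the resource group and VM names are placeholders you must replace with your own, and the priorities 900 and 901 are arbitrary values that I assume do not clash with rules you already have.

```shell
# Open the two sandbox ports on an Azure VM with the Azure CLI.
# Arguments: resource group name, VM name (both are your own values).
open_sandbox_ports() {
    resource_group=$1
    vm_name=$2
    # Every rule in a network security group needs a unique priority,
    # so each port gets a different one (900 and 901 are assumptions).
    az vm open-port --resource-group "$resource_group" --name "$vm_name" \
        --port 8080 --priority 900   # Ambari
    az vm open-port --resource-group "$resource_group" --name "$vm_name" \
        --port 8888 --priority 901   # Hortonworks Dashboard
}

# Example call (hypothetical names):
# open_sandbox_ports my-resource-group my-sandbox-vm
```

You need to be logged in with az login before calling the function.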

Connect to Hortonworks Dashboard and Ambari

Open your favourite browser and go to the following URLs. The pattern is simple: http://<host_ip_address>:<port>

  1. http://40.115.47.144:8888/ – Hortonworks Dashboard
  2. http://40.115.47.144:8080/ – Ambari
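Before opening the browser you can check from a terminal whether the ports answer at all. This is a minimal sketch, assuming curl is installed; the function names sandbox_url and check_sandbox_port are my own, and the IP address is the example one used in this post.

```shell
# Build a sandbox URL following the http://<host_ip_address>:<port> pattern.
sandbox_url() {
    printf 'http://%s:%s/\n' "$1" "$2"
}

# Print the HTTP status code returned by a sandbox port.
# curl prints 000 when it cannot connect within the timeout.
check_sandbox_port() {
    curl --silent --output /dev/null --max-time 10 \
        --write-out '%{http_code}\n' "$(sandbox_url "$1" "$2")"
}

# Example calls:
# check_sandbox_port 40.115.47.144 8888   # Hortonworks Dashboard
# check_sandbox_port 40.115.47.144 8080   # Ambari
```

If you get 000, the inbound rule is most likely missing or the sandbox has not finished starting yet.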

When you open the Ambari URL, please use maria_dev as both the username and the password.

In the left menu in Ambari you can see many Apache projects listed, such as HDFS, YARN, MapReduce2, Tez, Hive and so on. For now I will not explain the role of each of them or how each works, because that could scare and discourage you at the beginning of your adventure in the big data world. I will cover and explain each of them in the next tutorials, with examples that will help you understand how everything works :).

For now, the most important thing is to see a green light next to each of them.
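The same health information can usually be read from Ambari's REST API, where a running service is reported with the state STARTED. The sketch below rests on two assumptions that I have not verified for your installation: that the cluster is named Sandbox, and that the maria_dev account is allowed to read the service list. The function name is hypothetical.

```shell
# Ask Ambari's REST API for the state of every service in a cluster.
# Arguments: host IP, optional cluster name (default "Sandbox" is an assumption).
ambari_service_states() {
    host=$1
    cluster=${2:-Sandbox}
    curl --silent --user maria_dev:maria_dev \
        "http://${host}:8080/api/v1/clusters/${cluster}/services?fields=ServiceInfo/state"
}

# Example call:
# ambari_service_states 40.115.47.144
```

The response is JSON, so look for the ServiceInfo/state value of each service in the output.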

Connect to Hortonworks Sandbox virtual machine via SSH

You can also connect to the Hortonworks virtual machine via SSH. From Windows you can use PuTTY, for example. If you are using Linux (e.g. Ubuntu) you can just use the terminal.

Set up the connection to your virtual machine using the pattern: <username>@<host_ip_address> (please save the connection for future use).

The username and password are the same ones you set when creating this VM in the previous tutorial.
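On Linux, one way to "save the connection" is an entry in the OpenSSH client config file. Here is a minimal sketch; the function name add_ssh_host and the alias hdp-sandbox are my own hypothetical choices, while pawel and the IP address are the example values from this post.

```shell
# Append a named host entry to an OpenSSH client config file.
# Arguments: alias, username, host IP, optional config path.
add_ssh_host() {
    config=${4:-$HOME/.ssh/config}
    # Make sure the directory exists before appending to the file.
    mkdir -p "$(dirname "$config")"
    printf 'Host %s\n    HostName %s\n    User %s\n' "$1" "$3" "$2" >> "$config"
}

# Example (alias "hdp-sandbox" is a hypothetical name):
# add_ssh_host hdp-sandbox pawel 40.115.47.144
# ssh hdp-sandbox
```

After that, ssh hdp-sandbox is equivalent to typing ssh pawel@40.115.47.144.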

Change the user to root. You will be asked for a password (in this case enter the password for your own user, here pawel).

[pawel@sandbox-host ~]$ sudo su

Next, connect to sandbox-hdp.hortonworks.com, which is exposed at the IP address 172.17.0.2:

[root@sandbox-host pawel]# ssh 172.17.0.2


Type yes to continue the connection attempt. After that you will be informed that you must change the password for the root user. The current password for the root user is simply: hadoop.

The authenticity of host '172.17.0.2 (172.17.0.2)' can't be established.
RSA key fingerprint is SHA256:oCHVVt8XBDItJbjH0XExlhePO93VcXJQGHx5WdiMhLE.
RSA key fingerprint is MD5:41:23:bd:19:8f:59:16:da:40:4c:42:7f:52:6f:e5:f3.
Are you sure you want to continue connecting (yes/no)? yes
Warning: Permanently added '172.17.0.2' (RSA) to the list of known hosts.
root@172.17.0.2's password:
You are required to change your password immediately (root enforced)
Changing password for root.
(current) UNIX password:
New password:
Retype new password:
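The two hops above can possibly be combined into a single command from your own machine with the ProxyJump option (-J) of OpenSSH. This is only a sketch of an alternative to the steps I used: it assumes a client with OpenSSH 7.3 or newer and that the Azure VM allows TCP forwarding, and the function name connect_to_sandbox is hypothetical.

```shell
# Jump through the Azure VM straight into the sandbox as root.
# Arguments: VM username, VM public IP, optional sandbox IP (default 172.17.0.2).
connect_to_sandbox() {
    ssh -J "$1@$2" "root@${3:-172.17.0.2}"
}

# Example call:
# connect_to_sandbox pawel 40.115.47.144
```

You will be asked first for the password of your VM user and then for the root password of the sandbox.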

Once we are logged into the sandbox, we can check whether we are able to list the files in the root folder of HDFS, for instance.

List the files in the root folder of HDFS:

[root@sandbox-hdp ~]# hdfs dfs -ls /

The command should print the directories found at the root of HDFS.
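Beyond listing the root folder, a slightly fuller check is a small round trip: create a directory, upload a file, read it back and clean up. The sketch below is an assumption-laden example, not part of the original setup: the function name and the path /tmp/hdfs_smoke_test are arbitrary, and I assume your user is allowed to write under /tmp in HDFS.

```shell
# A small HDFS round trip: create a directory, upload a file,
# list and read it, then remove the directory again.
# Argument: optional HDFS directory (the default is an arbitrary choice).
hdfs_smoke_test() {
    dir=${1:-/tmp/hdfs_smoke_test}
    local_file=$(mktemp)
    echo "hello hdfs" > "$local_file"
    # Stop at the first failing step and remember its exit status.
    hdfs dfs -mkdir -p "$dir" &&
    hdfs dfs -put -f "$local_file" "$dir/hello.txt" &&
    hdfs dfs -ls "$dir" &&
    hdfs dfs -cat "$dir/hello.txt" &&
    hdfs dfs -rm -r -skipTrash "$dir"
    status=$?
    rm -f "$local_file"
    return "$status"
}

# Example call (run inside the sandbox):
# hdfs_smoke_test
```

If every step succeeds the function returns 0; otherwise it returns the exit status of the step that failed.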

Summary

After this tutorial you should be able to connect to your Hortonworks Sandbox environment.
