Hello! In previous tutorial we created Hortonworks Sandbox virutal machine in Azure (Install Hortonworks Sandbox with Data Platform). In this tutorial I will show you how to connect to this VM and how to use Hortonworks stack.
Table of Contents
Hortonworks is a company that offers a variety of solutions related to Apache Hadoop, an open source big data technology, as well as its own products and services for storing, processing, and analyzing large data sets. These offerings include the Hortonworks Data Platform (HDP), a distribution of Hadoop and related tools, and the Hortonworks Sandbox, a pre-configured environment for learning, developing, and testing Hadoop applications. In addition, the company provides professional services, training, and support to assist organizations in adopting and utilizing Hadoop and related technologies. Hadoop is commonly used for tasks such as data mining, machine learning, and log processing.
Add additional inbound port rules
First of all we must add additional inbound port rules to VM. It will allow us to connect to Ambari or Zeppelin for instance. Please open previously created VM and click to “Networking” tab. In the right side you will find “Add inbound port rule”.
We will create two inboud port rule:
In notifications you should see completed task like in the following screen.
Now in “Networking” tab you should also see that your new inboud rules are also visible in the list.
Connect to Hortonworks Dasboard and Ambari
You can see many Apache projects listed in the left menu in Ambrai as HDFS, YARN, MapReduce2, Tez, Hive and so on… At this moment I will not explain what is the role of each of them and how each works, because you could be scared and discouraged at the beginingat the beginning of your adventure in big data world. I will cover and exmplain each of them in the next tutorials with nice example which will help you to understand how it is everything works :).
At this moment the most important is to see the green light before them.
Connect to Hortonworks Sandbox virtual machine via SSH
Setup connection to your virtual machine using pattern: <username>@<host_ip_address> (please save the connection for the future).
Username and password are the same as in the moment when you was creating this VM in previous tutorial.
[pawel@sandbox-host ~]$ sudo su
Next please connect to sandbox-hdp.hortonworks.com which is exposed by 172.17.0.2 IP address:
[root@sandbox-host pawel]# ssh 172.17.0.2
The authenticity of host '172.17.0.2 (172.17.0.2)' can't be established. RSA key fingerprint is SHA256:oCHVVt8XBDItJbjH0XExlhePO93VcXJQGHx5WdiMhLE. RSA key fingerprint is MD5:41:23:bd:19:8f:59:16:da:40:4c:42:7f:52:6f:e5:f3. Are you sure you want to continue connecting (yes/no)? yes Warning: Permanently added '172.17.0.2' (RSA) to the list of known hosts. firstname.lastname@example.org's password: You are required to change your password immediately (root enforced) Changing password for root. (current) UNIX password: New password: Retype new password:
When we logged into sandbox we can check if we are able to list the files on root folder in HDFS for instance.
List the files in root folder in HDFS:
[root@sandbox-hdp ~]# hdfs dfs -ls /
You should see the output like this in the following picutre:
Could You Please Share This Post? I appreciate It And Thank YOU! :) Have A Nice Day!
We are sorry that this post was not useful for you!
Let us improve this post!
Tell us how we can improve this post?